cs.purdue.edu

Similarity Group-By

Authors: 
Silva, Yasin N.; Aref, Walid G.; Ali, Mohamed H.
Year: 
2009
Venue: 
ICDE

Group-by is a core database operation that is used extensively in OLTP, OLAP, and decision support systems. In many application scenarios, it is required to group similar but not necessarily equal values. In this paper we propose a new SQL construct that supports similarity-based Group-by (SGB). SGB is not a new clustering algorithm, but rather is a practical and fast similarity grouping query operator that is compatible with other SQL operators and can be combined with them to answer similarity-based queries efficiently.

Efficient Private Record Linkage

Authors: 
Yakout, Mohamed; Atallah, Mikhail J.; Elmagarmid, Ahmed K.
Year: 
2009
Venue: 
ICDE

Record linkage is the computation of the associations among records of multiple databases. It arises in contexts like the integration of such databases, online interactions and negotiations, and many others. The autonomous entities who wish to carry out the record matching computation are often reluctant to fully share their data. In such a framework where the entities are unwilling to share data with each other, the problem of carrying out the linkage computation without full data exchange has been called private record linkage.

Syndicate content