Merging the Results of Approximate Match Operations

Authors: Guha, S.; Koudas, N.; Marathe, A.; Srivastava, D.
Year: 2004
Venue: Proceedings of the 30th International Conference on Very Large Data Bases (VLDB 2004)

Data cleaning is an important process that has been at
the center of research interest in recent years. An important
end goal of effective data cleaning is to identify
the relational tuple or tuples that are “most related” to
a given query tuple. Various techniques have been proposed
in the literature for efficiently identifying approximate
matches to a query string against a single attribute
of a relation. In addition to constructing a ranking (i.e.,
ordering) of these matches, the techniques often associate,
with each match, scores that quantify the extent