
Improving Grouped-Entity Resolution using Quasi-Cliques

Authors: On, BW; Elmacioglu, E; Lee, D; Kang, J; Pei, J
Year: 2006
Venue: ICDM

The entity resolution (ER) problem, which identifies duplicate
entities that refer to the same real-world entity, is
essential in many applications. In this paper, we focus
on resolving entities that contain a group of
related elements (e.g., an author entity with a list
of citations, a singer entity with a song list, or an intermediate
result of a SQL GROUP BY query). Such entities, called
grouped-entities, occur frequently in many applications.
Previous approaches to grouped-entity resolution
often rely on textual similarity, and produce a large number
