Training Selection for Tuning Entity Matching

Tue, 01/20/2009 - 10:34 — erhard

Authors:

Köpcke, Hanna; Rahm, Erhard

Year:

2008

Venue:

Proc. VLDB workshop on Quality in Databases and Management of Uncertain Data (QDB/MUD 2008)

URL:

http://www.vldb.org/conf/2008/workshops/WProc_qdbmud/linkage1.pdf

Citations range:

10 - 49

Attachment	Size
Kpcke2008TrainingSelectionforTuningEntityMatching.pdf	307.31 KB

Entity matching is a crucial and difficult task for data integration.
An effective solution strategy typically has to combine several
techniques and to find suitable settings for critical configuration
parameters such as similarity thresholds. Supervised (training-based)
approaches promise to reduce the manual work for
determining (learning) effective strategies for entity matching.
However, they critically depend on training data selection, which
is a difficult problem that has so far mostly been addressed
manually by human experts. In this paper, we propose a training-based
framework called STEM for entity matching and present
different generic methods for automatically selecting training data
to combine and configure several matching techniques. We
evaluate the proposed methods for different match tasks and
small- and medium-sized training sets.
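
To make the notion of a learned similarity threshold concrete, the following is a minimal sketch of how labeled training pairs could be used to choose one. It is not the STEM framework or any method from the paper: the similarity measure (Python's difflib), the F1-based selection rule, the function names, and the sample pairs are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (illustrative choice)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def f1_at(threshold: float, scored: list[tuple[float, bool]]) -> float:
    """F1 score when every pair with score >= threshold is declared a match."""
    tp = sum(1 for s, is_match in scored if s >= threshold and is_match)
    fp = sum(1 for s, is_match in scored if s >= threshold and not is_match)
    fn = sum(1 for s, is_match in scored if s < threshold and is_match)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def best_threshold(scored: list[tuple[float, bool]]) -> float:
    """Pick the observed score that maximizes F1 on the training pairs."""
    candidates = sorted({s for s, _ in scored})
    return max(candidates, key=lambda t: f1_at(t, scored))


# Hypothetical, hand-labeled training pairs: (record A, record B, is_match).
training_pairs = [
    ("Rahm, Erhard", "Rahm, E.", True),
    ("Koepcke, Hanna", "Kopcke, H.", True),
    ("Rahm, Erhard", "Koepcke, Hanna", False),
    ("VLDB 2008", "SIGMOD 2008", False),
]

scored_pairs = [(similarity(a, b), label) for a, b, label in training_pairs]
learned = best_threshold(scored_pairs)
print(f"learned threshold: {learned:.2f}")
```

Which pairs end up in `training_pairs` determines the threshold that is learned, which is why the selection of training data matters for this kind of tuning.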
