almaden.ibm.com

Object Identification Quality

Authors: 
Neiling, M; Jurk, S; Lenz, HJ; Naumann, F
Year: 
2003
Venue: 
Proc. DQCIS Workshop, 2003

Research and industry has tackled the object identification
problem of data integration in many different ways.
This paper presents a framework, that allows the evaluation of
competing approaches. To this end, complexity measures and
data characteristics are introduced, which reflect the hardness
of a given object identification problem. All characteristics can be
estimated by use of simple SQL queries and simple calculations.
Following the principle of benchmark definitions we specify a test
framework. It consists of a test database and its characteristics,

Syndicate content