Key Concepts in the ChoiceMaker 2 Record Matching System

Authors: 
Borthwick, A; Buechi, M; Goldberg, A
Author: 
Borthwick, A
Buechi, M
Goldberg, A
Year: 
2003
Venue: 
Procs. First Workshop on Data Cleaning
URL: 
http://www.reviewmaker.com/content/publications/deliver.php3?filename=20030717_key.pdf
Citations: 
8
Citations range: 
1 - 9
AttachmentSize
Borthwick2003KeyConceptsintheChoiceMaker.pdf34.54 KB

We describe an innovative record matching system called
ChoiceMaker 2 we developed at ChoiceMaker Technologies
(CMT). Firstly, we describe the process by which we use a
machine learning technique known as maximum entropy
modeling to tune the system to the problem at hand. Secondly,
we describe the ClueMakerâ„¢ programming language that is used
to describe record matching characteristics. Thirdly, we describe
our method for testing record matching systems and describe how
our IDE facilitates this process.