Verykios, V. S.; Moustakides, G. V.; Elfeky, M. G. A Bayesian decision model for cost optimal record matching |
2003 |
66 |
Oct06 |
Aizawa, A; Oyama, K A Fast Linkage Detection Scheme for Multi-Source Information Integration |
2005 |
35 |
Nov07 |
Navarro, G A guided tour to approximate string matching |
2001 |
1369 |
May07 |
Low, WL; Lee, ML; Ling, TW A knowledge-based approach for duplicate elimination in data cleaning |
2001 |
88 |
Apr07 |
Bhattacharya, I.; Getoor, L.; A Latent Dirichlet Model for Unsupervised Entity Resolution |
2006 |
144 |
Apr07 |
Jakoniene, V; Rundqvist, D;Lambrix, P A method for similarity-based grouping of biological data |
2006 |
8 |
Mar07 |
Kalashnikov, DV; Mehrotra, S A probabilistic model for entity disambiguation using relationships |
2005 |
16 |
Sep06 |
Nelder, J.A.; Mead, R. A simplex method for function minimization |
1965 |
16651 |
Sep06 |
Barateiro, José; Galhardas, Helena A Survey of Data Quality Tools |
2005 |
28 |
Apr07 |
Fellegi, I.P.; Sunter, A.B. A Theory for Record Linkage |
1969 |
1444 |
Oct06 |
Bilenko, M; Mooney, R; Cohen, W; P Ravikumar, S Adaptive name matching in information integration |
2003 |
339 |
Nov07 |
Winkler, W.E. Advanced methods for record linkage |
1994 |
196 |
Oct06 |
Galhardas, H; Florescu, D; Shasha, D; Simon, E AJAX: an extensible data cleaning tool |
2000 |
175 |
Sep06 |
Monge, A.E.; Elkan, C. An efficient domain-independent algorithm for detecting approximately duplicate database records |
1997 |
364 |
Oct06 |
Snae, C; Diaz, BM An interface for mining genealogical nominal data using the concept of linkage and a hybrid name matching algorithm |
2002 |
9 |
Apr07 |
Ukkonen, E Approximate string-matching with q-grams and maximal matches |
1992 |
|
May07 |
Lee, D; Kang, J; Mitra, P; Giles, CL; On, BW Are Your Citations Clean? New Scenarios and Challenges in Maintaining Digital Libraries |
2006 |
|
Feb07 |
Herbert, KG; Gehani, NH; Piel, WH; Wang, JTL; Wu, CH BIO-AJAX: an extensible framework for biological data cleaning |
2004 |
30 |
Mar07 |
Herbert, KG; Wang, JTL Biological data cleaning: a case study |
2007 |
|
Jun07 |
Lee, M.L.; Hsu, W.; Kothari, V. Cleaning the spurious links in data |
2004 |
40 |
Sep06 |
Bhattacharya, I.; Getoor, L.; Collective Entity Resolution in Relational Data |
2007 |
238 |
Apr07 |
Benjelloun, O.; Garcia-Molina, H.; Gong, H.; Kawai, H; Larson, T.E.; Menestrina, D.; Thavisomboon, S. D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution |
2007 |
32 |
Aug07 |
Benedikt, M.; Bohannon, P.; Bruns, G. Data Cleaning for Decision Support |
2006 |
6 |
Sep06 |
Rahm, Erhard; Do, Hong Hai Data Cleaning: Problems and Current Approaches |
2000 |
778 |
Aug06 |
Cohen, W.W. Data integration using similarity joins and a word-based information representation language |
2000 |
195 |
Oct06 |
Scannapieco, M; Missier, P; Batini, C Data Quality at a Glance |
2005 |
58 |
Apr07 |
Zhu, Yan; Bornhovd, Christof; Buchmann, Alejandro P. Data Transformation for Warehousing Web Data |
2001 |
14 |
Oct06 |
Karger, DR; Jones, W Data unification in personal information management |
2006 |
71 |
Apr07 |
Galhardas, H; Florescu, D; Shasha, D; Simon, E; Saita, C. Declarative data cleaning: Language, model, and algorithms |
2001 |
323 |
Sep06 |
Kalashnikov, DV; Mehrotra, S Domain-independent data cleaning via analysis of entity-relationship graph |
2006 |
98 |
Apr07 |
Elmagarmid, Ahmed; Ipeirotis, Panagiotis; Verykios, Vassilios Duplicate Record Detection: A Survey |
2007 |
785 |
Oct06 |
Goyal, P Duplicate record identification in bibliographic databases |
1987 |
12 |
Apr07 |
Lee, Dongwon; On, Byung-Won; Kang, Jaewoo; Park, Sanghyun Effective and scalable solutions for mixed and split citation problems in digital libraries |
2005 |
61 |
Oct06 |
McCallum, A; Nigam, K; Ungar, LH Efficient clustering of high-dimensional data sets with application to reference matching |
2000 |
550 |
Apr07 |
Kailing, K.; Kriegel, H.P.; Schonauer, S.; Seidl, T. Efficient similarity search for hierarchical data in large databases |
2004 |
|
Sep06 |
Schallehn, E; Sattler, KU; Saake, G Efficient similarity-based operations for data integration |
2004 |
39 |
Apr07 |
Song, Y; Huang, J; Councill, IG; Li, J; Giles, CL Efficient topic-based unsupervised name disambiguation |
2007 |
76 |
Nov07 |
Müller, H; Weis, M; Bleiholder, J; Leser, U Erkennen und Bereinigen von Datenfehlern in naturwissenschaftlichen Daten |
2005 |
3 |
Oct06 |
Kalashnikov, D.V.; Mehrotra, S.; Chen, Z. Exploiting relationships for domain-independent data cleaning |
2005 |
111 |
Oct06 |
Chen, Z; Kalashnikov, DV; Mehrotra, S Exploiting relationships for object consolidation |
2005 |
68 |
Sep06 |
Michalowski, M; Thakkar, S; Knoblock, CA Exploiting secondary sources for automatic object consolidation |
2003 |
23 |
Apr07 |
Christen, Peter; Churches, Tim Febrl - Freely extensible biomedical record linkage |
2002 |
29 |
Oct06 |
Borgman, CL; Siegfried, SL Getty's Synoname and its cousins: A survey of applications of personal name-matching algorithms |
1992 |
69 |
Apr07 |
Galhardas, H; Florescu, D; Shasha, D; Simon, E; E Simon, CA Improving data cleaning quality using a data lineage facility |
2001 |
30 |
Oct06 |
Hjaltason, G.R.; Samet, H. Incremental distance join algorithms for spatial databases |
1998 |
250 |
Oct06 |
Chua, CEH; Chiang, RHL; Lim, EP Instance-based attribute identification in database integration |
2003 |
36 |
Sep06 |
Sarawagi, S; Bhamidipaty, A Interactive deduplication using active learning |
2002 |
|
Sep06 |
Bhattacharya, I; Getoor, L Iterative record linkage for cleaning and integration |
2004 |
|
Sep06 |
Lee, D.; Kang, J.; Mitra, P.; Giles, C. Lee; On, B.-W. Large-Scale Citation Matching of Scientific Digital Libraries |
2006 |
|
Mar07 |
Tejada, S Learning Object Identification Rules for Information Integration |
2002 |
219 |
Oct06 |