Nelder, J.A.; Mead, R. A simplex method for function minimization |
1965 |
16651 |
Sep06 |
Fellegi, I.P.; Sunter, A.B. A Theory for Record Linkage |
1969 |
1444 |
Oct06 |
Bitton, D.; DeWitt, D.J. Duplicate record elimination in large data files |
1983 |
208 |
Oct06 |
Goyal, P Duplicate record identification in bibliographic databases |
1987 |
12 |
Apr07 |
Jokinen, P.; Ukkonen, E. Two algorithms for approximate string matching in static texts |
1991 |
|
Oct06 |
Kukich, K. Techniques for automatically correcting words in text |
1992 |
|
Oct06 |
Borgman, CL; Siegfried, SL Getty's Synoname and its cousins: A survey of applications of personal name-matching algorithms |
1992 |
69 |
Apr07 |
Ukkonen, E Approximate string-matching with q-grams and maximal matches |
1992 |
|
May07 |
Winkler, W.E. Advanced methods for record linkage |
1994 |
196 |
Oct06 |
Hernandez, M.A.; Stolfo, S.J. The merge/purge problem for large databases |
1995 |
751 |
Sep06 |
Ganesh, M.; Srivastava, J.; Richardson, T. Mining entity-identification rules for database integration |
1996 |
21 |
Sep06 |
Monge, A.; Elkan, C. The field matching problem: Algorithms and applications |
1996 |
443 |
Oct06 |
Lautemann, SE A Propagation Mechanism for Populated Schema Versions |
1997 |
33 |
Sep06 |
Liu, L; Zicari, R; Hursch, W; Lieberherr, KJ The role of polymorphic reuse mechanisms in schema evolution in an object-oriented database |
1997 |
|
Sep06 |
Chaudhuri, S; Dayal, U An overview of data warehousing and OLAP technology |
1997 |
|
Sep06 |
Kohavi, R.; John, G.H. Wrappers for Feature Subset Selection |
1997 |
4115 |
Sep06 |
Monge, A.E.; Elkan, C. An efficient domain-independent algorithm for detecting approximately duplicate database records |
1997 |
364 |
Oct06 |
Hernandez, MA; Stolfo, S. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem |
1998 |
604 |
Sep06 |
Cohen, W.W.; Hirsh, H. Joins that generalize: text classification using Whirl |
1998 |
161 |
Sep06 |
Cohen, WW Integration of heterogeneous databases without common domains using queries based on textual similarity |
1998 |
438 |
Sep06 |
Pinheiro, J.C.; Sun, D.X. Methods for linking and mining massive heterogeneous databases |
1998 |
33 |
Sep06 |
Hjaltason, G.R.; Samet, H. Incremental distance join algorithms for spatial databases |
1998 |
250 |
Oct06 |
Ristad, ES; Yianilos, PN; Inc, M.T.; Princeton, NJ Learning string-edit distance |
1998 |
498 |
Oct06 |
Naumann, F; Leser, U; Freytag, J Quality-driven Integration of Heterogeneous Information Systems |
1999 |
246 |
Sep06 |
Winkler, W.E. The state of record linkage and current research problems |
1999 |
634 |
Oct06 |
Rahm, Erhard; Do, Hong Hai Data Cleaning: Problems and Current Approaches |
2000 |
778 |
Aug06 |
Maletic, J.I.; Marcus, A. Data Cleansing: Beyond Integrity Analysis |
2000 |
138 |
Sep06 |
Galhardas, H; Florescu, D; Shasha, D; Simon, E AJAX: an extensible data cleaning tool |
2000 |
175 |
Sep06 |
Cohen, WW; Kautz, H; McAllester, D Hardening soft information sources |
2000 |
91 |
Sep06 |
Do, H. H.; Stöhr, T.; Rahm, E.; Müller, R. Evaluierung von Data Warehouse-Werkzeugen |
2000 |
3 |
Sep06 |
Cohen, W.W. Data integration using similarity joins and a word-based information representation language |
2000 |
195 |
Oct06 |
Lee, M.L.; Ling, T.W.; Low, W.L. IntelliClean: a knowledge-based intelligent data cleaner |
2000 |
149 |
Oct06 |
Vassiliadis, Panos; Vagena, Zografoula; Skiadopoulos, Spiros; Karayannidis, Nikos; Sellis, Timos Arktos: A Tool For Data Cleaning and Transformation in Data Warehouse Environments |
2000 |
28 |
Oct06 |
McCallum, A; Nigam, K; Ungar, LH Efficient clustering of high-dimensional data sets with application to reference matching |
2000 |
550 |
Apr07 |
Monge, AE Matching Algorithms within a Duplicate Detection System |
2000 |
91 |
Apr07 |
Verykios, VS; Elfeky, MG; AK Elmagarmid, A On The Accuracy and Completeness of The Record Matching Process |
2000 |
16 |
Apr07 |
Demeyer, S; Mens, T; Wermelinger, M Towards a software evolution benchmark |
2001 |
29 |
Sep06 |
Raman, V; Hellerstein, J Potters Wheel: An Interactive Framework for Data Cleaning and Transformation |
2001 |
26 |
Sep06 |
Tejada, S; Knoblock, CA; Minton, S Learning object identification rules for information integration |
2001 |
219 |
May08 |
Galhardas, H; Florescu, D; Shasha, D; Simon, E; Saita, C. Declarative data cleaning: Language, model, and algorithms |
2001 |
323 |
Sep06 |
Gravano, L.; Ipeirotis, P.G.; Jagadish, H.V.; Koudas, N.; Muthukrishnan, S.; Srivastava, D. Approximate string joins in a database (almost) for free |
2001 |
411 |
Oct06 |
Zhu, Yan; Bornhovd, Christof; Buchmann, Alejandro P. Data Transformation for Warehousing Web Data |
2001 |
14 |
Oct06 |
Galhardas, H; Florescu, D; Shasha, D; Simon, E; E Simon, CA Improving data cleaning quality using a data lineage facility |
2001 |
30 |
Oct06 |
Low, WL; Lee, ML; Ling, TW A knowledge-based approach for duplicate elimination in data cleaning |
2001 |
88 |
Apr07 |
Navarro, G A guided tour to approximate string matching |
2001 |
1369 |
May07 |
Ananthakrishna, R; Chaudhuri, S; Ganti, V Eliminating fuzzy duplicates in data warehouses |
2002 |
334 |
Sep06 |
Jeh, G; Widom, J SimRank: a measure of structural-context similarity |
2002 |
|
Sep06 |
Sarawagi, S; Bhamidipaty, A Interactive deduplication using active learning |
2002 |
|
Sep06 |
Elfeky, MG; Verykios, VS; Elmagarmid, AK TAILOR: a record linkage tool box |
2002 |
|
Sep06 |
Tejada, S; Knoblock, CA; Minton, S Learning domain-independent string transformation weights for high accuracy object identification |
2002 |
202 |
Sep06 |