Search: no dataset

Results 1 - 50 of 82


Title/Author Year Citationssort icon added
Nelder, J.A.; Mead, R.
A simplex method for function minimization
1965 16651 Sep06
Kohavi, R.; John, G.H.
Wrappers for Feature Subset Selection
1997 4115 Sep06
Fellegi, I.P.; Sunter, A.B.
A Theory for Record Linkage
1969 1444 Oct06
Navarro, G
A guided tour to approximate string matching
2001 1369 May07
Elmagarmid, Ahmed; Ipeirotis, Panagiotis; Verykios, Vassilios
Duplicate Record Detection: A Survey
2007 785 Oct06
Rahm, Erhard; Do, Hong Hai
Data Cleaning: Problems and Current Approaches
2000 778 Aug06
Hernandez, M.A.; Stolfo, S.J.
The merge/purge problem for large databases
1995 751 Sep06
Winkler, W.E.
The state of record linkage and current research problems
1999 634 Oct06
Hernandez, MA; Stolfo, S.
Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem
1998 604 Sep06
McCallum, A; Nigam, K; Ungar, LH
Efficient clustering of high-dimensional data sets with application to reference matching
2000 550 Apr07
Ristad, ES; Yianilos, PN; Inc, M.T.; Princeton, NJ
Learning string-edit distance
1998 498 Oct06
Monge, A.; Elkan, C.
The field matching problem: Algorithms and applications
1996 443 Oct06
Cohen, WW
Integration of heterogeneous databases without common domains using queries based on textual similarity
1998 438 Sep06
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Motwani, R.
Robust and efficient fuzzy match for online data cleaning
2003 378 Sep06
Monge, A.E.; Elkan, C.
An efficient domain-independent algorithm for detecting approximately duplicate database records
1997 364 Oct06
Bilenko, M; Mooney, R; Cohen, W; P Ravikumar, S
Adaptive name matching in information integration
2003 339 Nov07
Galhardas, H; Florescu, D; Shasha, D; Simon, E; Saita, C.
Declarative data cleaning: Language, model, and algorithms
2001 323 Sep06
Cohen, William; Richman, Jacob
Learning to match and cluster large high-dimensional data sets for data integration
2002 274 Oct06
Hjaltason, G.R.; Samet, H.
Incremental distance join algorithms for spatial databases
1998 250 Oct06
Bhattacharya, I.; Getoor, L.;
Collective Entity Resolution in Relational Data
2007 238 Apr07
Tejada, S
Learning Object Identification Rules for Information Integration
2002 219 Oct06
Bitton, D.; DeWitt, D.J.
Duplicate record elimination in large data files
1983 208 Oct06
Winkler, W.E.
Advanced methods for record linkage
1994 196 Oct06
Cohen, W.W.
Data integration using similarity joins and a word-based information representation language
2000 195 Oct06
Galhardas, H; Florescu, D; Shasha, D; Simon, E
AJAX: an extensible data cleaning tool
2000 175 Sep06
Bhattacharya, I.; Getoor, L.;
A Latent Dirichlet Model for Unsupervised Entity Resolution
2006 144 Apr07
Kalashnikov, D.V.; Mehrotra, S.; Chen, Z.
Exploiting relationships for domain-independent data cleaning
2005 111 Oct06
Kalashnikov, DV; Mehrotra, S
Domain-independent data cleaning via analysis of entity-relationship graph
2006 98 Apr07
Monge, AE
Matching Algorithms within a Duplicate Detection System
2000 91 Apr07
Low, WL; Lee, ML; Ling, TW
A knowledge-based approach for duplicate elimination in data cleaning
2001 88 Apr07
Song, Y; Huang, J; Councill, IG; Li, J; Giles, CL
Efficient topic-based unsupervised name disambiguation
2007 76 Nov07
Xi, W; Fox, EA; Fan, W; Zhang, B; Chen, Z; Yan, J; J Yan, D
SimFusion: measuring similarity using unified relationship matrix
2005 75 Oct06
Karger, DR; Jones, W
Data unification in personal information management
2006 71 Apr07
Borgman, CL; Siegfried, SL
Getty's Synoname and its cousins: A survey of applications of personal name-matching algorithms
1992 69 Apr07
Doan, AnHai; Lu, Ying; Lee, Yoonkyong; Han, Jiawei
Object Matching for Information Integration: A Profiler-Based Approach
2003 68 Sep06
Chen, Z; Kalashnikov, DV; Mehrotra, S
Exploiting relationships for object consolidation
2005 68 Sep06
Verykios, V. S.; Moustakides, G. V.; Elfeky, M. G.
A Bayesian decision model for cost optimal record matching
2003 66 Oct06
Tan, YF; Kan, MY; Lee, D
Search engine driven author disambiguation
2006 63 Apr07
Lee, Dongwon; On, Byung-Won; Kang, Jaewoo; Park, Sanghyun
Effective and scalable solutions for mixed and split citation problems in digital libraries
2005 61 Oct06
Scannapieco, M; Missier, P; Batini, C
Data Quality at a Glance
2005 58 Apr07
Bhattacharya, Indrajit; Getoor, Lise
Relational clustering for multi-type entity resolution
2005 49 Oct06
Lee, M.L.; Hsu, W.; Kothari, V.
Cleaning the spurious links in data
2004 40 Sep06
Schallehn, E; Sattler, KU; Saake, G
Efficient similarity-based operations for data integration
2004 39 Apr07
Chua, CEH; Chiang, RHL; Lim, EP
Instance-based attribute identification in database integration
2003 36 Sep06
Aizawa, A; Oyama, K
A Fast Linkage Detection Scheme for Multi-Source Information Integration
2005 35 Nov07
Benjelloun, O.; Garcia-Molina, H.; Gong, H.; Kawai, H; Larson, T.E.; Menestrina, D.; Thavisomboon, S.
D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution
2007 32 Aug07
Herbert, KG; Gehani, NH; Piel, WH; Wang, JTL; Wu, CH
BIO-AJAX: an extensible framework for biological data cleaning
2004 30 Mar07
Galhardas, H; Florescu, D; Shasha, D; Simon, E; E Simon, CA
Improving data cleaning quality using a data lineage facility
2001 30 Oct06
Shen, W.; DeRose, P.; Vu, L.; Doan, A.; Ramakrishnan, R.
Source-aware entity matching: A compositional approach
2007 29 Apr07
Christen, Peter; Churches, Tim
Febrl - Freely extensible biomedical record linkage
2002 29 Oct06