Search: no dataset

Results 1 - 50 of 82

Results

Title/Author Year Citations addedsort icon
Rahm, Erhard; Do, Hong Hai
Data Cleaning: Problems and Current Approaches
2000 778 Aug06
Galhardas, H; Florescu, D; Shasha, D; Simon, E
AJAX: an extensible data cleaning tool
2000 175 Sep06
Hernandez, MA; Stolfo, S.
Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem
1998 604 Sep06
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Motwani, R.
Robust and efficient fuzzy match for online data cleaning
2003 378 Sep06
Bhattacharya, I; Getoor, L
Iterative record linkage for cleaning and integration
2004 Sep06
Chen, Z; Kalashnikov, DV; Mehrotra, S
Exploiting relationships for object consolidation
2005 68 Sep06
Cohen, WW
Integration of heterogeneous databases without common domains using queries based on textual similarity
1998 438 Sep06
Galhardas, H; Florescu, D; Shasha, D; Simon, E; Saita, C.
Declarative data cleaning: Language, model, and algorithms
2001 323 Sep06
Sarawagi, S; Bhamidipaty, A
Interactive deduplication using active learning
2002 Sep06
Doan, AnHai; Lu, Ying; Lee, Yoonkyong; Han, Jiawei
Object Matching for Information Integration: A Profiler-Based Approach
2003 68 Sep06
Elfeky, MG; Verykios, VS; Elmagarmid, AK
TAILOR: a record linkage tool box
2002 Sep06
Chua, CEH; Chiang, RHL; Lim, EP
Instance-based attribute identification in database integration
2003 36 Sep06
Ganesh, M.; Srivastava, J.; Richardson, T.
Mining entity-identification rules for database integration
1996 21 Sep06
Hernandez, M.A.; Stolfo, S.J.
The merge/purge problem for large databases
1995 751 Sep06
Kailing, K.; Kriegel, H.P.; Schonauer, S.; Seidl, T.
Efficient similarity search for hierarchical data in large databases
2004 Sep06
Kohavi, R.; John, G.H.
Wrappers for Feature Subset Selection
1997 4115 Sep06
Lee, M.L.; Hsu, W.; Kothari, V.
Cleaning the spurious links in data
2004 40 Sep06
Nelder, J.A.; Mead, R.
A simplex method for function minimization
1965 16651 Sep06
Quass, D.; Starkey, P.
Record linkage for genealogical databases
2003 24 Sep06
Weis, M; Naumann, F
DogmatiX tracks down duplicates in XML
2005 Sep06
Benedikt, M.; Bohannon, P.; Bruns, G.
Data Cleaning for Decision Support
2006 6 Sep06
Kalashnikov, DV; Mehrotra, S
A probabilistic model for entity disambiguation using relationships
2005 16 Sep06
Xi, W; Fox, EA; Fan, W; Zhang, B; Chen, Z; Yan, J; J Yan, D
SimFusion: measuring similarity using unified relationship matrix
2005 75 Oct06
Tejada, S
Learning Object Identification Rules for Information Integration
2002 219 Oct06
Cohen, William; Richman, Jacob
Learning to match and cluster large high-dimensional data sets for data integration
2002 274 Oct06
Verykios, V. S.; Moustakides, G. V.; Elfeky, M. G.
A Bayesian decision model for cost optimal record matching
2003 66 Oct06
Bitton, D.; DeWitt, D.J.
Duplicate record elimination in large data files
1983 208 Oct06
Cohen, W.W.
Data integration using similarity joins and a word-based information representation language
2000 195 Oct06
Christen, P.; Churches, T.; Zhu, J.
Probabilistic Name and Address Cleaning and Standardization
2002 Oct06
Fellegi, I.P.; Sunter, A.B.
A Theory for Record Linkage
1969 1444 Oct06
Hjaltason, G.R.; Samet, H.
Incremental distance join algorithms for spatial databases
1998 250 Oct06
Jokinen, P.; Ukkonen, E.
Two algorithms for approximate string matching in static texts
1991 Oct06
Kalashnikov, D.V.; Mehrotra, S.; Chen, Z.
Exploiting relationships for domain-independent data cleaning
2005 111 Oct06
Koudas, N.; Marathe, A.; Srivastava, D.
SPIDER: flexible matching in databases
2005 10 Oct06
Monge, A.; Elkan, C.
The field matching problem: Algorithms and applications
1996 443 Oct06
Winkler, W.E.
The state of record linkage and current research problems
1999 634 Oct06
Winkler, W.E.
Advanced methods for record linkage
1994 196 Oct06
Ristad, ES; Yianilos, PN; Inc, M.T.; Princeton, NJ
Learning string-edit distance
1998 498 Oct06
Monge, A.E.; Elkan, C.
An efficient domain-independent algorithm for detecting approximately duplicate database records
1997 364 Oct06
Christen, Peter; Churches, Tim
Febrl - Freely extensible biomedical record linkage
2002 29 Oct06
Lee, Dongwon; On, Byung-Won; Kang, Jaewoo; Park, Sanghyun
Effective and scalable solutions for mixed and split citation problems in digital libraries
2005 61 Oct06
Bhattacharya, Indrajit; Getoor, Lise
Relational clustering for multi-type entity resolution
2005 49 Oct06
Zhu, Yan; Bornhovd, Christof; Buchmann, Alejandro P.
Data Transformation for Warehousing Web Data
2001 14 Oct06
Kukich, K.
Techniques for automatically correcting words in text
1992 Oct06
Galhardas, H; Florescu, D; Shasha, D; Simon, E; E Simon, CA
Improving data cleaning quality using a data lineage facility
2001 30 Oct06
Elmagarmid, Ahmed; Ipeirotis, Panagiotis; Verykios, Vassilios
Duplicate Record Detection: A Survey
2007 785 Oct06
Müller, H; Weis, M; Bleiholder, J; Leser, U
Erkennen und Bereinigen von Datenfehlern in naturwissenschaftlichen Daten
2005 3 Oct06
Lee, D; Kang, J; Mitra, P; Giles, CL; On, BW
Are Your Citations Clean? New Scenarios and Challenges in Maintaining Digital Libraries
2006 Feb07
Lee, D.; Kang, J.; Mitra, P.; Giles, C. Lee; On, B.-W.
Large-Scale Citation Matching of Scientific Digital Libraries
2006 Mar07
Herbert, KG; Gehani, NH; Piel, WH; Wang, JTL; Wu, CH
BIO-AJAX: an extensible framework for biological data cleaning
2004 30 Mar07