Search: no paper type

Results 1 - 50 of 65

Results

Title/Author Year Citations addedsort icon
Hernandez, MA; Stolfo, S.
Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem
1998 604 Sep06
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Motwani, R.
Robust and efficient fuzzy match for online data cleaning
2003 378 Sep06
Bhattacharya, I; Getoor, L
Iterative record linkage for cleaning and integration
2004 Sep06
Chen, Z; Kalashnikov, DV; Mehrotra, S
Exploiting relationships for object consolidation
2005 68 Sep06
Cohen, WW
Integration of heterogeneous databases without common domains using queries based on textual similarity
1998 438 Sep06
Sarawagi, S; Bhamidipaty, A
Interactive deduplication using active learning
2002 Sep06
Doan, AnHai; Lu, Ying; Lee, Yoonkyong; Han, Jiawei
Object Matching for Information Integration: A Profiler-Based Approach
2003 68 Sep06
Chua, CEH; Chiang, RHL; Lim, EP
Instance-based attribute identification in database integration
2003 36 Sep06
Ganesh, M.; Srivastava, J.; Richardson, T.
Mining entity-identification rules for database integration
1996 21 Sep06
Hernandez, M.A.; Stolfo, S.J.
The merge/purge problem for large databases
1995 751 Sep06
Kailing, K.; Kriegel, H.P.; Schonauer, S.; Seidl, T.
Efficient similarity search for hierarchical data in large databases
2004 Sep06
Kohavi, R.; John, G.H.
Wrappers for Feature Subset Selection
1997 4115 Sep06
Lee, M.L.; Hsu, W.; Kothari, V.
Cleaning the spurious links in data
2004 40 Sep06
Nelder, J.A.; Mead, R.
A simplex method for function minimization
1965 16651 Sep06
Quass, D.; Starkey, P.
Record linkage for genealogical databases
2003 24 Sep06
Bhattacharya, I; Getoor, L; Licamele, L
Query-time entity resolution
2006 39 Sep06
Kalashnikov, DV; Mehrotra, S
A probabilistic model for entity disambiguation using relationships
2005 16 Sep06
Xi, W; Fox, EA; Fan, W; Zhang, B; Chen, Z; Yan, J; J Yan, D
SimFusion: measuring similarity using unified relationship matrix
2005 75 Oct06
Tejada, S
Learning Object Identification Rules for Information Integration
2002 219 Oct06
Cohen, William; Richman, Jacob
Learning to match and cluster large high-dimensional data sets for data integration
2002 274 Oct06
Verykios, V. S.; Moustakides, G. V.; Elfeky, M. G.
A Bayesian decision model for cost optimal record matching
2003 66 Oct06
Bitton, D.; DeWitt, D.J.
Duplicate record elimination in large data files
1983 208 Oct06
Cohen, W.W.
Data integration using similarity joins and a word-based information representation language
2000 195 Oct06
Christen, P.; Churches, T.; Zhu, J.
Probabilistic Name and Address Cleaning and Standardization
2002 Oct06
Fellegi, I.P.; Sunter, A.B.
A Theory for Record Linkage
1969 1444 Oct06
Hjaltason, G.R.; Samet, H.
Incremental distance join algorithms for spatial databases
1998 250 Oct06
Jokinen, P.; Ukkonen, E.
Two algorithms for approximate string matching in static texts
1991 Oct06
Kalashnikov, D.V.; Mehrotra, S.; Chen, Z.
Exploiting relationships for domain-independent data cleaning
2005 111 Oct06
Monge, A.; Elkan, C.
The field matching problem: Algorithms and applications
1996 443 Oct06
Ristad, ES; Yianilos, PN; Inc, M.T.; Princeton, NJ
Learning string-edit distance
1998 498 Oct06
Monge, A.E.; Elkan, C.
An efficient domain-independent algorithm for detecting approximately duplicate database records
1997 364 Oct06
Lee, Dongwon; On, Byung-Won; Kang, Jaewoo; Park, Sanghyun
Effective and scalable solutions for mixed and split citation problems in digital libraries
2005 61 Oct06
Bhattacharya, Indrajit; Getoor, Lise
Relational clustering for multi-type entity resolution
2005 49 Oct06
Kukich, K.
Techniques for automatically correcting words in text
1992 Oct06
Galhardas, H; Florescu, D; Shasha, D; Simon, E; E Simon, CA
Improving data cleaning quality using a data lineage facility
2001 30 Oct06
Lee, D; Kang, J; Mitra, P; Giles, CL; On, BW
Are Your Citations Clean? New Scenarios and Challenges in Maintaining Digital Libraries
2006 Feb07
Lee, D.; Kang, J.; Mitra, P.; Giles, C. Lee; On, B.-W.
Large-Scale Citation Matching of Scientific Digital Libraries
2006 Mar07
Jakoniene, V; Rundqvist, D;Lambrix, P
A method for similarity-based grouping of biological data
2006 8 Mar07
Herbert, KG; Gehani, NH; Piel, WH; Wang, JTL; Wu, CH
BIO-AJAX: an extensible framework for biological data cleaning
2004 30 Mar07
Tan, YF; Kan, MY; Lee, D
Search engine driven author disambiguation
2006 63 Apr07
Bhattacharya, I.; Getoor, L.;
Collective Entity Resolution in Relational Data
2007 238 Apr07
Bhattacharya, I.; Licamele, L.; Getoor, L.;
Relational Clustering for Entity Resolution Queries
2006 2 Apr07
Bhattacharya, I.; Getoor, L.;
A Latent Dirichlet Model for Unsupervised Entity Resolution
2006 144 Apr07
Michalowski, M; Thakkar, S; Knoblock, CA
Exploiting secondary sources for automatic object consolidation
2003 23 Apr07
Kübart, J,.; Grimmer, Udo; Hipp, Jochen
Regelbasierte Ausreißersuche zur Datenqualitätsanalyse
2005 4 Apr07
Snae, C; Diaz, BM
An interface for mining genealogical nominal data using the concept of linkage and a hybrid name matching algorithm
2002 9 Apr07
Malin, B
Re-identification of Familial Database Records.
2006 Apr07
Borgman, CL; Siegfried, SL
Getty's Synoname and its cousins: A survey of applications of personal name-matching algorithms
1992 69 Apr07
McCallum, A; Nigam, K; Ungar, LH
Efficient clustering of high-dimensional data sets with application to reference matching
2000 550 Apr07
Monge, AE
Matching Algorithms within a Duplicate Detection System
2000 91 Apr07