Gravano, L.; Ipeirotis, P.G.; Jagadish, H.V.; Koudas, N.; Muthukrishnan, S.; Srivastava, D. Approximate string joins in a database (almost) for free |
2001 |
411 |
Oct06 |
Fellegi, I.P.; Sunter, A.B. A Theory for Record Linkage |
1969 |
1444 |
Oct06 |
Chaudhuri, S.; Ganti, V.; Kaushik, R. A Primitive Operator for Similarity Joins in Data Cleaning |
2006 |
201 |
Oct06 |
Christen, P.; Churches, T.; Zhu, J. Probabilistic Name and Address Cleaning and Standardization |
2002 |
|
Oct06 |
Cohen, W.W. Data integration using similarity joins and a word-based information representation language |
2000 |
195 |
Oct06 |
Bitton, D.; DeWitt, D.J. Duplicate record elimination in large data files |
1983 |
208 |
Oct06 |
Zhao, Huimin; Ram, Sudha Entity identification for heterogeneous database integration: a multiple classifier system approach and empirical evaluation |
2005 |
42 |
Oct06 |
Verykios, V. S.; Moustakides, G. V.; Elfeky, M. G. A Bayesian decision model for cost optimal record matching |
2003 |
66 |
Oct06 |
Cohen, William; Richman, Jacob Learning to match and cluster large high-dimensional data sets for data integration |
2002 |
274 |
Oct06 |
Tejada, S Learning Object Identification Rules for Information Integration |
2002 |
219 |
Oct06 |
Xi, W; Fox, EA; Fan, W; Zhang, B; Chen, Z; Yan, J; J Yan, D SimFusion: measuring similarity using unified relationship matrix |
2005 |
75 |
Oct06 |
Kalashnikov, DV; Mehrotra, S A probabilistic model for entity disambiguation using relationships |
2005 |
16 |
Sep06 |
Bhattacharya, I; Getoor, L; Licamele, L Query-time entity resolution |
2006 |
39 |
Sep06 |
Mazeika, A.; Bohlen, M.H. Cleansing Databases of Misspelled Proper Nouns |
2006 |
|
Sep06 |
Benedikt, M.; Bohannon, P.; Bruns, G. Data Cleaning for Decision Support |
2006 |
6 |
Sep06 |
Bai, Y.; Wang, F.; Liu, P. Efficiently Filtering RFID Data Streams |
2006 |
114 |
Sep06 |
Zhuang, Y.; Chen, L. In-network Outlier Cleaning for Data Collection in Sensor Networks |
2006 |
38 |
Sep06 |
Kotidis, Y.; Marian, A.; Srivastava, D. Circumventing Data Quality Problems Using Multiple Join Paths |
2006 |
10 |
Sep06 |
Menestrina, D.; Benjelloun, O.; Garcia-Molina, H. Generic Entity Resolution with Data Confidences |
2006 |
44 |
Sep06 |
Dai, B. T.; Koudas, N.; Ooi, B. C.; Srivastava, D.; Venkatasubramanian, S. Column Heterogeneity as a Measure of Data Quality |
2006 |
8 |
Sep06 |
Qi, Y.; Candan, K. S.; Sapino, M. L.; Kintigh, K. W. QUEST: QUery-driven Exploration of Semistructured Data with ConflicTs and Partial Knowledge |
2006 |
7 |
Sep06 |
Milano, D.; Scannapieco, M.; Catarci, T. Structure Aware XML Object Identification |
2006 |
27 |
Sep06 |
Demeyer, S; Mens, T; Wermelinger, M Towards a software evolution benchmark |
2001 |
29 |
Sep06 |
Liu, L; Zicari, R; Hursch, W; Lieberherr, KJ The role of polymorphic reuse mechanisms in schema evolution in an object-oriented database |
1997 |
|
Sep06 |
Lautemann, SE A Propagation Mechanism for Populated Schema Versions |
1997 |
33 |
Sep06 |