Rahm, Erhard; Do, Hong Hai Data Cleaning: Problems and Current Approaches |
2000 |
778 |
Aug06 |
Chaudhuri, Surajit; Ganti, Venkatesh; Motwani, Rajeev Robust Identification of Fuzzy Duplicates |
2005 |
140 |
Aug06 |
Raman, V; Hellerstein, J Potters Wheel: An Interactive Framework for Data Cleaning and Transformation |
2001 |
26 |
Sep06 |
Galhardas, H; Florescu, D; Shasha, D; Simon, E AJAX: an extensible data cleaning tool |
2000 |
175 |
Sep06 |
Chaudhuri, S; Dayal, U An overview of data warehousing and OLAP technology |
1997 |
|
Sep06 |
Hernandez, MA; Stolfo, S. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem |
1998 |
604 |
Sep06 |
Maletic, J.I.; Marcus, A. Data Cleansing: Beyond Integrity Analysis |
2000 |
138 |
Sep06 |
Ananthakrishna, R; Chaudhuri, S; Ganti, V Eliminating fuzzy duplicates in data warehouses |
2002 |
334 |
Sep06 |
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Motwani, R. Robust and efficient fuzzy match for online data cleaning |
2003 |
378 |
Sep06 |
Bhattacharya, I; Getoor, L Iterative record linkage for cleaning and integration |
2004 |
|
Sep06 |
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Kapoor, R.; Narasayya, V.; Vassilakis, T. Data cleaning in microsoft SQL server 2005 |
2005 |
26 |
Sep06 |
Bilenko, M; Mooney, RJ Adaptive duplicate detection using learnable string similarity measures |
2003 |
573 |
Sep06 |
Kirsten, T.; Rahm, E. BioFuice: Mapping-based data integration in bioinformatics |
2006 |
18 |
Sep06 |
Chen, Z; Kalashnikov, DV; Mehrotra, S Exploiting relationships for object consolidation |
2005 |
68 |
Sep06 |
Cohen, WW; Kautz, H; McAllester, D Hardening soft information sources |
2000 |
91 |
Sep06 |
Cohen, WW; Ravikumar, P; Fienberg, SE A comparison of string distance metrics for name-matching tasks |
2003 |
1091 |
Sep06 |
Cohen, W.W.; Hirsh, H. Joins that generalize: text classification using Whirl |
1998 |
161 |
Sep06 |
Cohen, WW Integration of heterogeneous databases without common domains using queries based on textual similarity |
1998 |
438 |
Sep06 |
Jeh, G; Widom, J SimRank: a measure of structural-context similarity |
2002 |
|
Sep06 |
Do, H. H.; Stöhr, T.; Rahm, E.; Müller, R. Evaluierung von Data Warehouse-Werkzeugen |
2000 |
3 |
Sep06 |
Galhardas, H; Florescu, D; Shasha, D; Simon, E; Saita, C. Declarative data cleaning: Language, model, and algorithms |
2001 |
323 |
Sep06 |
Sarawagi, S; Bhamidipaty, A Interactive deduplication using active learning |
2002 |
|
Sep06 |
Doan, AnHai; Lu, Ying; Lee, Yoonkyong; Han, Jiawei Object Matching for Information Integration: A Profiler-Based Approach |
2003 |
68 |
Sep06 |
Dong, X.; Halevy, A.; Madhavan, J. Reference reconciliation in complex information spaces |
2005 |
380 |
Sep06 |
Elfeky, MG; Verykios, VS; Elmagarmid, AK TAILOR: a record linkage tool box |
2002 |
|
Sep06 |
Chua, CEH; Chiang, RHL; Lim, EP Instance-based attribute identification in database integration |
2003 |
36 |
Sep06 |
Gu, L; Baxter, R; Vickers, D; Rainsford, C Record linkage: Current practice and future directions |
2003 |
137 |
Sep06 |
Ganesh, M.; Srivastava, J.; Richardson, T. Mining entity-identification rules for database integration |
1996 |
21 |
Sep06 |
Hernandez, M.A.; Stolfo, S.J. The merge/purge problem for large databases |
1995 |
751 |
Sep06 |
Kailing, K.; Kriegel, H.P.; Schonauer, S.; Seidl, T. Efficient similarity search for hierarchical data in large databases |
2004 |
|
Sep06 |
Kang, J.; Han, T.S.; Lee, D.; Mitra, P. Establishing value mappings using statistical models and user feedback |
2005 |
9 |
Sep06 |
Kohavi, R.; John, G.H. Wrappers for Feature Subset Selection |
1997 |
4115 |
Sep06 |
Lee, M.L.; Hsu, W.; Kothari, V. Cleaning the spurious links in data |
2004 |
40 |
Sep06 |
Naumann, F; Leser, U; Freytag, J Quality-driven Integration of Heterogeneous Information Systems |
1999 |
246 |
Sep06 |
Nelder, J.A.; Mead, R. A simplex method for function minimization |
1965 |
16651 |
Sep06 |
Singla, P.; Domingos, P. Multi-relational record linkage |
2004 |
|
Sep06 |
Pinheiro, J.C.; Sun, D.X. Methods for linking and mining massive heterogeneous databases |
1998 |
33 |
Sep06 |
Quass, D.; Starkey, P. Record linkage for genealogical databases |
2003 |
24 |
Sep06 |
Rahm, E.; Thor, A.; Aumueller, D.; Do, H.H.; Golovin, N.; Kirsten, T. iFuice--Information Fusion utilizing Instance Correspondences and Peer Mappings |
2005 |
|
Sep06 |
Rahm, E.; Thor, A. Citation analysis of database publications |
2005 |
54 |
Sep06 |
Shen, W; Li, X; Doan, AH Constraint-Based Entity Matching |
2005 |
58 |
Sep06 |
Tejada, S; Knoblock, CA; Minton, S Learning domain-independent string transformation weights for high accuracy object identification |
2002 |
202 |
Sep06 |
Thor, A; Golovin, N; Rahm, E Adaptive website recommendations with AWESOME |
2005 |
7 |
Sep06 |
Thor, A.; Rahm, E. AWESOME - a Data Warehouse-based System for Adaptive Website Recommendations |
2004 |
18 |
Sep06 |
Weis, M; Naumann, F DogmatiX tracks down duplicates in XML |
2005 |
|
Sep06 |
Lautemann, SE A Propagation Mechanism for Populated Schema Versions |
1997 |
33 |
Sep06 |
Liu, L; Zicari, R; Hursch, W; Lieberherr, KJ The role of polymorphic reuse mechanisms in schema evolution in an object-oriented database |
1997 |
|
Sep06 |
Demeyer, S; Mens, T; Wermelinger, M Towards a software evolution benchmark |
2001 |
29 |
Sep06 |
Milano, D.; Scannapieco, M.; Catarci, T. Structure Aware XML Object Identification |
2006 |
27 |
Sep06 |
Qi, Y.; Candan, K. S.; Sapino, M. L.; Kintigh, K. W. QUEST: QUery-driven Exploration of Semistructured Data with ConflicTs and Partial Knowledge |
2006 |
7 |
Sep06 |