Search: edu

Results 1 - 50 of 51

Results

Title/Author Year Citationssort icon added
Cohen, WW; Ravikumar, P; Fienberg, SE
A comparison of string distance metrics for name-matching tasks
2003 1091 Sep06
Bilenko, M; Mooney, RJ
Adaptive duplicate detection using learnable string similarity measures
2003 573 Sep06
Gravano, L.; Ipeirotis, P.G.; Jagadish, H.V.; Koudas, N.; Muthukrishnan, S.; Srivastava, D.
Approximate string joins in a database (almost) for free
2001 411 Oct06
Dong, X.; Halevy, A.; Madhavan, J.
Reference reconciliation in complex information spaces
2005 380 Sep06
Ananthakrishna, R; Chaudhuri, S; Ganti, V
Eliminating fuzzy duplicates in data warehouses
2002 334 Sep06
Mann, GS; Yarowsky, D
Unsupervised Personal Name Disambiguation
2003 283 Apr07
Pasula, H; Marthi, B; Milch, B; Russell, S; Shpitser, I
Identity uncertainty and citation matching
2003 267 Apr07
Tejada, S; Knoblock, CA; Minton, S
Learning domain-independent string transformation weights for high accuracy object identification
2002 202 Sep06
Cohen, W.W.; Hirsh, H.
Joins that generalize: text classification using Whirl
1998 161 Sep06
Jin, L.; Li, C.; Mehrotra, S.
Efficient record linkage in large data sets
2003 154 Oct06
Chaudhuri, Surajit; Ganti, Venkatesh; Motwani, Rajeev
Robust Identification of Fuzzy Duplicates
2005 140 Aug06
Maletic, J.I.; Marcus, A.
Data Cleansing: Beyond Integrity Analysis
2000 138 Sep06
Gravano, L.; Ipeirotis, P.G.; Koudas, N.; Srivastava, D.
Text joins in an RDBMS for web data integration
2003 129 Oct06
Bai, Y.; Wang, F.; Liu, P.
Efficiently Filtering RFID Data Streams
2006 114 Sep06
Cohen, WW; Kautz, H; McAllester, D
Hardening soft information sources
2000 91 Sep06
Hassell, J.; Aleman-Meza, B.; Arpinar, I.B.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text
2006 86 Apr07
Guha, S.; Koudas, N.; Marathe, A.; Srivastava, D.
Merging the Results of Approximate Match Operations
2004 73 Oct06
Whang, Steven Euijong; Menestrina, David; Koutrika, Georgia; Theobald, Martin; Garcia-Molina, Hector
Entity resolution with iterative blocking
2009 68 Sep09
Shen, W; Li, X; Doan, AH
Constraint-Based Entity Matching
2005 58 Sep06
Hassanzadeh, O; Consens, M
Linked movie data base
2009 57 May10
Arasu, Arvind; Ré, Christopher; Suciu, Dan
Large-Scale Deduplication with Constraints Using Dedupalog
2009 56 Sep09
Singla, P; Domingos, P
Object identification with attribute-mediated dependences
2005 56 Apr07
Menestrina, D.; Benjelloun, O.; Garcia-Molina, H.
Generic Entity Resolution with Data Confidences
2006 44 Sep06
Zhao, Huimin; Ram, Sudha
Entity identification for heterogeneous database integration: a multiple classifier system approach and empirical evaluation
2005 42 Oct06
On, Byung-Won; Koudas, Nick; Lee, Dongwon; Srivastava, Divesh
Group Linkage
2007 40 Feb07
Yan, S; Lee, D; Kan, MY; Giles, CL
Adaptive sorted neighborhood methods for efficient record linkage
2007 32 Nov07
Li, Huajing; Councill, Isaac; Lee, Wang-Chien; Giles, C. Lee
CiteSeerX: an Architecture and Web Service Design for an Academic Document Search Engine
2006 29 Feb07
Hassanzadeh, Oktie; Chiang, Fei; Miller, Renée; Lee, Hyun Chul
Framework for Evaluating Clustering Algorithms in Duplicate Detection
2009 29 Sep09
Zhao, H; Ram, S
Combining schema and instance information for integrating heterogeneous data sources
2007 28 Nov09
Raman, V; Hellerstein, J
Potters Wheel: An Interactive Framework for Data Cleaning and Transformation
2001 26 Sep06
Chen, Zhaoqi; Kalashnikov, Dmitri V.; Mehrotra, Sharad
Exploiting context analysis for combining multiple entity resolution systems
2009 25 Sep09
Yakout, Mohamed; Atallah, Mikhail J.; Elmagarmid, Ahmed K.
Efficient Private Record Linkage
2009 22 Sep09
Bolelli, Levent; Ertekin, Seyda; Giles, C. Lee
Clustering Scientific Literature Using Sparse Citation Graph Analysis
2006 15 Feb07
Miller, Renee; Kementsietsidis, Anastasios; Lim, Lipyeow; Wang, Min
Linkage Query Writer
2009 13 Sep09
Councill, Isaac G.; Giles, C. Lee; Iorio, Ernesto Di; Gori, Marco; Maggini, Marco; Pucci, Augusto
Towards Next Generation CiteSeer: A Flexible Architecture for Digital Library Deployment
2006 13 Feb07
Phua, C; Lee, V; Smith, K
The Personal Name Problem And a Recommended Data Mining Solution
2006 12 Apr07
Councill, Isaac G.; Li, Huajing; Zhuang, Ziming; Debnath, Sandip; Bolelli, Levent; Lee, Wang-Chien; Sivasubramaniam, Anand; Giles, C. Lee
Learning metadata from the evidence in an on-line citation matching scheme
2006 10 Feb07
Kotidis, Y.; Marian, A.; Srivastava, D.
Circumventing Data Quality Problems Using Multiple Join Paths
2006 10 Sep06
Wellner, B; Castano, J; Pustejovsky, J
Adaptive string similarity metrics for biomedical reference resolution
2005 9 Feb09
Kang, J.; Han, T.S.; Lee, D.; Mitra, P.
Establishing value mappings using statistical models and user feedback
2005 9 Sep06
Dai, B. T.; Koudas, N.; Ooi, B. C.; Srivastava, D.; Venkatasubramanian, S.
Column Heterogeneity as a Measure of Data Quality
2006 8 Sep06
Qi, Y.; Candan, K. S.; Sapino, M. L.; Kintigh, K. W.
QUEST: QUery-driven Exploration of Semistructured Data with ConflicTs and Partial Knowledge
2006 7 Sep06
Lu, Y; Nie, Z; Cheng, T; Gao, Y; Wen, JR
Name Disambiguation Using Web Connection
2007 4 Feb09
Chaudhuri, S; Sarma, AD; Ganti, V; Kaushik, R
Leveraging aggregate constraints for deduplication
2007 Sep09
Silva, Yasin N.; Aref, Walid G.; Ali, Mohamed H.
Similarity Group-By
2009 Sep09
Chen, Z; Kalashnikov, DV; Mehrotra, S
Adaptive graphical approach to entity resolution
2007 Nov07
Frigui, Hichem
MembershipMap: Data Transformation Based on Membership Aggregation
2004 Oct06
On, BW; Elmacioglu, E; Lee, D; Kang, J; Pei, J
Improving Grouped-Entity Resolution using Quasi-Cliques
2006 Feb07
Bilenko, M; Mooney, RJ
On evaluation and training-set construction for duplicate detection
2003 Oct06
Singla, P.; Domingos, P.
Multi-relational record linkage
2004 Sep06