Ontology-Driven Automatic Entity Disambiguation in Unstructured Text

Authors: 
Hassell, J.; Aleman-Meza, B.; Arpinar, I.B.
Author: 
Hassell, J
Aleman-Meza, B
Arpinar, I
Year: 
2006
Venue: 
Proc. ICSW 2006, LNCS
URL: 
http://lsdis.cs.uga.edu/library/download/HAA-disambiguation-ISWC2006.pdf
Citations: 
86
Citations range: 
50 - 99
AttachmentSize
Hassell2006OntologyDrivenAutomatic.pdf281.11 KB

Precisely identifying entities in web documents is essential for
document indexing, web search and data integration. Entity disambiguation is
the challenge of determining the correct entity out of various candidate entities.
Our novel method utilizes background knowledge in the form of a populated
ontology. Additionally, it does not rely on the existence of any structure in a
document or the appearance of data items that can provide strong evidence,
such as email addresses, for disambiguating person names. Originality of our
method is demonstrated in the way it uses different relationships in a document
as well as from the ontology to provide clues in determining the correct entity.
We demonstrate the applicability of our method by disambiguating names of
researchers appearing in a collection of DBWorld posts using a large scale, realworld
ontology extracted from the DBLP bibliography website. The precision
and recall measurements provide encouraging results.