Efficient similarity search for hierarchical data in large databases

Thu, 09/14/2006 - 09:57 — Anonymous

Authors:

Kailing, K.; Kriegel, H.P.; Schonauer, S.; Seidl, T.

Year:

2004

Venue:

Proc. 9th Int. Conf. on Extending Database Technology (EDBT’04), 2004

URL:

http://www.springerlink.com/content/V0WT6KHXGHWEAUJD/fulltext.pdf

Attachment	Size
Kailing2004Efficientsimilaritysearchforhierarchicaldatainlarge.pdf	1.03 MB

Structured and semi-structured object representations are becoming increasingly
important for modern database applications. Examples of such data are
hierarchical structures, including chemical compounds, XML data, and image data.
As a key feature, database systems have to support the search for similar objects,
where it is important to take into account both the structure and the content features
of the objects. A successful approach is to use the edit distance for tree-structured
data. As the computation of this measure is NP-complete, constrained
edit distances have been successfully applied to trees. While yielding good results,
they are still computationally complex and, therefore, of limited benefit for
searching in large databases. In this paper, we propose a filter and refinement
architecture to overcome this problem. We present a set of new filter methods
for structural and for content-based information in tree-structured data, as well
as ways to flexibly combine different filter criteria. The efficiency of our methods,
resulting from the good selectivity of the filters, is demonstrated in extensive
experiments with real-world applications.
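
The filter and refinement idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not spell out the filters or the constrained edit distance, so everything here is an assumption made for illustration. The hypothetical `lower_bound` combines a size difference (structure) with a label histogram (content), and the hypothetical `top_down_distance` is a simple top-down tree distance (relabel a node, insert or delete whole subtrees) standing in for the expensive exact measure. The filter is safe because it never exceeds the exact distance, so no true result is discarded.

```python
from collections import Counter
from functools import lru_cache

# Assumed representation: a tree is a nested tuple (label, (child, child, ...)).
def tree(label, *children):
    return (label, tuple(children))

def size(t):
    """Number of nodes in the tree."""
    return 1 + sum(size(c) for c in t[1])

def label_histogram(t):
    """Count how often each label occurs in the tree."""
    hist = Counter([t[0]])
    for c in t[1]:
        hist.update(label_histogram(c))
    return hist

def lower_bound(t1, t2):
    """Cheap filter distance (illustrative); never exceeds top_down_distance."""
    h1, h2 = label_histogram(t1), label_histogram(t2)
    l1 = sum(abs(h1[k] - h2[k]) for k in h1.keys() | h2.keys())
    # One relabel changes the histogram L1 distance by at most 2,
    # one node insertion or deletion changes it (and the size) by 1.
    return max(abs(size(t1) - size(t2)), (l1 + 1) // 2)

@lru_cache(maxsize=None)
def top_down_distance(t1, t2):
    """Expensive exact distance (stand-in): unit-cost top-down tree distance."""
    (a, kids1), (b, kids2) = t1, t2
    n, m = len(kids1), len(kids2)
    # Sequence edit distance over the two child lists; dropping or adding
    # a child costs the size of its whole subtree.
    d = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = d[i - 1][0] + size(kids1[i - 1])
    for j in range(1, m + 1):
        d[0][j] = d[0][j - 1] + size(kids2[j - 1])
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d[i][j] = min(
                d[i - 1][j] + size(kids1[i - 1]),
                d[i][j - 1] + size(kids2[j - 1]),
                d[i - 1][j - 1] + top_down_distance(kids1[i - 1], kids2[j - 1]),
            )
    return (a != b) + d[n][m]

def range_query(database, query, eps):
    """Return all trees within eps of the query, plus the candidate count."""
    # Filter step: discard objects whose lower bound already exceeds eps.
    candidates = [t for t in database if lower_bound(query, t) <= eps]
    # Refinement step: run the exact distance on the survivors only.
    results = [t for t in candidates if top_down_distance(query, t) <= eps]
    return results, len(candidates)

# Toy database (made-up data, for illustration only).
db = [
    tree("a", tree("b"), tree("c")),
    tree("a", tree("b"), tree("d")),
    tree("a", tree("b", tree("e")), tree("c")),
    tree("x", tree("y"), tree("z"), tree("w"), tree("v")),
]
q = tree("a", tree("b"), tree("c"))
hits, n_candidates = range_query(db, q, 1)
```

In this toy run the last tree is rejected by the filter alone, so the exact distance is computed for three of the four objects. The more selective the filter, the fewer exact computations remain, which is the source of the efficiency gain the abstract refers to.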
