Effective keyword-based XML retrieval using the data-centric and document-centric features

Tsubasa Tanabe, Toshiyuki Shimizu, Masatoshi Yoshikawa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Extensible Markup Language (XML) is used for not only describing structured documents but also for describing data just for generating XML from relational data. The former is called document-centric XML, and the latter is called data-centric XML. From studies on retrieving data-centric XML by using keyword searches, methods based on LCA have been proposed, while from studies on retrieving document-centric XML, methods based on information retrieval that focus on the granularity of XML elements have been proposed. However, documents generally have both data-centric and document-centric elements, so there are cases in which desired results cannot be returned by using existing research. We propose a method for constructing suitable search results for XML documents that include both data-centric and document-centric elements by considering a user's query intention and element features (data-centric or document-centric). Our experiments show that both data-centric and document-centric elements need to be considered for actual XML documents.

Original languageEnglish
Title of host publicationInformation Retrieval Technology - 8th Asia Information Retrieval Societies Conference, AIRS 2012, Proceedings
Pages427-436
Number of pages10
DOIs
Publication statusPublished - 2012
Externally publishedYes
Event8th Asia Information Retrieval Societies Conference, AIRS 2012 - Tianjin, China
Duration: Dec 17 2012Dec 19 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7675 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th Asia Information Retrieval Societies Conference, AIRS 2012
Country/TerritoryChina
CityTianjin
Period12/17/1212/19/12

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Effective keyword-based XML retrieval using the data-centric and document-centric features'. Together they form a unique fingerprint.

Cite this