TY - GEN
T1 - Effective keyword-based XML retrieval using the data-centric and document-centric features
AU - Tanabe, Tsubasa
AU - Shimizu, Toshiyuki
AU - Yoshikawa, Masatoshi
PY - 2012
Y1 - 2012
N2 - Extensible Markup Language (XML) is used for not only describing structured documents but also for describing data just for generating XML from relational data. The former is called document-centric XML, and the latter is called data-centric XML. From studies on retrieving data-centric XML by using keyword searches, methods based on LCA have been proposed, while from studies on retrieving document-centric XML, methods based on information retrieval that focus on the granularity of XML elements have been proposed. However, documents generally have both data-centric and document-centric elements, so there are cases in which desired results cannot be returned by using existing research. We propose a method for constructing suitable search results for XML documents that include both data-centric and document-centric elements by considering a user's query intention and element features (data-centric or document-centric). Our experiments show that both data-centric and document-centric elements need to be considered for actual XML documents.
AB - Extensible Markup Language (XML) is used for not only describing structured documents but also for describing data just for generating XML from relational data. The former is called document-centric XML, and the latter is called data-centric XML. From studies on retrieving data-centric XML by using keyword searches, methods based on LCA have been proposed, while from studies on retrieving document-centric XML, methods based on information retrieval that focus on the granularity of XML elements have been proposed. However, documents generally have both data-centric and document-centric elements, so there are cases in which desired results cannot be returned by using existing research. We propose a method for constructing suitable search results for XML documents that include both data-centric and document-centric elements by considering a user's query intention and element features (data-centric or document-centric). Our experiments show that both data-centric and document-centric elements need to be considered for actual XML documents.
UR - http://www.scopus.com/inward/record.url?scp=84871604563&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84871604563&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-35341-3_38
DO - 10.1007/978-3-642-35341-3_38
M3 - Conference contribution
AN - SCOPUS:84871604563
SN - 9783642353406
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 427
EP - 436
BT - Information Retrieval Technology - 8th Asia Information Retrieval Societies Conference, AIRS 2012, Proceedings
T2 - 8th Asia Information Retrieval Societies Conference, AIRS 2012
Y2 - 17 December 2012 through 19 December 2012
ER -