Research Paper Search Using a Topic-Based Boolean Query Search and a General Query-Based Ranking Model

Satoshi Fukuda, Yoichi Tomiura, Emi Ishita

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

When conducting a search for research papers, the search should return comprehensive results related to the user’s query. In general, a user inputs a Boolean query that reflects the information need, and the search engine ranks the research papers based on the query. However, it is difficult to anticipate all possible terms that authors of relevant papers might have used. Moreover, general query-based ranking methods emphasize how to rank the relevant documents at the top of the results, but require some means of guaranteeing the comprehensiveness of the results. Therefore, two ranking methods that consider the comprehensiveness of relevant papers are proposed. The first uses a topic-based Boolean query search. This search converts every word in the abstract set and query into a topic via topic analysis by Latent Dirichlet Allocation (LDA) and conducts a search at the topic level. The topic assigned to synonyms of a search term is expected to be the same as that assigned to the search term. Each paper is ranked based on the number of times it is matched with each topic-based Boolean query search executed for various LDA parameter settings. The second is a hybrid method that emphasizes better results from our topic-based ranking result and a general query-based ranking result. This method is based on the observation that the paper sets retrieved by our method and by a general ranking method will be different. Through experiments using the NTCIR-1 and -2 datasets, the effectiveness of our topic-based and hybrid methods are demonstrated.

Original languageEnglish
Title of host publicationDatabase and Expert Systems Applications - 30th International Conference, DEXA 2019, Proceedings
EditorsSven Hartmann, Josef Küng, Gabriele Anderst-Kotsis, Ismail Khalil, Sharma Chakravarthy, A Min Tjoa
PublisherSpringer
Pages65-75
Number of pages11
ISBN (Print)9783030276171
DOIs
Publication statusPublished - 2019
Event30th International Conference on Database and Expert Systems Applications, DEXA 2019 - Linz, Austria
Duration: Aug 26 2019Aug 29 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11707 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference30th International Conference on Database and Expert Systems Applications, DEXA 2019
Country/TerritoryAustria
CityLinz
Period8/26/198/29/19

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Research Paper Search Using a Topic-Based Boolean Query Search and a General Query-Based Ranking Model'. Together they form a unique fingerprint.

Cite this