A keyword recommendation method using CorKeD words and its application to earth science data

Youichi Ishida, Toshiyuki Shimizu, Masatoshi Yoshikawa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

In various research domains, data providers themselves annotate their own data with keywords from a controlled vocabulary. However, since selecting keywords requires extensive knowledge of the domain and the controlled vocabulary, even data providers have difficulty in selecting appropriate keywords from the vocabulary. Therefore, we propose a method for recommending relevant keywords in a controlled vocabulary to data providers. We focus on a keyword definition, and calculate the similarity between an abstract text of data and the keyword definition. Moreover, considering that there are unnecessary words in the calculation, we extract CorKeD (Corpus-based Keyword Decisive) words from a target domain corpus so that we can measure the similarity appropriately. We conduct an experiment on earth science data, and verify the effectiveness of extracting the CorKeD words, which are the terms that better characterize the domain.

Original languageEnglish
Title of host publicationInformation Retrieval Technology - 11th Asia Information Retrieval Societies Conference, AIRS 2015, Proceedings
EditorsFalk Scholer, Guido Zuccon, Shlomo Geva, Aixin Sun, Hideo Joho, Peng Zhang
PublisherSpringer Verlag
Pages96-108
Number of pages13
ISBN (Print)9783319289397
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event11th Asia Information Retrieval Societies Conference, AIRS 2015 - Brisbane, Australia
Duration: Dec 2 2015Dec 4 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9460
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th Asia Information Retrieval Societies Conference, AIRS 2015
Country/TerritoryAustralia
CityBrisbane
Period12/2/1512/4/15

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint

Dive into the research topics of 'A keyword recommendation method using CorKeD words and its application to earth science data'. Together they form a unique fingerprint.

Cite this