TY - GEN
T1 - Feature words that classify problem sentence in scientific article
AU - Sakai, Toshihiko
AU - Hirokawa, Sachio
PY - 2012
Y1 - 2012
N2 - Literature review requires understanding the contents from several view points, such as the problem and the method that the articles describe. Search from these viewpoints will improve the efficiency of survey, if particular segments of articles were extracted, indexed and can be used as auxiliary query. This paper focuses on sentences that describe the problem in an abstract and the feature sets that classify such problem sentences. Classification performance are evaluated by 10-fold cross-validation for six candidate sets of feature words. It turned out that the set of all words gains the best performance if 90% of the data are used as training data. However, the set of a small number of words with positive scores outperforms other feature sets, if the training data is only 10%. In such a realistic situation, the feature words are effective in improving classification performance.
AB - Literature review requires understanding the contents from several view points, such as the problem and the method that the articles describe. Search from these viewpoints will improve the efficiency of survey, if particular segments of articles were extracted, indexed and can be used as auxiliary query. This paper focuses on sentences that describe the problem in an abstract and the feature sets that classify such problem sentences. Classification performance are evaluated by 10-fold cross-validation for six candidate sets of feature words. It turned out that the set of all words gains the best performance if 90% of the data are used as training data. However, the set of a small number of words with positive scores outperforms other feature sets, if the training data is only 10%. In such a realistic situation, the feature words are effective in improving classification performance.
UR - http://www.scopus.com/inward/record.url?scp=84873381932&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84873381932&partnerID=8YFLogxK
U2 - 10.1145/2428736.2428803
DO - 10.1145/2428736.2428803
M3 - Conference contribution
AN - SCOPUS:84873381932
SN - 9781450313063
T3 - ACM International Conference Proceeding Series
SP - 360
EP - 367
BT - 14th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2012 - Proceedings
T2 - 14th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2012
Y2 - 3 December 2012 through 5 December 2012
ER -