TY - GEN
T1 - Feature extraction using restricted bootstrapping
AU - Hirokawa, Sachio
PY - 2012
Y1 - 2012
N2 - The bootstrapping method is known as an application of the PageRank technique to documents and words. It calculates word scores by mutually propagating scores between words and documents. However, the result sometimes drifts far from the initial query word, a problem known as "topic drift". This paper proposes restricting the words to the top t words at each step of the bootstrapping process. The method is simpler than previously known techniques. It is applied to real bankruptcy information documents to extract the bankruptcy causes strongly related to the query. The results confirm that the method prevents topic drift.
AB - The bootstrapping method is known as an application of the PageRank technique to documents and words. It calculates word scores by mutually propagating scores between words and documents. However, the result sometimes drifts far from the initial query word, a problem known as "topic drift". This paper proposes restricting the words to the top t words at each step of the bootstrapping process. The method is simpler than previously known techniques. It is applied to real bankruptcy information documents to extract the bankruptcy causes strongly related to the query. The results confirm that the method prevents topic drift.
UR - http://www.scopus.com/inward/record.url?scp=84864051520&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864051520&partnerID=8YFLogxK
U2 - 10.1109/ICIS.2012.50
DO - 10.1109/ICIS.2012.50
M3 - Conference contribution
AN - SCOPUS:84864051520
SN - 9780769546940
T3 - Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012
SP - 283
EP - 288
BT - Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012
T2 - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012
Y2 - 30 May 2012 through 1 June 2012
ER -