TY - GEN
T1 - A Branch-and-Bound Approach to Efficient Classification and Retrieval of Documents
AU - Ii, Kotaro
AU - Saigo, Hiroto
AU - Tabei, Yasuo
N1 - Publisher Copyright:
© 2024 by SCITEPRESS – Science and Technology Publications, Lda.
PY - 2024
Y1 - 2024
N2 - Text classification and retrieval have been crucial tasks in natural language processing. In this paper, we present novel techniques for these tasks by leveraging the invariance of feature order to the evaluation results. Building on the assumption that text retrieval or classification models have already been constructed from the training documents, we propose efficient approaches that can restrict the search space spanned by the test documents. Our approach encompasses two key contributions. The first contribution introduces an efficient method for traversing a search tree, while the second contribution involves the development of novel pruning conditions. Through computational experiments using real-world datasets, we consistently demonstrate that the proposed approach outperforms the baseline method in various scenarios, showcasing its superior speed and efficiency.
AB - Text classification and retrieval have been crucial tasks in natural language processing. In this paper, we present novel techniques for these tasks by leveraging the invariance of feature order to the evaluation results. Building on the assumption that text retrieval or classification models have already been constructed from the training documents, we propose efficient approaches that can restrict the search space spanned by the test documents. Our approach encompasses two key contributions. The first contribution introduces an efficient method for traversing a search tree, while the second contribution involves the development of novel pruning conditions. Through computational experiments using real-world datasets, we consistently demonstrate that the proposed approach outperforms the baseline method in various scenarios, showcasing its superior speed and efficiency.
UR - http://www.scopus.com/inward/record.url?scp=85190714627&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85190714627&partnerID=8YFLogxK
U2 - 10.5220/0012310600003654
DO - 10.5220/0012310600003654
M3 - Conference contribution
AN - SCOPUS:85190714627
SN - 9789897586842
T3 - International Conference on Pattern Recognition Applications and Methods
SP - 205
EP - 214
BT - Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods
A2 - Castrillon-Santana, Modesto
A2 - De Marsico, Maria
A2 - Fred, Ana
PB - Science and Technology Publications, Lda
T2 - 13th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2024
Y2 - 24 February 2024 through 26 February 2024
ER -