TY - JOUR
T1 - Can an A.I. win a medal in the mathematical olympiad?-Benchmarking mechanized mathematics on pre-university problems 1
AU - Matsuzaki, Takuya
AU - Iwane, Hidenao
AU - Kobayashi, Munehiro
AU - Zhan, Yiyang
AU - Fukasaku, Ryoya
AU - Kudo, Jumma
AU - Anai, Hirokazu
AU - Arai, Noriko H.
N1 - Publisher Copyright:
© 2018 - IOS Press and the authors. All rights reserved.
PY - 2018
Y1 - 2018
N2 - This paper introduces a benchmark problem library for mechanized mathematics including computer algebra and automated theorem proving. The library consists of pre-university mathematical problems taken from exercise problem books, university entrance examinations, and the International Mathematical Olympiads. The subject areas include real algebra, geometry, number theory, pre-calculus, calculus, and combinatorics. It thus includes problems in various areas of pre-university mathematics and with a variety of difficulty. Unlike other existing benchmark libraries, this one contains problems that are formalized so that they are obtainable as the result of mechanical translation of the original problems expressed in natural language. In other words, the library is designed to support the integration of the technologies of mechanized mathematics and natural language processing towards the goal of end-To-end automatic mathematical problem solving. The paper also presents preliminary experimental results of our prototype reasoning component of an end-To-end system on the library. The library is publicly available through the Internet.
AB - This paper introduces a benchmark problem library for mechanized mathematics including computer algebra and automated theorem proving. The library consists of pre-university mathematical problems taken from exercise problem books, university entrance examinations, and the International Mathematical Olympiads. The subject areas include real algebra, geometry, number theory, pre-calculus, calculus, and combinatorics. It thus includes problems in various areas of pre-university mathematics and with a variety of difficulty. Unlike other existing benchmark libraries, this one contains problems that are formalized so that they are obtainable as the result of mechanical translation of the original problems expressed in natural language. In other words, the library is designed to support the integration of the technologies of mechanized mathematics and natural language processing towards the goal of end-To-end automatic mathematical problem solving. The paper also presents preliminary experimental results of our prototype reasoning component of an end-To-end system on the library. The library is publicly available through the Internet.
UR - http://www.scopus.com/inward/record.url?scp=85047195529&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85047195529&partnerID=8YFLogxK
U2 - 10.3233/AIC-180762
DO - 10.3233/AIC-180762
M3 - Article
AN - SCOPUS:85047195529
SN - 0921-7126
VL - 31
SP - 251
EP - 266
JO - AI Communications
JF - AI Communications
IS - 3
ER -