TY - GEN
T1 - Computational Analysis of Audio Recordings of Piano Performance for Automatic Evaluation
AU - Kato, Norihiro
AU - Nakamura, Eita
AU - Mine, Kyoko
AU - Doeda, Orie
AU - Yamada, Masanao
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - We developed a computational evaluation method for piano performance with the goal of building a practice support system for beginners. We recorded students’ performances as audio data and applied several recent methods for audio-to-MIDI transcription based on deep neural networks to extract the pitch, onset time, and offset time of musical notes. To determine the correctness of the performance, we aligned the extracted MIDI data with the musical score using a hidden Markov model (HMM). We compared the audio-to-MIDI transcription methods and optimized the weight on different types of performance errors to conform to teacher’s assessment. Our experiments showed a strong correlation between the rate of performance errors obtained from the alignment and the evaluation by a teacher who listened to the performance. The results that indicate performance errors and tempo stability can be used in a practice support system that provides feedback to learners.
AB - We developed a computational evaluation method for piano performance with the goal of building a practice support system for beginners. We recorded students’ performances as audio data and applied several recent methods for audio-to-MIDI transcription based on deep neural networks to extract the pitch, onset time, and offset time of musical notes. To determine the correctness of the performance, we aligned the extracted MIDI data with the musical score using a hidden Markov model (HMM). We compared the audio-to-MIDI transcription methods and optimized the weight on different types of performance errors to conform to teacher’s assessment. Our experiments showed a strong correlation between the rate of performance errors obtained from the alignment and the evaluation by a teacher who listened to the performance. The results that indicate performance errors and tempo stability can be used in a practice support system that provides feedback to learners.
UR - http://www.scopus.com/inward/record.url?scp=85172030645&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85172030645&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-42682-7_46
DO - 10.1007/978-3-031-42682-7_46
M3 - Conference contribution
AN - SCOPUS:85172030645
SN - 9783031426810
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 586
EP - 592
BT - Responsive and Sustainable Educational Futures - 18th European Conference on Technology Enhanced Learning, EC-TEL 2023, Proceedings
A2 - Viberg, Olga
A2 - Jivet, Ioana
A2 - Muñoz-Merino, Pedro J.
A2 - Perifanou, Maria
A2 - Papathoma, Tina
PB - Springer Science and Business Media Deutschland GmbH
T2 - Proceedings of the 18th European Conference on Technology Enhanced Learning, ECTEL 2023
Y2 - 4 September 2023 through 8 September 2023
ER -