The emergence of digital textbook reading systems such as Bookroll, and their ability of recording reader interactions has opened the possibility of analyzing the students reading behaviors and characteristics. To date, several works have conducted compelling analyses characterizing the different types of students with the use of clustering ML models, while others have used supervised ML models to predict their academic performance. The main characteristic these models share is that internally they simplify the students' data into a latent representation to get an insight or make a prediction. Nevertheless, these representations are oversimplified, otherwise difficult to interpret. Accordingly, the present work explores the use of Variational Autoencoders to make more interpretable and complex latent representations. After a brief description of these models, we present and discuss the results of four explorative studies when using the LAK22 Data Challenge Workshop datasets. Our results show that the probabilistic latent representations generated by the proposed models preserve the student reading characteristics, allowing a better visual interpretation when using 3 dimensions. Also, they allow supervised regressive and classification models to have a more stable and less overfitted learning process, which also allows some of them to make better score predictions.
|CEUR Workshop Proceedings
|出版済み - 2022
|4th Workshop on Predicting Performance Based on the Analysis of Reading Behavior, DC in LAK 2022 - Virtual, Online, 米国
継続期間: 3月 22 2022 → …
!!!All Science Journal Classification (ASJC) codes