TY - JOUR
T1 - A Bayesian nonparametric topic model for repeated measured data
T2 - An application to prescription data
AU - Okui, Tasuku
N1 - Publisher Copyright:
© 2020, The Behaviormetric Society.
PY - 2021/1
Y1 - 2021/1
N2 - Topic models are currently used in many fields, particularly for marketing or medical science data analysis, often where an individual subject is repeatedly measured. A topic tracking model (TTM) that can consider the persistency of topics of individual subjects has been already proposed. Although the TTM estimates several parameters for each timepoint through online learning, offline learning should be utilized for analyses of preexisting data sets. Additionally, when a topic model is used, the number of topics should be decided in advance. However, deciding an appropriate number of topics is often difficult. Therefore, we propose a TTM with offline learning and a Bayesian nonparametric TTM (BNPTTM) for time-series data sets where data from individual subjects are repeated measures. The performance of the proposed topic model is evaluated using an actual prescription data set. Our results suggest that the TTM with offline learning has better predictive ability than the existing TTM, and the BNPTTM can deduce the number of topics from a given data set.
AB - Topic models are currently used in many fields, particularly for marketing or medical science data analysis, often where an individual subject is repeatedly measured. A topic tracking model (TTM) that can consider the persistency of topics of individual subjects has been already proposed. Although the TTM estimates several parameters for each timepoint through online learning, offline learning should be utilized for analyses of preexisting data sets. Additionally, when a topic model is used, the number of topics should be decided in advance. However, deciding an appropriate number of topics is often difficult. Therefore, we propose a TTM with offline learning and a Bayesian nonparametric TTM (BNPTTM) for time-series data sets where data from individual subjects are repeated measures. The performance of the proposed topic model is evaluated using an actual prescription data set. Our results suggest that the TTM with offline learning has better predictive ability than the existing TTM, and the BNPTTM can deduce the number of topics from a given data set.
UR - http://www.scopus.com/inward/record.url?scp=85087634700&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85087634700&partnerID=8YFLogxK
U2 - 10.1007/s41237-020-00117-5
DO - 10.1007/s41237-020-00117-5
M3 - Article
AN - SCOPUS:85087634700
SN - 0385-7417
VL - 48
SP - 179
EP - 190
JO - Behaviormetrika
JF - Behaviormetrika
IS - 1
ER -