TY - JOUR
T1 - Principal points analysis via p-median problem for binary data
AU - Yamashita, Haruka
AU - Kawahara, Yoshinobu
N1 - Funding Information:
This work was supported by JSPS KAKENHI [Grant number 16K16361].
Publisher Copyright:
© 2019, © 2019 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2020/5/18
Y1 - 2020/5/18
N2 - Analysis with principal points is a useful statistical tool for summarizing large data. In this paper, we propose a subgradient-based algorithm to calculate a set of principal points for multivariate binary data by the formulating it as a p-median problem. This enables us to find a globally optimal set of principal points or an ε-optimal solution in the middle of the calculation by combining an upper bound found using the greedy method. This algorithm is an iterative procedure where each iteration can be calculated in an efficient manner. We investigate the applicability of the proposed framework with questionnaire data and arXiv co-authors data.
AB - Analysis with principal points is a useful statistical tool for summarizing large data. In this paper, we propose a subgradient-based algorithm to calculate a set of principal points for multivariate binary data by the formulating it as a p-median problem. This enables us to find a globally optimal set of principal points or an ε-optimal solution in the middle of the calculation by combining an upper bound found using the greedy method. This algorithm is an iterative procedure where each iteration can be calculated in an efficient manner. We investigate the applicability of the proposed framework with questionnaire data and arXiv co-authors data.
UR - http://www.scopus.com/inward/record.url?scp=85074012296&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074012296&partnerID=8YFLogxK
U2 - 10.1080/02664763.2019.1675605
DO - 10.1080/02664763.2019.1675605
M3 - Article
AN - SCOPUS:85074012296
SN - 0266-4763
VL - 47
SP - 1282
EP - 1297
JO - Journal of Applied Statistics
JF - Journal of Applied Statistics
IS - 7
ER -