TY - JOUR
T1 - Dimensionality reduction reveals fine-scale structure in the Japanese population with consequences for polygenic risk prediction
AU - Sakaue, Saori
AU - Hirata, Jun
AU - Kanai, Masahiro
AU - Suzuki, Ken
AU - Akiyama, Masato
AU - Lai Too, Chun
AU - Arayssi, Thurayya
AU - Hammoudeh, Mohammed
AU - Al Emadi, Samar
AU - Masri, Basel K.
AU - Halabi, Hussein
AU - Badsha, Humeira
AU - Uthman, Imad W.
AU - Saxena, Richa
AU - Padyukov, Leonid
AU - Hirata, Makoto
AU - Matsuda, Koichi
AU - Murakami, Yoshinori
AU - Kamatani, Yoichiro
AU - Okada, Yukinori
N1 - Funding Information:
We sincerely thank all the participants of this study. We thank the staff of BioBank Japan Project and K. Yamamoto for supporting our analyses. UK Biobank analyses were conducted via application 31063. We thank B.M. Neale and his colleagues for providing scripts and resources for the UK Biobank analyses. This research was supported by the Tailor-Made Medical Treatment program (the BioBank Japan Project) of the Ministry of Education, Culture, Sports, Science, and Technology (MEXT), the Japan Agency for Medical Research and Development (AMED), Bioinformatics Initiative of Osaka University Graduate School of Medicine, Osaka University Center of Medical Data Science, Advanced Clinical Epidemiology Investigator’s Research Project, and National Priorities Research Program (Grant 4-344-3-105 from the Qatar National Research Fund, a member of Qatar Foundation). The MyEIRA study was financially supported by the Ministry of Health Malaysia (MRG 200512, JPP-IMR 17-017, JPP-IMR 07-046, JPP-IMR 08-006, JPP-IMR 08-012, and JPP-IMR 11-005) and by the Swedish National Research Council (DNR-348-2009-6468). We thank the Director General of Health Malaysia for supporting the work described in the MyEIRA study. Y.O. was supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI (15H05911 and 19H01021), AMED (19gm6010001h0004, 19ek0410041h0003, 19ek0109413h0001, and 19km0405211h0001), and Takeda Science Foundation. J.H. is an employee of TEIJIN PHARMA LIMITED.
Publisher Copyright:
© 2020, The Author(s).
PY - 2020/12/1
Y1 - 2020/12/1
N2 - The diversity in our genome is crucial to understanding the demographic history of worldwide populations. However, we have yet to know whether subtle genetic differences within a population can be disentangled, or whether they have an impact on complex traits. Here we apply dimensionality reduction methods (PCA, t-SNE, PCA-t-SNE, UMAP, and PCA-UMAP) to biobank-derived genomic data of a Japanese population (n = 169,719). Dimensionality reduction reveals fine-scale population structure, conspicuously differentiating adjacent insular subpopulations. We further enluciate the demographic landscape of these Japanese subpopulations using population genetics analyses. Finally, we perform phenome-wide polygenic risk score (PRS) analyses on 67 complex traits. Differences in PRS between the deconvoluted subpopulations are not always concordant with those in the observed phenotypes, suggesting that the PRS differences might reflect biases from the uncorrected structure, in a trait-dependent manner. This study suggests that such an uncorrected structure can be a potential pitfall in the clinical application of PRS.
AB - The diversity in our genome is crucial to understanding the demographic history of worldwide populations. However, we have yet to know whether subtle genetic differences within a population can be disentangled, or whether they have an impact on complex traits. Here we apply dimensionality reduction methods (PCA, t-SNE, PCA-t-SNE, UMAP, and PCA-UMAP) to biobank-derived genomic data of a Japanese population (n = 169,719). Dimensionality reduction reveals fine-scale population structure, conspicuously differentiating adjacent insular subpopulations. We further enluciate the demographic landscape of these Japanese subpopulations using population genetics analyses. Finally, we perform phenome-wide polygenic risk score (PRS) analyses on 67 complex traits. Differences in PRS between the deconvoluted subpopulations are not always concordant with those in the observed phenotypes, suggesting that the PRS differences might reflect biases from the uncorrected structure, in a trait-dependent manner. This study suggests that such an uncorrected structure can be a potential pitfall in the clinical application of PRS.
UR - http://www.scopus.com/inward/record.url?scp=85082561100&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85082561100&partnerID=8YFLogxK
U2 - 10.1038/s41467-020-15194-z
DO - 10.1038/s41467-020-15194-z
M3 - Article
C2 - 32218440
AN - SCOPUS:85082561100
SN - 2041-1723
VL - 11
JO - Nature communications
JF - Nature communications
IS - 1
M1 - 1569
ER -