TY - JOUR
T1 - A convolutional neural network-based regression model to infer the epigenetic crosstalk responsible for CG methylation patterns
AU - Au Yeung, Wan Kin
AU - Maruyama, Osamu
AU - Sasaki, Hiroyuki
N1 - Funding Information:
This work was supported by a JSPS KAKENHI grant to H.S. and O.M. (JP18H05214). The funding bodies did not play any roles in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Funding Information:
We thank Motoko Unoki, Takashi Ishiuchi, Hidehiro Toh, Chaoqing Wen, Ryo Shimizu and Tomoyo Taniguchi (Kyushu University) for helpful discussion and technical assistance.
Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12
Y1 - 2021/12
N2 - Background: Epigenetic modifications, including CG methylation (a major form of DNA methylation) and histone modifications, interact with each other to shape their genomic distribution patterns. However, the entire picture of the epigenetic crosstalk regulating the CG methylation pattern is unknown especially in cells that are available only in a limited number, such as mammalian oocytes. Most machine learning approaches developed so far aim at finding DNA sequences responsible for the CG methylation patterns and were not tailored for studying the epigenetic crosstalk. Results: We built a machine learning model named epiNet to predict CG methylation patterns based on other epigenetic features, such as histone modifications, but not DNA sequence. Using epiNet, we identified biologically relevant epigenetic crosstalk between histone H3K36me3, H3K4me3, and CG methylation in mouse oocytes. This model also predicted the altered CG methylation pattern of mutant oocytes having perturbed histone modification, was applicable to cross-species prediction of the CG methylation pattern of human oocytes, and identified the epigenetic crosstalk potentially important in other cell types. Conclusions: Our findings provide insight into the epigenetic crosstalk regulating the CG methylation pattern in mammalian oocytes and other cells. The use of epiNet should help to design or complement biological experiments in epigenetics studies.
AB - Background: Epigenetic modifications, including CG methylation (a major form of DNA methylation) and histone modifications, interact with each other to shape their genomic distribution patterns. However, the entire picture of the epigenetic crosstalk regulating the CG methylation pattern is unknown especially in cells that are available only in a limited number, such as mammalian oocytes. Most machine learning approaches developed so far aim at finding DNA sequences responsible for the CG methylation patterns and were not tailored for studying the epigenetic crosstalk. Results: We built a machine learning model named epiNet to predict CG methylation patterns based on other epigenetic features, such as histone modifications, but not DNA sequence. Using epiNet, we identified biologically relevant epigenetic crosstalk between histone H3K36me3, H3K4me3, and CG methylation in mouse oocytes. This model also predicted the altered CG methylation pattern of mutant oocytes having perturbed histone modification, was applicable to cross-species prediction of the CG methylation pattern of human oocytes, and identified the epigenetic crosstalk potentially important in other cell types. Conclusions: Our findings provide insight into the epigenetic crosstalk regulating the CG methylation pattern in mammalian oocytes and other cells. The use of epiNet should help to design or complement biological experiments in epigenetics studies.
UR - http://www.scopus.com/inward/record.url?scp=85108850216&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85108850216&partnerID=8YFLogxK
U2 - 10.1186/s12859-021-04272-8
DO - 10.1186/s12859-021-04272-8
M3 - Article
C2 - 34162326
AN - SCOPUS:85108850216
SN - 1471-2105
VL - 22
JO - BMC bioinformatics
JF - BMC bioinformatics
IS - 1
M1 - 341
ER -