TY - JOUR
T1 - Identification of plant vacuole proteins by using graph neural network and contact maps
AU - Sui, Jianan
AU - Chen, Jiazi
AU - Chen, Yuehui
AU - Iwamori, Naoki
AU - Sun, Jin
N1 - Publisher Copyright:
© 2023, BioMed Central Ltd., part of Springer Nature.
PY - 2023/12
Y1 - 2023/12
N2 - Plant vacuoles are essential organelles in the growth and development of plants, and accurate identification of their proteins is crucial for understanding their biological properties. In this study, we developed a novel model called GraphIdn for the identification of plant vacuole proteins. The model uses SeqVec, a deep representation learning model, to initialize the amino acid sequence. We utilized the AlphaFold2 algorithm to obtain the structural information of corresponding plant vacuole proteins, and then fed the calculated contact maps into a graph convolutional neural network. GraphIdn achieved accuracy values of 88.51% and 89.93% in independent testing and fivefold cross-validation, respectively, outperforming previous state-of-the-art predictors. As far as we know, this is the first model to use predicted protein topology structure graphs to identify plant vacuole proteins. Furthermore, we assessed the effectiveness and generalization capability of our GraphIdn model by applying it to identify and locate peroxisomal proteins, which yielded promising outcomes. The source code and datasets can be accessed at https://github.com/SJNNNN/GraphIdn .
AB - Plant vacuoles are essential organelles in the growth and development of plants, and accurate identification of their proteins is crucial for understanding their biological properties. In this study, we developed a novel model called GraphIdn for the identification of plant vacuole proteins. The model uses SeqVec, a deep representation learning model, to initialize the amino acid sequence. We utilized the AlphaFold2 algorithm to obtain the structural information of corresponding plant vacuole proteins, and then fed the calculated contact maps into a graph convolutional neural network. GraphIdn achieved accuracy values of 88.51% and 89.93% in independent testing and fivefold cross-validation, respectively, outperforming previous state-of-the-art predictors. As far as we know, this is the first model to use predicted protein topology structure graphs to identify plant vacuole proteins. Furthermore, we assessed the effectiveness and generalization capability of our GraphIdn model by applying it to identify and locate peroxisomal proteins, which yielded promising outcomes. The source code and datasets can be accessed at https://github.com/SJNNNN/GraphIdn .
UR - http://www.scopus.com/inward/record.url?scp=85171862085&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85171862085&partnerID=8YFLogxK
U2 - 10.1186/s12859-023-05475-x
DO - 10.1186/s12859-023-05475-x
M3 - Article
C2 - 37740195
AN - SCOPUS:85171862085
SN - 1471-2105
VL - 24
JO - BMC bioinformatics
JF - BMC bioinformatics
IS - 1
M1 - 357
ER -