TY - JOUR
T1 - DeepSuccinylSite
T2 - A deep learning based approach for protein succinylation site prediction
AU - Thapa, Niraj
AU - Chaudhari, Meenal
AU - McManus, Sean
AU - Roy, Kaushik
AU - Newman, Robert H.
AU - Saigo, Hiroto
AU - Kc, Dukka B.
N1 - Funding Information:
This work was supported by National Science Foundation (NSF) grant nos. 1901793, 1564606 and 1901086 (to DK). RHN is supported by an HBCU-UP Excellence in Research Award from NSF (1901793) and an SC1 Award from the National Institutes of Health National Institute of General Medical Science (5SC1GM130545). HS was supported by JSPS KAKENHI Grant Numbers JP18H01762 and JP19H04176.
Publisher Copyright:
© 2020 The Author(s).
PY - 2020/4/23
Y1 - 2020/4/23
N2 - Background: Protein succinylation has recently emerged as an important and common post-translation modification (PTM) that occurs on lysine residues. Succinylation is notable both in its size (e.g., at 100 Da, it is one of the larger chemical PTMs) and in its ability to modify the net charge of the modified lysine residue from + 1 to - 1 at physiological pH. The gross local changes that occur in proteins upon succinylation have been shown to correspond with changes in gene activity and to be perturbed by defects in the citric acid cycle. These observations, together with the fact that succinate is generated as a metabolic intermediate during cellular respiration, have led to suggestions that protein succinylation may play a role in the interaction between cellular metabolism and important cellular functions. For instance, succinylation likely represents an important aspect of genomic regulation and repair and may have important consequences in the etiology of a number of disease states. In this study, we developed DeepSuccinylSite, a novel prediction tool that uses deep learning methodology along with embedding to identify succinylation sites in proteins based on their primary structure. Results: Using an independent test set of experimentally identified succinylation sites, our method achieved efficiency scores of 79%, 68.7% and 0.48 for sensitivity, specificity and MCC respectively, with an area under the receiver operator characteristic (ROC) curve of 0.8. In side-by-side comparisons with previously described succinylation predictors, DeepSuccinylSite represents a significant improvement in overall accuracy for prediction of succinylation sites. Conclusion: Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein succinylation.
AB - Background: Protein succinylation has recently emerged as an important and common post-translation modification (PTM) that occurs on lysine residues. Succinylation is notable both in its size (e.g., at 100 Da, it is one of the larger chemical PTMs) and in its ability to modify the net charge of the modified lysine residue from + 1 to - 1 at physiological pH. The gross local changes that occur in proteins upon succinylation have been shown to correspond with changes in gene activity and to be perturbed by defects in the citric acid cycle. These observations, together with the fact that succinate is generated as a metabolic intermediate during cellular respiration, have led to suggestions that protein succinylation may play a role in the interaction between cellular metabolism and important cellular functions. For instance, succinylation likely represents an important aspect of genomic regulation and repair and may have important consequences in the etiology of a number of disease states. In this study, we developed DeepSuccinylSite, a novel prediction tool that uses deep learning methodology along with embedding to identify succinylation sites in proteins based on their primary structure. Results: Using an independent test set of experimentally identified succinylation sites, our method achieved efficiency scores of 79%, 68.7% and 0.48 for sensitivity, specificity and MCC respectively, with an area under the receiver operator characteristic (ROC) curve of 0.8. In side-by-side comparisons with previously described succinylation predictors, DeepSuccinylSite represents a significant improvement in overall accuracy for prediction of succinylation sites. Conclusion: Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein succinylation.
UR - http://www.scopus.com/inward/record.url?scp=85083948294&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85083948294&partnerID=8YFLogxK
U2 - 10.1186/s12859-020-3342-z
DO - 10.1186/s12859-020-3342-z
M3 - Article
C2 - 32321437
AN - SCOPUS:85083948294
SN - 1471-2105
VL - 21
JO - BMC bioinformatics
JF - BMC bioinformatics
M1 - 63
ER -