TY - JOUR
T1 - E-zyme
T2 - Predicting potential EC numbers from the chemical transformation pattern of substrate-product pairs
AU - Yamanishi, Yoshihiro
AU - Hattori, Masahiro
AU - Kotera, Masaaki
AU - Goto, Susumu
AU - Kanehisa, Minoru
N1 - Funding Information:
Funding: Ministry of Education, Culture, Sports, Science and Technology of Japan, and the Japan Science and Technology Agency.
PY - 2009
Y1 - 2009
N2 - Motivation: The IUBMB's Enzyme Nomenclature system, commonly known as the Enzyme Commission (EC) numbers, plays key roles in classifying enzymatic reactions and in linking the enzyme genes or proteins to reactions in metabolic pathways. There are numerous reactions known to be present in various pathways but without any official EC numbers, most of which have no hope to be given ones because of the lack of the published articles on enzyme assays. Results: In this article we propose a new method to predict the potential EC numbers to given reactant pairs (substrates and products) or uncharacterized reactions, and a web-server named E-zyme as an application. This technology is based on our original biochemical transformation pattern which we call an 'RDM pattern', and consists of three steps: (i) graph alignment of a query reactant pair (substrates and products) for computing the query RDM pattern, (ii) multi-layered partial template matching by comparing the query RDM pattern with template patterns related with known EC numbers and (iii) weighted major voting scheme for selecting appropriate EC numbers. As the result, cross-validation experiments show that the proposed method achieves both high coverage and high prediction accuracy at a practical level, and consistently outperforms the previous method.
AB - Motivation: The IUBMB's Enzyme Nomenclature system, commonly known as the Enzyme Commission (EC) numbers, plays key roles in classifying enzymatic reactions and in linking the enzyme genes or proteins to reactions in metabolic pathways. There are numerous reactions known to be present in various pathways but without any official EC numbers, most of which have no hope to be given ones because of the lack of the published articles on enzyme assays. Results: In this article we propose a new method to predict the potential EC numbers to given reactant pairs (substrates and products) or uncharacterized reactions, and a web-server named E-zyme as an application. This technology is based on our original biochemical transformation pattern which we call an 'RDM pattern', and consists of three steps: (i) graph alignment of a query reactant pair (substrates and products) for computing the query RDM pattern, (ii) multi-layered partial template matching by comparing the query RDM pattern with template patterns related with known EC numbers and (iii) weighted major voting scheme for selecting appropriate EC numbers. As the result, cross-validation experiments show that the proposed method achieves both high coverage and high prediction accuracy at a practical level, and consistently outperforms the previous method.
UR - http://www.scopus.com/inward/record.url?scp=66349127228&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=66349127228&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btp223
DO - 10.1093/bioinformatics/btp223
M3 - Article
C2 - 19477985
AN - SCOPUS:66349127228
SN - 1367-4803
VL - 25
SP - i179-i186
JO - Bioinformatics
JF - Bioinformatics
IS - 12
ER -