TY - JOUR
T1 - Structural Class Classification of 3D Protein Structure Based on Multi-View 2D Images
AU - Suryanto, Chendra Hadi
AU - Saigo, Hiroto
AU - Fukui, Kazuhiro
N1 - Funding Information:
The authors would like to thank Prof. Hideitsu Hino for discussions and suggestions. This work was supported by JSPS KAKENHI Grant Number 25540062 and 25700004.
Publisher Copyright:
© 2016 IEEE.
PY - 2018/1/1
Y1 - 2018/1/1
N2 - Computing similarity or dissimilarity between protein structures is an important task in structural biology. A conventional method to compute protein structure dissimilarity requires structural alignment of the proteins. However, defining one best alignment is difficult, especially when the structures are very different. In this paper, we propose a new similarity measure for protein structure comparisons using a set of multi-view 2D images of 3D protein structures. In this approach, each protein structure is represented by a subspace from the image set. The similarity between two protein structures is then characterized by the canonical angles between the two subspaces. The primary advantage of our method is that precise alignment is not needed. We employed Grassmann Discriminant Analysis (GDA) as the subspace-based learning in the classification framework. We applied our method for the classification problem of seven SCOP structural classes of protein 3D structures. The proposed method outperformed the k-nearest neighbor method (k-NN) based on conventional alignment-based methods CE, FATCAT, and TM-align. Our method was also applied to the classification of SCOP folds of membrane proteins, where the proposed method could recognize the fold HEM-binding four-helical bundle (f.21) much better than TM-Align.
AB - Computing similarity or dissimilarity between protein structures is an important task in structural biology. A conventional method to compute protein structure dissimilarity requires structural alignment of the proteins. However, defining one best alignment is difficult, especially when the structures are very different. In this paper, we propose a new similarity measure for protein structure comparisons using a set of multi-view 2D images of 3D protein structures. In this approach, each protein structure is represented by a subspace from the image set. The similarity between two protein structures is then characterized by the canonical angles between the two subspaces. The primary advantage of our method is that precise alignment is not needed. We employed Grassmann Discriminant Analysis (GDA) as the subspace-based learning in the classification framework. We applied our method for the classification problem of seven SCOP structural classes of protein 3D structures. The proposed method outperformed the k-nearest neighbor method (k-NN) based on conventional alignment-based methods CE, FATCAT, and TM-align. Our method was also applied to the classification of SCOP folds of membrane proteins, where the proposed method could recognize the fold HEM-binding four-helical bundle (f.21) much better than TM-Align.
UR - http://www.scopus.com/inward/record.url?scp=85041929058&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85041929058&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2016.2603987
DO - 10.1109/TCBB.2016.2603987
M3 - Article
C2 - 28113600
AN - SCOPUS:85041929058
SN - 1545-5963
VL - 15
SP - 286
EP - 299
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 1
M1 - 7555308
ER -