TY - JOUR
T1 - Learning reaction coordinates via cross-entropy minimization
T2 - Application to alanine dipeptide
AU - Mori, Yusuke
AU - Okazaki, Kei Ichi
AU - Mori, Toshifumi
AU - Kim, Kang
AU - Matubayasi, Nobuyuki
N1 - Funding Information:
The authors thank Shinji Saito and Takenobu Nakamura for helpful discussions. This work was partially supported by the JSPS KAKENHI [Grant Nos. JP18H02415 (K.-i.O.), JP18K05049 (T.M.), JP18H01188 (K.K.), JP20H05221 (K.K.), and JP19H04206 (N.M.)]. T.M. and K.K. thank the support from the KAKENHI Innovative Area “Studying the Function of Soft Molecular Systems by the Concerted Use of Theory and Experiment.” K.-i.O. was supported by the Building of Consortia for the Development of Human Resources in Science and Technology, MEXT, Japan. This work was also partially supported by the Fugaku Supercomputing Project and the Elements Strategy Initiative for Catalysts and Batteries (Grant No. JPMXP0112101003) from the Ministry of Education, Culture, Sports, Science, and Technology. The numerical calculations were performed at the Research Center of Computational Science, Okazaki Research Facilities, National Institutes of Natural Sciences, Japan.
Publisher Copyright:
© 2020 Author(s).
PY - 2020/8/7
Y1 - 2020/8/7
N2 - We propose a cross-entropy minimization method for finding the reaction coordinate from a large number of collective variables in complex molecular systems. This method is an extension of the likelihood maximization approach describing the committor function with a sigmoid. By design, the reaction coordinate as a function of various collective variables is optimized such that the distribution of the committor pB∗ values generated from molecular dynamics simulations can be described in a sigmoidal manner. We also introduce the L2-norm regularization used in the machine learning field to prevent overfitting when the number of considered collective variables is large. The current method is applied to study the isomerization of alanine dipeptide in vacuum, where 45 dihedral angles are used as candidate variables. The regularization parameter is determined by cross-validation using training and test datasets. It is demonstrated that the optimal reaction coordinate involves important dihedral angles, which are consistent with the previously reported results. Furthermore, the points with pB*∼0.5 clearly indicate a separatrix distinguishing reactant and product states on the potential of mean force using the extracted dihedral angles.
AB - We propose a cross-entropy minimization method for finding the reaction coordinate from a large number of collective variables in complex molecular systems. This method is an extension of the likelihood maximization approach describing the committor function with a sigmoid. By design, the reaction coordinate as a function of various collective variables is optimized such that the distribution of the committor pB∗ values generated from molecular dynamics simulations can be described in a sigmoidal manner. We also introduce the L2-norm regularization used in the machine learning field to prevent overfitting when the number of considered collective variables is large. The current method is applied to study the isomerization of alanine dipeptide in vacuum, where 45 dihedral angles are used as candidate variables. The regularization parameter is determined by cross-validation using training and test datasets. It is demonstrated that the optimal reaction coordinate involves important dihedral angles, which are consistent with the previously reported results. Furthermore, the points with pB*∼0.5 clearly indicate a separatrix distinguishing reactant and product states on the potential of mean force using the extracted dihedral angles.
UR - http://www.scopus.com/inward/record.url?scp=85089261755&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089261755&partnerID=8YFLogxK
U2 - 10.1063/5.0009066
DO - 10.1063/5.0009066
M3 - Article
C2 - 32770909
AN - SCOPUS:85089261755
SN - 0021-9606
VL - 153
JO - Journal of Chemical Physics
JF - Journal of Chemical Physics
IS - 5
M1 - 054115
ER -