TY - JOUR
T1 - Learning reaction coordinates via cross-entropy minimization
T2 - Application to alanine dipeptide
AU - Mori, Yusuke
AU - Okazaki, Kei Ichi
AU - Mori, Toshifumi
AU - Kim, Kang
AU - Matubayasi, Nobuyuki
N1 - Publisher Copyright:
© 2020 Author(s).
PY - 2020/8/7
Y1 - 2020/8/7
N2 - We propose a cross-entropy minimization method for finding the reaction coordinate from a large number of collective variables in complex molecular systems. This method is an extension of the likelihood maximization approach describing the committor function with a sigmoid. By design, the reaction coordinate as a function of various collective variables is optimized such that the distribution of the committor pB∗ values generated from molecular dynamics simulations can be described in a sigmoidal manner. We also introduce the L2-norm regularization used in the machine learning field to prevent overfitting when the number of considered collective variables is large. The current method is applied to study the isomerization of alanine dipeptide in vacuum, where 45 dihedral angles are used as candidate variables. The regularization parameter is determined by cross-validation using training and test datasets. It is demonstrated that the optimal reaction coordinate involves important dihedral angles, which are consistent with the previously reported results. Furthermore, the points with pB*∼0.5 clearly indicate a separatrix distinguishing reactant and product states on the potential of mean force using the extracted dihedral angles.
AB - We propose a cross-entropy minimization method for finding the reaction coordinate from a large number of collective variables in complex molecular systems. This method is an extension of the likelihood maximization approach describing the committor function with a sigmoid. By design, the reaction coordinate as a function of various collective variables is optimized such that the distribution of the committor pB∗ values generated from molecular dynamics simulations can be described in a sigmoidal manner. We also introduce the L2-norm regularization used in the machine learning field to prevent overfitting when the number of considered collective variables is large. The current method is applied to study the isomerization of alanine dipeptide in vacuum, where 45 dihedral angles are used as candidate variables. The regularization parameter is determined by cross-validation using training and test datasets. It is demonstrated that the optimal reaction coordinate involves important dihedral angles, which are consistent with the previously reported results. Furthermore, the points with pB*∼0.5 clearly indicate a separatrix distinguishing reactant and product states on the potential of mean force using the extracted dihedral angles.
UR - http://www.scopus.com/inward/record.url?scp=85089261755&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089261755&partnerID=8YFLogxK
U2 - 10.1063/5.0009066
DO - 10.1063/5.0009066
M3 - Article
C2 - 32770909
AN - SCOPUS:85089261755
SN - 0021-9606
VL - 153
JO - Journal of Chemical Physics
JF - Journal of Chemical Physics
IS - 5
M1 - 054115
ER -