TY - JOUR
T1 - Quantum chemistry-machine learning approach for predicting and elucidating molecular hyperpolarizability
T2 - Application to [2.2]paracyclophane-containing push-pull polymers
AU - Ivonina, Mariia V.
AU - Orimoto, Yuuichi
AU - Aoki, Yuriko
N1 - Funding Information:
M.V.I. sincerely thanks the Otsuka Toshimi Scholarship Foundation for their strong financial support and encouragement during the completion of doctoral courses. This research was also supported by JSPS/MEXT (KAKENHI, Grant Nos. 23245005, 16KT0059, 25810103, 15KT0146, 16K08321, and 20H00588) and JST-CREST. All the computations in this research were performed on Linux PC systems in our laboratory and high-performance computing systems at the Research Institute for Information Technology in Kyushu University.
Publisher Copyright:
© 2021 Author(s).
PY - 2021/3/28
Y1 - 2021/3/28
N2 - Nonlinear optical properties of organic chromophores are of great interest in diverse photonic and optoelectronic applications. To elucidate general trends in the behaviors of molecules, large amounts of data are required. Therefore, both an accurate and a rapid computational approach can significantly promote the theoretical design of molecules. In this work, we combined quantum chemistry and machine learning (ML) to study the first hyperpolarizability (β) in [2.2]paracyclophane-containing push-pull compounds with various terminal donor/acceptor pairs and molecular lengths. To generate reference β values for ML, the ab initio elongation finite-field method was used, allowing us to treat long polymer chains with linear scale efficiency and high computational accuracy. A neural network (NN) model was built for β prediction, and the relevant molecular descriptors were selected using a genetic algorithm. The established NN model accurately reproduced the β values (R2 > 0.99) of long molecules based on the input quantum chemical properties (dipole moment, frontier molecular orbitals, etc.) of only the shortest systems and additional information about the actual system length. To obtain general trends in molecular descriptor-target property relationships learned by the NN, three approaches for explaining the ML decisions (i.e., partial dependence, accumulated local effects, and permutation feature importance) were used. The effect of donor/acceptor alternation on β in the studied systems was examined. The asymmetric extension of molecular regions end-capped with donors and acceptors produced unequal β responses. The results revealed how the electronic properties originating from the nature of substituents on the microscale controlled the magnitude of β according to the NN approximation. The applied approach facilitates the conceptual discoveries in chemistry by using ML to both (i) efficiently generate data and (ii) provide a source of information about causal correlations among system properties.
AB - Nonlinear optical properties of organic chromophores are of great interest in diverse photonic and optoelectronic applications. To elucidate general trends in the behaviors of molecules, large amounts of data are required. Therefore, both an accurate and a rapid computational approach can significantly promote the theoretical design of molecules. In this work, we combined quantum chemistry and machine learning (ML) to study the first hyperpolarizability (β) in [2.2]paracyclophane-containing push-pull compounds with various terminal donor/acceptor pairs and molecular lengths. To generate reference β values for ML, the ab initio elongation finite-field method was used, allowing us to treat long polymer chains with linear scale efficiency and high computational accuracy. A neural network (NN) model was built for β prediction, and the relevant molecular descriptors were selected using a genetic algorithm. The established NN model accurately reproduced the β values (R2 > 0.99) of long molecules based on the input quantum chemical properties (dipole moment, frontier molecular orbitals, etc.) of only the shortest systems and additional information about the actual system length. To obtain general trends in molecular descriptor-target property relationships learned by the NN, three approaches for explaining the ML decisions (i.e., partial dependence, accumulated local effects, and permutation feature importance) were used. The effect of donor/acceptor alternation on β in the studied systems was examined. The asymmetric extension of molecular regions end-capped with donors and acceptors produced unequal β responses. The results revealed how the electronic properties originating from the nature of substituents on the microscale controlled the magnitude of β according to the NN approximation. The applied approach facilitates the conceptual discoveries in chemistry by using ML to both (i) efficiently generate data and (ii) provide a source of information about causal correlations among system properties.
UR - http://www.scopus.com/inward/record.url?scp=85103290051&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85103290051&partnerID=8YFLogxK
U2 - 10.1063/5.0040342
DO - 10.1063/5.0040342
M3 - Article
C2 - 33810676
AN - SCOPUS:85103290051
SN - 0021-9606
VL - 154
JO - Journal of Chemical Physics
JF - Journal of Chemical Physics
IS - 12
M1 - 124107
ER -