1. Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning
- Author
-
Alexandre Tkatchenko, Klaus-Robert Müller, Matthias Rupp, and O. Anatole von Lilienfeld
- Subjects
FOS: Computer and information sciences ,Physics [G04] [Physical, chemical, mathematical & earth Sciences] ,General Physics and Astronomy ,FOS: Physical sciences ,Machine Learning (stat.ML) ,Machine learning ,computer.software_genre ,Cross-validation ,Organic molecules ,Schrödinger equation ,Set (abstract data type) ,symbols.namesake ,Statistics - Machine Learning ,Physics - Chemical Physics ,Physics ,Chemical Physics (physics.chem-ph) ,Condensed Matter - Materials Science ,business.industry ,Materials Science (cond-mat.mtrl-sci) ,Regression analysis ,Disordered Systems and Neural Networks (cond-mat.dis-nn) ,Condensed Matter - Disordered Systems and Neural Networks ,Potential energy ,Nonlinear system ,Physique [G04] [Physique, chimie, mathématiques & sciences de la terre] ,symbols ,Density functional theory ,Artificial intelligence ,business ,computer - Abstract
We introduce a machine learning model to predict atomization energies of a diverse set of organic molecules, based on nuclear charges and atomic positions only. The problem of solving the molecular Schrodinger equation is mapped onto a nonlinear statistical regression problem of reduced complexity. Regression models are trained on and compared to atomization energies computed with hybrid density-functional theory. Cross validation over more than seven thousand organic molecules yields a mean absolute error of similar to 10 kcal/mol. Applicability is demonstrated for the prediction of molecular atomization potential energy curves.
- Published
- 2012