Many Molecular Properties from One Kernel in Chemical Space

Authors

  • Raghunathan Ramakrishnan Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials (MARVEL), Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
  • O. Anatole von Lilienfeld Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials (MARVEL), Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland; Argonne Leadership Computing Facility, Argonne National Laboratory, 9700 S. Cass Avenue, Lemont, IL 60439, USA. anatole.vonlilienfeld@unibas.ch

DOI:

https://doi.org/10.2533/chimia.2015.182

Keywords:

Chemical space, Kernel ridge regression, Machine learning, Molecular properties, Quantum chemistry

Abstract

We introduce property-independent kernels for machine learning models of arbitrarily many molecular properties. The kernels encode molecular structures for training sets of varying size, as well as similarity measures sufficiently diffuse in chemical space to sample over all training molecules. When provided with the corresponding molecular reference properties, they enable the instantaneous generation of machine learning models which can be systematically improved through the addition of more data. This idea is exemplified for single kernel based modeling of internal energy, enthalpy, free energy, heat capacity, polarizability, electronic spread, zero-point vibrational energy, energies of frontier orbitals, HOMO-LUMO gap, and the highest fundamental vibrational wavenumber. Models of these properties are trained and tested using 112,000 organic molecules of similar size. The resulting models are discussed as well as the kernels' use for generating and using other property models.

Downloads

Published

2015-04-29