Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Binder

Machine learning potentials

Why settle on a functional form?

The potential energy Epot(x)E_\mathrm{pot}(\mathbf{x}) has many complex features:

  • Number of minima exponential on N

  • Multiple transition paths

  • Features at different length and energy scales

Approaches to approximate the true potential energy function:

  • Quantum-mechanical calculations would be very accurate, but are way too expensive

  • The classical forcefields from the previous lectures are cheap but only a crude approximation

  • Machine learning can approximate a region of the function close to QM accuracy but with much less expensive function evaluations

Machine learning

Regression: Predict the continuous outcome/function value given some new unseen input

Step 1: Collect data (measurements, calculations)

ML_1.png

Step 2: Fit model (find best parameters of function)

ML_2.png

Step 3: Apply model, e.g. y(2.0)=2.1

Machine-learning (ML) potentials

Data: EPot(x)E_\mathrm{Pot}(\mathbf{x}) ... potential energy as a function of the atomic coordinates from QM calculations for many relevant conformations (+ optionally atomic forces f\mathbf{f})\[0.07cm]

ML_4.png

Problem: Learned potential energy would not be permuation, translation, or rotation invariant!

Assumption 1: EPot(x)E_\mathrm{Pot}(\mathbf{x}) is approximately additive with respect to the atomic contributions x1,x2,...,xN\mathbf{x}_1, \mathbf{x}_2, ..., \mathbf{x}_N

ML_5.png

Problem: Learned potential energy would not be translation or rotation invariant!

Assumption 2: The atomic contribution EPot,N(x)E_{\mathrm{Pot},N}(\mathbf{x}) can also be learned from EPot,N(qN)E_{\mathrm{Pot},N}(\mathbf{q_N}) with qN\mathbf{q_N} being a descriptor representation of the local environment around atom N

ML_6.png

Problem: We must choose/design suitable descriptors

How to design a descriptor

Descriptors must be invariant with respect to translation, and invariant or equivariant with respect to rotation.

Simplest choice: Two-body (atom + distance information), or three-body (atom + distance + angle information)

ML_7.png

Usually: Many-body representation (all neighboring atoms up to a cutoff) with a radial part and an angular part (usually based on spherical harmonics):

images_large_ar0c00868_0001.jpeg Zubatiuk & Isayev (2021)

Descriptor-based architectures

Design of descriptors is still a work in progress, many variants published recently:

  • Atom-centered symmetry functions (ACSFs)

  • Smooth overlap of atomic positions (SOAP)

  • Eigenvalues of the Coulomb matrix

Exemplar architectures:

  • GAP: Gaussian process regression on SOAP descriptors

  • NeuralIL: Residual neural network on Bessel descriptors

  • ANI: Neural network on ACSFs

Learned descriptors

So far, we have custom-made the descriptors, relying on expert knowledge which information is important:

ML_8.png

Instead, we can learn the descriptor from the data using message passing (= another model):

ML_9.png

Evolution of learned descriptors:

  • 2-Body (atoms I, J) invariant message passing with scalar features: Very basic learned features

  • Multi-body messages including also triples (atoms I, J, K) or more: Can capture more information

  • 2-Body (atoms I, J) equivariant message passing with scalar and vectorial features: Can also capture directional information

  • Combination of equivariant message-passing with multi-body messages: Currently one of the best models

What about the long-range part?

Previous methods only use local descriptors. What about long-range forces though?

Thre is commonly accepted approach yet. Some possibilities:

  • Fixed atomic multipoles (charges, dipoles, etc)

  • Machine-learning method for the charge density and the derived potential energy

  • Project long-range part onto local descriptors

  • Ewald-type ML

ml_ewald.jpg proceedings.mlr.press/v202/kosmala23a.html

Disadvantages/Challenges

The development of machine-learned force fields is far from finished, and many challenges remain. Maybe you want to contribute?

  • No functional form means no safety net - predictions can get arbitrarily different from the ground truth. Predictions can thus even have a wrong sign, i.e. be attractive instead of repulsive

  • Generalization ability often poor

  • High dependence on training set

  • Currently promising results after some finetuning, but not off-the-bench

References
  1. Zubatiuk, T., & Isayev, O. (2021). Development of Multimodal Machine Learning Potentials: Toward a Physics-Aware Artificial Intelligence. Accounts of Chemical Research, 54(7), 1575–1585. 10.1021/acs.accounts.0c00868