Efficient Implementation of Gaussian Process Regression Accelerated Saddle Point Searches with Application to Molecular Reactions

Rohit Goswami; Maxim Masterov; Satish Kamath; Alejandro Peña-Torres; Hannes Jónsson

doi:10.1021/acs.jctc.5c00866

What is it about?

Finding transition states -- the atomic configurations at energy barriers between reactant and product -- requires many evaluations of quantum mechanical energies and forces, each of which can take hours. Gaussian process (GP) regression can reduce the number of these evaluations by building a surrogate energy surface on the fly, but previous implementations were slow to set up and hard to use. We reimplemented the GP-accelerated dimer method for the EON saddle point search code, focusing on practical performance. The implementation works in simple Cartesian coordinates rather than requiring specialized internal coordinates, yet achieves comparable accuracy. On a benchmark of 238 molecular reactions, the GP approach needs roughly one tenth the force evaluations of the standard dimer method. The code is publicly available in the EON software package (https://eondocs.org) and interoperates with standard electronic structure codes.

Photo by New Material on Unsplash

Why is it important?

Saddle point searches are a bottleneck in computational studies of reaction mechanisms, catalysis, and diffusion. Each search can require hundreds of force evaluations at the DFT or coupled-cluster level. Reducing that count by an order of magnitude makes it feasible to survey reaction networks that were previously too expensive. The Cartesian-coordinate formulation matters because it avoids the coordinate-transformation machinery that made earlier GP implementations fragile for complex systems. This simplicity also makes the method easier to couple with new electronic structure codes or machine-learned potentials. The benchmark on 238 diverse reactions provides a quantitative baseline for future methods.

Perspectives

This was technically one of my first PhD projects. We took an existing idea -- GP acceleration of the dimer method -- and focused on making it actually work reliably across a broad set of systems. The collaboration with SURFsara (the Dutch national computing center) helped us benchmark at scale. Most of the work went into practical engineering: robust convergence criteria, sensible defaults, and integration with EON's existing infrastructure. The decision to stay in Cartesian coordinates, despite the theoretical appeal of internal coordinates, came from testing on real systems where coordinate transformations introduced more problems than they solved. The resulting code became the baseline that our later OT-GP work (the adaptive pruning paper) improved upon.
Rohit Goswami
University of Iceland

This page is a summary of: Efficient Implementation of Gaussian Process Regression Accelerated Saddle Point Searches with Application to Molecular Reactions, Journal of Chemical Theory and Computation, August 2025, American Chemical Society (ACS),
DOI: 10.1021/acs.jctc.5c00866.
You can read the full text:

Read

Resources

Contributors

The following have contributed to this page

Rohit Goswami
University of Iceland

GP-accelerated saddle point searches with 10x fewer force evaluations

What is it about?

Why is it important?

Perspectives

Resources

Materials cloud archive

Github reproducer

Contributors

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

GP-accelerated saddle point searches with 10x fewer force evaluations

What is it about?

Featured Image

Why is it important?

Perspectives

Read the Original

Resources

Materials cloud archive

Github reproducer

Contributors

Share this page:

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management