Minimum-Excess-Work Guidance: Score-Based Sampling with Experimental Data or Sparse Restraints
Artikel i vetenskaplig tidskrift, 2026

Surrogate models, such as Boltzmann generators (BGs) and emulators (BEs), based on deep generative models are becoming an important tool in molecular simulation. Often, we may want to use additional external information such as sparse experimental data to refine these models. However, there is no unique way to achieve this goal. Here, we propose a method inspired by thermodynamic work from statistical mechanics to regularize the guidance of pretrained probability flow generative models (e.g., continuous normalizing flows or diffusion models) to match additional sparse information. The regularization ensures that the excess work of the guidance procedure is minimized. We developed two guiding strategies based on this method: Path Guidance, which facilitates sampling of rare transition states by concentrating probability mass on user-defined subsets, and Observable Guidance, which aligns generated distributions with experimental observables while preserving entropy. We demonstrate the framework’s versatility on two coarse-grained Boltzmann emulators, showcasing its ability to sample transition configurations and to correct systematic biases using experimental data on a variety of model protein systems. Finally, we provide bounds on the distributional differences between the guided and unguided distributions. The method bridges thermodynamic principles with modern generative architectures, offering a principled, efficient, and physics-inspired alternative to standard fine-tuning in data-scarce domains. Our results highlight improved sample efficiency and bias reduction, underscoring their applicability to molecular simulations and beyond.

Molecular modeling

Mathematical methods

Quality management

Computer simulations

Diffusion

Författare

Christopher Kara

Chalmers, Data- och informationsteknik, Data Science och AI

Tobias Höppe

Technische Universität München

Emmanouil Angelis

Technische Universität München

Jacob Mathias Schreiner

Chalmers, Data- och informationsteknik, Data Science och AI

Stefan Bauer

Technische Universität München

Andrea Dittadi

Technische Universität München

Simon Olsson

Chalmers, Data- och informationsteknik, Data Science och AI

Journal of Chemical Theory and Computation

1549-9618 (ISSN) 1549-9626 (eISSN)

Ämneskategorier (SSIF 2025)

Biofysik

Fundament

Grundläggande vetenskaper

DOI

10.1021/acs.jctc.6c00080

Mer information

Senast uppdaterat

2026-05-29