Data-Driven Modeling of Biomolecular Dynamics under Physical and Experimental Constraints
Doctoral thesis, 2026

Biophysical experiments and biomolecular simulations report on the same processes that govern biological function, yet each provides only a partial view.
Experiments offer unbiased but sparse ensemble measurements, while simulations provide atomic detail but suffer from force-field biases and finite sampling. Integrating both into a single model is essential for a complete picture of biomolecular dynamics. This thesis develops the mathematical foundations for this integration through constrained statistical inference using the transfer operator framework and observables as the common language between experiments and simulations, comprising five papers in total. A survey chapter reviews machine learning approaches to molecular dynamics and identifies the absence of experimental consistency mechanisms as a common limitation. An empirical study then predicts dynamics-sensitive experiments from simulation data, revealing significant discrepancies that establish the need for integrative approaches. To address this, dynamic Augmented Markov Models (dynAMMo) are introduced, incorporating experimental measurements as constraints to recover slow exchange processes even from incomplete simulations. The Minimum-Excess-Work framework pushes the integration upstream, offering a method that incorporates stationary observables into probability flow generative models. Finally, applying dynAMMo to an engineered lipocalin demonstrates that these methods yield mechanistic insights that neither simulations nor experiments alone could provide. Together, these contributions progressively shift from comparing experimental and computational data, to correcting kinetic models post hoc, to guiding the generative process itself.

Markov state models

AI4Science

transfer operators

generative modeling

NMR relaxation dispersion

constrained statistical inference

biomolecular dynamics

integrative structural biology

EDIT EL53
Opponent: Matteo Degiacomi, Edinburgh University, United Kingdom

Author

Christopher Kara

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Machine Learning in Molecular Dynamics Simulations of Biomolecular Systems

Comprehensive Computational Chemistry,;Vol. 3(2024)p. 475-492

Book chapter

Rescuing Off-Equilibrium Simulation Data through Dynamic Experimental Data with dynAMMo

Machine Learning: Science and Technology,;Vol. 4(2023)

Journal article

Minimum-Excess-Work Guidance: Score-Based Sampling with Experimental Data or Sparse Restraints

Areas of Advance

Information and Communication Technology

Subject Categories (SSIF 2025)

Bioinformatics (Computational Biology)

Roots

Basic sciences

Infrastructure

Chalmers e-Commons (incl. C3SE, 2020-)

DOI

10.63959/chalmers.dt/5895

ISBN

978-91-8103-438-7

Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5895

Publisher

Chalmers

EDIT EL53

Online

Opponent: Matteo Degiacomi, Edinburgh University, United Kingdom

More information

Latest update

5/18/2026