Data-Driven Modeling of Biomolecular Dynamics under Physical and Experimental Constraints
Doctoral thesis, 2026
Experiments offer unbiased but sparse ensemble measurements, while simulations provide atomic detail but suffer from force-field biases and finite sampling. Integrating both into a single model is essential for a complete picture of biomolecular dynamics. This thesis develops the mathematical foundations for this integration through constrained statistical inference using the transfer operator framework and observables as the common language between experiments and simulations, comprising five papers in total. A survey chapter reviews machine learning approaches to molecular dynamics and identifies the absence of experimental consistency mechanisms as a common limitation. An empirical study then predicts dynamics-sensitive experiments from simulation data, revealing significant discrepancies that establish the need for integrative approaches. To address this, dynamic Augmented Markov Models (dynAMMo) are introduced, incorporating experimental measurements as constraints to recover slow exchange processes even from incomplete simulations. The Minimum-Excess-Work framework pushes the integration upstream, offering a method that incorporates stationary observables into probability flow generative models. Finally, applying dynAMMo to an engineered lipocalin demonstrates that these methods yield mechanistic insights that neither simulations nor experiments alone could provide. Together, these contributions progressively shift from comparing experimental and computational data, to correcting kinetic models post hoc, to guiding the generative process itself.
Markov state models
AI4Science
transfer operators
generative modeling
NMR relaxation dispersion
constrained statistical inference
biomolecular dynamics
integrative structural biology
Author
Christopher Kara
Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI
Machine Learning in Molecular Dynamics Simulations of Biomolecular Systems
Comprehensive Computational Chemistry,;Vol. 3(2024)p. 475-492
Book chapter
Motional clustering in supra-τ<inf>c</inf> conformational exchange influences NOE cross-relaxation rate
Journal of Magnetic Resonance,;Vol. 338(2022)
Journal article
Rescuing Off-Equilibrium Simulation Data through Dynamic Experimental Data with dynAMMo
Machine Learning: Science and Technology,;Vol. 4(2023)
Journal article
Minimum-Excess-Work Guidance: Score-Based Sampling with Experimental Data or Sparse Restraints
Conformational Quenching in an Engineered Lipocalin Protein Achieves High Affinity Binding to the Toxin Colchicine
Angewandte Chemie,;Vol. 64(2025)
Journal article
Areas of Advance
Information and Communication Technology
Subject Categories (SSIF 2025)
Bioinformatics (Computational Biology)
Roots
Basic sciences
Infrastructure
Chalmers e-Commons (incl. C3SE, 2020-)
DOI
10.63959/chalmers.dt/5895
ISBN
978-91-8103-438-7
Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5895
Publisher
Chalmers