Automating Hypothesis Generation and Testing: Towards Self-driving Biology
Doktorsavhandling, 2025

Biological systems remain only partially understood, and the relative pace of functional discovery has been slowing down despite advances in measurement technologies. A growing consensus suggests that the most promising way forward is not only via conventional laboratory automation, but through the development of fully autonomous systems that can generate, prioritize, draw insight from, and execute high-throughput experimentation.

This thesis explores how such automation can accelerate scientific discovery by combining methods from artificial intelligence—such as inductive logic programming, explainable AI, and large language models—with physical instrumentation, including laboratory robotics and high-throughput analytical platforms like mass spectrometry. The work spans the entire discovery cycle, from hypothesis generation to experimental evaluation.

As a case study, the methods are applied to Saccharomyces cerevisiae (baker’s yeast), an extensively studied eukaryote and a powerful model organism for systems biology. In doing so, the thesis contributes to further characterization of key aspects of yeast biology, including the diauxic shift and its regulators (via untargeted metabolomics), genome-wide proteomic regulation, phenotypic determinants of fitness, and metabolic interactions involving amino acids.

The findings emphasize that automation in biology requires more than throughput alone. Automated systems must also leverage existing knowledge, provide interpretable reasoning processes, and preferably capture enough metadata for auditability. These studies also highlight how automation, when combined with structured knowledge and high-throughput experimentation, can refine existing approaches and move biology toward more integrative and transparent modes of discovery.

Automation of science

mass spectrometry

laboratory automation

inductive logic programming

metabolomics

systems biology

machine learning

KB10 (10:an), Chemistry building, Kemivägen 8a, Chalmers University of Technology, Campus Johanneberg
Opponent: Prof. Ola Spjuth, Uppsala University

Författare

Daniel Brunnsåker

Chalmers, Data- och informationsteknik, Data Science och AI

High-throughput metabolomics for the design and validation of a diauxic shift model

NPJ systems biology and applications,;Vol. 9(2023)p. 11-

Artikel i vetenskaplig tidskrift

Interpreting protein abundance in Saccharomyces cerevisiae through relational learning

Bioinformatics,;Vol. 40(2024)

Artikel i vetenskaplig tidskrift

AutonoMS: Automated Ion Mobility Metabolomic Fingerprinting

Journal of the American Society for Mass Spectrometry,;Vol. 35(2024)p. 542-550

Artikel i vetenskaplig tidskrift

Filip Kronström, Daniel Brunnsåker, Ievgeniia A. Tiukova, Ross D. King. Ontology-based box embeddings and knowledge graphs for predicting phenotypic traits in Saccharomyces cerevisiae. To appear in the 19th International Conference on Neurosymbolic Learning and Reasoning.

When we think of a scientist, we picture someone coming up with ideas, testing them in the lab, and making sense of the results. But in modern biology, this process has become overwhelming. Even the simplest living cells contain thousands upon thousands of parts that interact in intricate ways. Experiments now produce vast amounts of data, far more than any one person could fully understand on their own.

This is where “AI scientists” come in. These are computer systems connected torobotic labs that can help with every step of research: suggesting ideas, planning and running experiments, and interpreting the outcomes. The aim is not to replace human scientists but to give them a powerful partner that can handle scale, speed, and detail.

In this thesis, we explore what AI scientists might look like in practice by usingbaker’s yeast, one of biology’s best-studied organisms, as a testing ground. Different parts of the discovery process were automated, from hypothesis generation to datacollection and analysis. One study also demonstrates a complete end-to-end solution. Along the way, new biological insights emerged. More importantly, the work shows how AI scientists could become genuine partners for human researchers—taking on the heavy lifting of scale and complexity, while leaving people to do what we do best: creativity, intuition, and asking the right questions.

Ämneskategorier (SSIF 2025)

Bioinformatik (beräkningsbiologi)

Bioinformatik och beräkningsbiologi

Cellbiologi

Mikrobiologi

Artificiell intelligens

Infrastruktur

Chalmers infrastruktur för masspektrometri

Chalmers e-Commons (inkl. C3SE, 2020-)

DOI

10.63959/chalmers.dt/5755

ISBN

978-91-8103-297-0

Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5755

Utgivare

Chalmers

KB10 (10:an), Chemistry building, Kemivägen 8a, Chalmers University of Technology, Campus Johanneberg

Opponent: Prof. Ola Spjuth, Uppsala University

Mer information

Senast uppdaterat

2025-09-22