Knowledge Representations for Scientific Discovery
Licentiate thesis, 2025
and machine learning, and where progress can have major societal impact.
These techniques can help scientists uncover patterns in large datasets, generate
new hypotheses, and guide experimental design. Combined with robotic systems
they can also greatly increase the number of performed experiments. This thesis
investigates how structured knowledge representations can support scientific
discovery in systems biology, with a particular focus on the model organism
Saccharomyces cerevisiae (baker’s yeast).
The work introduces two ontologies designed as semantic schemas for
research databases. The first captures metadata and results from µ-chemostat
experiments, accommodating multiple measurement modalities. The second
ontology formalises revisions to computational models, with a focus on domains
where mechanistic models are updated iteratively and it is important to record
what was changed and why.
In the third contribution, information about S. cerevisiae from public data-
bases is integrated into a knowledge graph with well-defined class hierarchies.
Graph neural networks, in combination with box embeddings representing the
hierarchical structure, are used to predict growth outcomes of double gene
deletions. Furthermore, explainability techniques are applied to identify can-
didate biological interactions, forming hypotheses about traits in S. cerevisiae.
One such hypothesis is experimentally validated, illustrating how structured
representations can aid data-driven discovery from publicly available resources.
Taken together, this work introduces knowledge representations for emerging
domains, designed as tools to support scientific discovery, while also demon-
strating how rich, structured representations can enhance the interpretation of
existing data.
Machine learning
Systems Biology
Semantic Web
Ontologies
Neurosymbolic AI
Knowledge Graphs
Graph Neural Networks
Knowledge Represenations
Author
Filip Kronström
Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI
Genesis-DB: a database for autonomous laboratory systems
Bioinformatics Advances,;Vol. 3(2023)
Journal article
RIMBO - An Ontology for Model Revision Databases
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),;Vol. 14276 LNAI(2023)p. 523-534
Paper in proceeding
Ontology-based box embeddings and knowledge graphs for predicting phenotypic traits in Saccharomyces cerevisiae
Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence, UAI 2022,;Vol. 284(2025)
Paper in proceeding
Areas of Advance
Information and Communication Technology
Life Science Engineering (2010-2018)
Subject Categories (SSIF 2025)
Bioinformatics (Computational Biology)
Computer Sciences
Infrastructure
Chalmers e-Commons (incl. C3SE, 2020-)
Publisher
Chalmers
E2 Room 3364 EDIT-rummet
Opponent: Prof. Ashwin Srinivasan, Department of Computer Science and Information Systems, BITS Pilani, India