Automated Scientific Discovery: From Equation Discovery to Autonomous Discovery Systems
Reviewartikel, 2026

The paper surveys automated scientific discovery, from equation discovery and symbolic regression to autonomous discovery systems and agents. It discusses the individual approaches from a "big picture" perspective and in context, but also discusses open issues and recent topics like the various roles of deep neural networks in this area, aiding in the discovery of human-interpretable knowledge. Further, we will present closed-loop scientific discovery systems, starting with the pioneering work on the Adam system up to current efforts in fields from material science to astronomy. Finally, we will elaborate on autonomy from a machine learning perspective, but also in analogy to the autonomy levels in autonomous driving. The maximal level, level five, is defined to require no human intervention at all in the production of scientific knowledge. Achieving this is one step towards solving the Nobel Turing Grand Challenge to develop AI Scientists: AI systems capable of making Nobel-quality scientific discoveries highly autonomously at a level comparable, and possibly superior, to the best human scientists by 2050.

Self-driving labs

AI scientist

Symbolic regression

Nobel Turing challenge

AI for science

Scientific discovery

Författare

Stefan Kramer

Johannes Gutenberg-Universität Mainz

Mattia Cerrato

Johannes Gutenberg-Universität Mainz

Jannis Brugger

Technische Universität Darmstadt

Saso Dzeroski

Institut Jožef Stefan

Ross King

Göteborgs universitet

Chalmers, Data- och informationsteknik, Data Science och AI

Machine Learning

0885-6125 (ISSN) 1573-0565 (eISSN)

Vol. 115 5 109

Ämneskategorier (SSIF 2025)

Språkbehandling och datorlingvistik

Robotik och automation

Datavetenskap (datalogi)

Människa-datorinteraktion (interaktionsdesign)

Systemvetenskap, informationssystem och informatik

Datorsystem

DOI

10.1007/s10994-025-06955-2

Mer information

Senast uppdaterat

2026-05-13