Diversity-Aware Reinforcement Learning for de novo Drug Design
Paper in proceeding, 2025

Fine-tuning a pre-trained generative model has demonstrated good performance in generating promising drug molecules. The fine-tuning task is often formulated as a reinforcement learning problem, where previous methods efficiently learn to optimize a reward function to generate potential drug molecules. Nevertheless, in the absence of an adaptive update mechanism for the reward function, the optimization process can become stuck in local optima. The efficacy of the optimal molecule in a local optimization may not translate to usefulness in the subsequent drug optimization process or as a potential standalone clinical candidate. Therefore, it is important to generate a diverse set of promising molecules. Prior work has modified the reward function by penalizing structurally similar molecules, primarily focusing on finding molecules with higher rewards. To date, no study has comprehensively examined how different adaptive update mechanisms for the reward function influence the diversity of generated molecules. In this work, we investigate a wide range of intrinsic motivation methods and strategies to penalize the extrinsic reward, and how they affect the diversity of the set of generated molecules. Our experiments reveal that combining structure- and prediction-based methods generally yields better results in terms of diversity

Exploration

Reinforcement Learning

de novo Drug Design

Author

Hampus Gummesson Svensson

AstraZeneca AB

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Christian Tyrchan

AstraZeneca AB

Ola Engkvist

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

University of Gothenburg

AstraZeneca AB

Morteza Haghir Chehreghani

Data Science and AI 2

University of Gothenburg

IJCAI International Joint Conference on Artificial Intelligence

10450823 (ISSN)

9194-9204
978-1-956792-06-5 (ISBN)

Thirty-Fourth International Joint Conference on Artificial Intelligence, IJCAI 2025
Montreal, Canada,

Areas of Advance

Information and Communication Technology

Health Engineering

Subject Categories (SSIF 2025)

Computer Sciences

Pharmaceutical and Medical Biotechnology

Artificial Intelligence

Infrastructure

Chalmers e-Commons (incl. C3SE, 2020-)

DOI

10.24963/ijcai.2025/1022

ISBN

9781956792065

More information

Latest update

11/24/2025