Improving de novo molecular design with curriculum learning

Jeff Guo; Vendy Fialkova; Juan Diego Arango; Christian Margreitter; Jon Paul Janet; Kostas Papadopoulos; Ola Engkvist; Atanas Patronov

doi:10.1038/s42256-022-00494-4

Improving de novo molecular design with curriculum learning
Journal article, 2022

While reinforcement learning can be a powerful tool for complex design tasks such as molecular design, training can be slow when problems are either too hard or too easy, as little is learned in these cases. Jeff Guo and colleagues provide a curriculum learning extension to the REINVENT de novo molecular design framework that provides problems of increasing difficulty over epochs such that the training process is more efficient. Reinforcement learning is a powerful paradigm that has gained popularity across multiple domains. However, applying reinforcement learning may come at the cost of multiple interactions between the agent and the environment. This cost can be especially pronounced when the single feedback from the environment is slow or computationally expensive, causing extensive periods of non-productivity. Curriculum learning provides a suitable alternative by arranging a sequence of tasks of increasing complexity, with the aim of reducing the overall cost of learning. Here we demonstrate the application of curriculum learning for drug discovery. We implement curriculum learning in the de novo design platform REINVENT, and apply it to illustrative molecular design problems of different complexities. The results show both accelerated learning and a positive impact on the quality of the output when compared with standard policy-based reinforcement learning.

Author

Jeff Guo

AstraZeneca AB

Vendy Fialkova

AstraZeneca AB

Juan Diego Arango

AstraZeneca AB

Christian Margreitter

AstraZeneca AB

Jon Paul Janet

AstraZeneca AB

Kostas Papadopoulos

AstraZeneca AB

Ola Engkvist

Chalmers, Computer Science and Engineering (Chalmers)

Other publications Research

Atanas Patronov

AstraZeneca AB

Nature Machine Intelligence

25225839 (eISSN)

Vol. 4 6 555-563

Subject Categories (SSIF 2011)

Interaction Technologies

Human Computer Interaction

Computer Science

DOI

10.1038/s42256-022-00494-4

Publication data connected to DOI

More information

Latest update

1/3/2024 9

Improving de novo molecular design with curriculum learning Journal article, 2022

Author

Jeff Guo

Vendy Fialkova

Juan Diego Arango

Christian Margreitter

Jon Paul Janet

Kostas Papadopoulos

Ola Engkvist

Atanas Patronov

Nature Machine Intelligence

Subject Categories (SSIF 2011)

DOI

More information

Latest update

Improving de novo molecular design with curriculum learning
Journal article, 2022