De novo generated combinatorial library design
Journal article, 2023

Artificial intelligence (AI) contributes new methods for designing compounds in drug discovery, ranging from de novo design models suggesting new molecular structures or optimizing existing leads to predictive models evaluating their toxicological properties. However, a limiting factor for the effectiveness of AI methods in drug discovery is the lack of access to high-quality data sets leading to a focus on approaches optimizing data generation. Combinatorial library design is a popular approach for bioactivity testing as a large number of molecules can be synthesized from a limited number of building blocks. We propose a framework for designing combinatorial libraries using a molecular generative model to generate building blocks de novo, followed by using k-determinantal point processes and Gibbs sampling to optimize a selection from the generated blocks. We explore optimization of biological activity, Quantitative Estimate of Drug-likeness (QED) and diversity and the trade-offs between them, both in single-objective and in multi-objective library design settings. Using retrosynthesis models to estimate building block availability, the proposed framework is able to explore the prospective benefit from expanding a stock of available building blocks by synthesis or by purchasing the preferred building blocks before designing a library. In simulation experiments with building block collections from all available commercial vendors near-optimal libraries could be found without synthesis of additional building blocks; in other simulation experiments we showed that even one synthesis step to increase the number of available building blocks could improve library designs when starting with an in-house building block collection of reasonable size.

Author

Simon Johansson

AstraZeneca AB

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

University of Gothenburg

Morteza Haghir Chehreghani

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Ola Engkvist

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers)

AstraZeneca AB

Alexander Schliep

Chalmers, Computer Science and Engineering (Chalmers), Data Science

Brandenburg University of Technology

University of Gothenburg

Digital Discovery

2635098X (eISSN)

Vol. 3 1 122-135

Subject Categories (SSIF 2011)

Other Social Sciences not elsewhere specified

DOI

10.1039/d3dd00095h

More information

Latest update

3/7/2024 9