De novo generated combinatorial library design
Journal article, 2023

Artificial intelligence (AI) contributes new methods for designing compounds in drug discovery, ranging from de novo design models that suggest new molecular structures or optimize existing leads to predictive models that evaluate their toxicological properties. However, the effectiveness of AI methods in drug discovery is limited by a lack of access to high-quality data sets, which has led to a focus on approaches that optimize data generation. Combinatorial library design is a popular approach for bioactivity testing because a large number of molecules can be synthesized from a limited number of building blocks. We propose a framework for designing combinatorial libraries that uses a molecular generative model to generate building blocks de novo and then uses k-determinantal point processes and Gibbs sampling to optimize a selection from the generated blocks. We explore the optimization of biological activity, Quantitative Estimate of Drug-likeness (QED), and diversity, and the trade-offs between them, in both single-objective and multi-objective library design settings. Using retrosynthesis models to estimate building block availability, the framework can explore the prospective benefit of expanding a stock of available building blocks, by synthesis or by purchasing the preferred building blocks, before designing a library. In simulation experiments with building block collections from all available commercial vendors, near-optimal libraries could be found without synthesizing additional building blocks. In other simulation experiments, even one synthesis step to increase the number of available building blocks could improve library designs when starting from an in-house building block collection of reasonable size.
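The selection step named in the abstract (a k-determinantal point process explored by Gibbs sampling) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each candidate building block has a feature vector and a scalar quality score (for example a QED-like value), combines them into a quality-weighted RBF kernel, and runs a simple Gibbs-style exchange chain that swaps one selected block at a time. The names `build_l_kernel`, `log_det_subset`, and `gibbs_kdpp`, the RBF similarity, and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np


def build_l_kernel(features, quality, gamma=1.0):
    """Quality-weighted RBF kernel L = diag(q) * S * diag(q) (an assumed form)."""
    sq = np.sum(features ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * features @ features.T, 0.0)
    similarity = np.exp(-gamma * d2)
    return quality[:, None] * similarity * quality[None, :]


def log_det_subset(L, subset):
    """Log-determinant of the principal submatrix of L indexed by subset."""
    idx = np.asarray(subset)
    sign, logdet = np.linalg.slogdet(L[np.ix_(idx, idx)])
    return logdet if sign > 0 else -np.inf


def gibbs_kdpp(L, k, n_steps=2000, seed=0):
    """Gibbs-style exchange chain over size-k subsets, targeting P(S) ~ det(L_S)."""
    n = L.shape[0]
    if not 0 < k < n:
        raise ValueError("k must satisfy 0 < k < number of items")
    rng = np.random.default_rng(seed)
    subset = [int(i) for i in rng.choice(n, size=k, replace=False)]
    current = log_det_subset(L, subset)
    for _ in range(n_steps):
        pos = int(rng.integers(k))  # position in the subset to resample
        outside = [j for j in range(n) if j not in subset]
        proposal = subset.copy()
        proposal[pos] = int(rng.choice(outside))
        new = log_det_subset(L, proposal)
        if new == -np.inf:
            continue  # degenerate proposal, keep the current subset
        # Accept with probability det(new) / (det(new) + det(current)).
        accept_prob = np.exp(new - np.logaddexp(new, current))
        if rng.random() < accept_prob:
            subset, current = proposal, new
    return sorted(subset)


# Toy usage with random stand-ins for building-block features and scores.
rng = np.random.default_rng(1)
features = rng.normal(size=(12, 4))
quality = rng.uniform(0.5, 1.0, size=12)
selected = gibbs_kdpp(build_l_kernel(features, quality), k=4, n_steps=500)
print(selected)
```

Because the determinant grows when the selected blocks are both high-scoring and mutually dissimilar, scaling the quality scores shifts the balance between score and diversity, which is one way to picture the trade-offs the abstract mentions.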

Authors

Simon Johansson

AstraZeneca AB

Chalmers, Computer Science and Engineering, Data Science and AI

University of Gothenburg

Morteza Haghir Chehreghani

University of Gothenburg

Chalmers, Computer Science and Engineering, Data Science and AI

Ola Engkvist

University of Gothenburg

Chalmers, Computer Science and Engineering

AstraZeneca AB

Alexander Schliep

Chalmers, Computer Science and Engineering, Data Science

Brandenburgische Technische Universität

University of Gothenburg

Digital Discovery

2635-098X (eISSN)

Vol. 3, Issue 1, pp. 122-135

Subject categories

Other Social Sciences not elsewhere specified

DOI

10.1039/d3dd00095h

More information

Last updated

2024-03-07