Stereochemistry-aware string-based molecular generation
Journal article, 2025

This study investigates the impact of incorporating stereochemical information, a crucial aspect of computational drug discovery and materials design, in molecular generative modeling. We present a detailed comparison of stereochemistry-aware and conventionally stereochemistry-unaware string-based generative approaches, utilizing both genetic algorithms and reinforcement learning-based techniques. To evaluate these models, we introduce novel benchmarks specifically designed to assess the importance of stereochemistry-aware generative modeling. Our results demonstrate that stereochemistry-aware models generally perform on par with or surpass conventional algorithms across various stereochemistry-sensitive tasks. However, we also observe that in scenarios where stereochemistry plays a less critical role, stereochemistry-aware models may face challenges due to the increased complexity of the chemical space they must navigate. This work provides insights into the trade-offs involved in incorporating stereochemical information in molecular generative models and offers guidance for selecting appropriate approaches based on specific application requirements.

stereochemistry

drug design

machine learning

molecular generation

generative modeling

Authors

Gary Tom

University of Toronto

Vector Institute for AI

Edwin Yu

University of Toronto

Naruki Yoshikawa

University of Toronto

Vector Institute for AI

Kjell Jorner

Swiss Federal Institute of Technology in Zürich (ETH)

University of Toronto

Chalmers, Chemistry and Chemical Engineering, Chemistry and Biochemistry

Alán Aspuru-Guzik

Canadian Institute for Advanced Research

Vector Institute for AI

University of Toronto

PNAS Nexus

2752-6542 (eISSN)

Vol. 4, Issue 11, pgaf329

Inverse design of molecules and reactions

Swedish Research Council (VR) (2020-00314), 2021-01-01 -- 2023-12-31.

Subject Categories (SSIF 2025)

Computer Sciences

DOI

10.1093/pnasnexus/pgaf329

More information

Latest update

5/22/2026