Deep learning-based k(cat) prediction enables improved enzyme-constrained model reconstruction
Artikel i vetenskaplig tidskrift, 2022

Enzyme turnover numbers (k(cat)) are key to understanding cellular metabolism, proteome allocation and physiological diversity, but experimentally measured k(cat) data are sparse and noisy. Here we provide a deep learning approach (DLKcat) for high-throughput k(cat) prediction for metabolic enzymes from any organism merely from substrate structures and protein sequences. DLKcat can capture k(cat) changes for mutated enzymes and identify amino acid residues with a strong impact on k(cat) values. We applied this approach to predict genome-scale k(cat) values for more than 300 yeast species. Additionally, we designed a Bayesian pipeline to parameterize enzyme-constrained genome-scale metabolic models from predicted k(cat) values. The resulting models outperformed the corresponding original enzyme-constrained genome-scale metabolic models from previous pipelines in predicting phenotypes and proteomes, and enabled us to explain phenotypic differences. DLKcat and the enzyme-constrained genome-scale metabolic model construction pipeline are valuable tools to uncover global trends of enzyme kinetics and physiological diversity, and to further elucidate cellular metabolism on a large scale.

Författare

Feiran Li

Chalmers, Biologi och bioteknik, Systembiologi

Le Yuan

Chalmers, Biologi och bioteknik, Systembiologi

Hongzhong Lu

Chalmers, Biologi och bioteknik, Systembiologi

Gang Li

Chalmers, Biologi och bioteknik, Systembiologi

Yu Chen

Chalmers, Biologi och bioteknik, Systembiologi

Martin Engqvist

Chalmers, Biologi och bioteknik, Systembiologi

Eduard Kerkhoven

Chalmers, Biologi och bioteknik, Systembiologi

Jens B Nielsen

Chalmers, Biologi och bioteknik, Systembiologi

BioInnovation Institute

Nature Catalysis

25201158 (eISSN)

Vol. 5 8 662-672

Synthetic microbial consortia-based platform for flavonoids production using synthetic biology (Synbio4Flav)

Europeiska kommissionen (EU) (EC/H2020/814650), 2019-01-01 -- 2023-02-28.

Bioinformatics Services for Data-Driven Design of Cell Factories and Communities (DD-DeCaF)

Europeiska kommissionen (EU) (EC/H2020/686070), 2016-03-01 -- 2020-02-28.

Ämneskategorier

Biokemi och molekylärbiologi

Bioinformatik (beräkningsbiologi)

Bioinformatik och systembiologi

DOI

10.1038/s41929-022-00798-z

Relaterade dataset

Supplementary Dataset for Deep learning based k(cat) prediction enables improved enzyme constrained model reconstruction [dataset]

DOI: 10.5281/zenodo.5164209

Mer information

Senast uppdaterat

2024-03-07