The Organelle in the Ointment: improved detection of cryptic mitochondrial reads resolves many unknown sequences in cross-species microbiome analyses
Journal article, 2024

The genomes of mitochondria and chloroplasts contain ribosomal RNA (rRNA) genes, reflecting their ancestry as free-living bacteria. These organellar rRNAs are often amplified in microbiome studies of animals and plants. If identified, they can be discarded, merely reducing sequencing depth. However, we identify certain high-abundance organeller RNAs not identified by common pipelines, which may compromise statistical analysis of microbiome structure and diversity. We quantified this by reanalyzing 7459 samples from seven 16S rRNA studies, including microbiomes from 927 unique animal genera. We find that under-annotation of cryptic mitochondrial and chloroplast reads affects multiple of these large-scale cross-species microbiome comparisons, and varies between host species, biasing comparisons. We offer a straightforward solution: supplementing existing taxonomies with diverse organelle rRNA sequences. This resolves up to 97% of unique unclassified sequences in some entire studies as mitochondrial (14% averaged across all studies), without increasing false positive annotations in mitochondria-free mock communities. Improved annotation decreases the proportion of unknown sequences by ≥10-fold in 2262 of 7459 samples (30%), spanning five of seven major studies examined. We recommend leveraging organelle sequence diversity to better identify organelle gene sequences in microbiome studies, and provide code, data resources and tutorials that implement this approach.

Author

Dylan Sonnett

Tanya Brown

Johan Bengtsson Palme

Chalmers, Life Sciences, Systems and Synthetic Biology

Jacqueline L. Padilla-Gamino

Jesse R. Zaneveld

ISME Communications

2730-6151 (ISSN)

Vol. 4 1 ycae114

Predicting future pathogenicity and antibiotic resistance

Swedish Foundation for Strategic Research (SSF) (FFL21-0174), 2022-08-01 -- 2027-12-31.

Driving Forces

Sustainable development

Subject Categories (SSIF 2011)

Biological Systematics

Microbiology

Bioinformatics and Systems Biology

Roots

Basic sciences

Areas of Advance

Health Engineering

DOI

10.1093/ismeco/ycae114

Related datasets

Extended SILVA reference taxonomy resources [dataset]

URI: https://zenodo.org/records/10251912 DOI: 10.5281/zenodo.10251912

More information

Created

12/29/2024