An End-to-End Coding Scheme for DNA-Based Data Storage With Nanopore-Sequenced Reads
Artikel i vetenskaplig tidskrift, 2026

We consider error-correcting coding for deoxyribonucleic acid (DNA)-based storage using nanopore sequencing. We model the DNA storage channel as a sampling noise channel where the input data is chunked into M short DNA strands, which are copied a random number of times, and the channel outputs a random selection of N noisy DNA strands. The retrieved DNA reads are prone to strand-dependent insertion, deletion, and substitution (IDS) errors. We construct an index-based concatenated coding scheme, i.e., the concatenation of an outer code, an index code, and an inner code. We further propose a low-complexity (linear in N) maximum a posteriori probability decoder that takes into account the strand-dependent IDS errors and the randomness of the drawing to infer symbolwise a posteriori probabilities for the outer decoder. We present Monte-Carlo simulations for information-outage probabilities and frame error rates for different channel setups on experimental data. We finally evaluate the overall system performance using the read/write cost trade-off. A powerful combination of tailored channel modeling and soft information processing allows us to achieve excellent performance even with error-prone nanopore-sequenced reads outperforming state-of-the-art schemes.

DNA storage dataset

Concatenated codes

error-correcting codes

nanopore sequencing

sampling channel

DNA storage

Författare

Lorenz Welter

Technische Universität München

Roman Sokolovskii

Imperial College London

Thomas Heinis

Imperial College London

Antonia Wachter-Zeh

Technische Universität München

Eirik Rosnes

Simula UiB

Alexandre Graell Amat

Chalmers, Elektroteknik, Kommunikation, Antenner och Optiska Nätverk

IEEE Journal on Selected Areas in Information Theory

26418770 (eISSN)

Vol. 7 17-32

Teori för Sekretess och Säkerhet inom Praktisk Federerad Inlärning

Vetenskapsrådet (VR) (2023-05065), 2023-12-01 -- 2027-11-30.

Pålitlig och säker kodad kantberäkning

Vetenskapsrådet (VR) (2020-03687), 2021-01-01 -- 2024-12-31.

Ämneskategorier (SSIF 2025)

Kommunikationssystem

Telekommunikation

DOI

10.1109/JSAIT.2026.3655592

Mer information

Senast uppdaterat

2026-03-12