The Noisy Drawing Channel: Reliable Data Storage in DNA Sequences

Andreas Lenz, Paul H. Siegel, Antonia Wachter-Zeh, Eitan Yaakobi

Research output: Contribution to journal > Article > peer-review

Abstract

Motivated by recent advances in DNA-based data storage, we study a communication system in which information is conveyed over many sequences in parallel. The receiver cannot control access to these sequences and can only draw from them, unaware of which sequence has been drawn. Further, the drawn sequences are susceptible to errors. In this paper, a channel model that captures this input-output relationship is analyzed, and its information capacity is computed for a wide range of parameters and a general class of drawing distributions. This generalizes previous results for the noiseless case and for specific drawing distributions. The analysis can guide future DNA-based data storage experiments by establishing theoretical limits on achievable information rates and by proposing decoding techniques that can be useful for practical decoder implementations.
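
To make the input-output relationship described in the abstract concrete, the following is a minimal simulation sketch, not the authors' model. It assumes a uniform drawing distribution with replacement and independent substitution errors over the alphabet {A, C, G, T}; the abstract itself covers a general class of drawing distributions and does not specify the error type. The names `noisy_draw`, `stored`, `num_draws`, and `sub_prob` are hypothetical and chosen for illustration only.

```python
import random

ALPHABET = "ACGT"


def noisy_draw(stored, num_draws, sub_prob, rng):
    """Simulate one use of a toy noisy drawing channel.

    Assumptions (not taken from the paper): draws are uniform with
    replacement, and each symbol is independently substituted by a
    different base with probability sub_prob.
    """
    reads = []
    for _ in range(num_draws):
        # The receiver sees only the read, never the index that was drawn.
        seq = rng.choice(stored)
        noisy = []
        for base in seq:
            if rng.random() < sub_prob:
                # Substitution error: replace with one of the other three bases.
                noisy.append(rng.choice([b for b in ALPHABET if b != base]))
            else:
                noisy.append(base)
        reads.append("".join(noisy))
    return reads


rng = random.Random(0)
# Four stored sequences of length 8, generated at random for the demo.
stored = ["".join(rng.choice(ALPHABET) for _ in range(8)) for _ in range(4)]
reads = noisy_draw(stored, num_draws=6, sub_prob=0.05, rng=rng)
print(reads)
```

Because draws are made with replacement, some stored sequences may be read several times and others not at all, which is why the number of draws relative to the number of stored sequences matters for the achievable rate.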

Original language: English
Pages (from-to): 2757-2778
Number of pages: 22
Journal: IEEE Transactions on Information Theory
Volume: 69
Issue number: 5
State: Published - 1 May 2023

Keywords

  • Biological information theory
  • DNA storage
  • channel capacity
  • data storage
  • error correction codes

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Library and Information Sciences
  • Computer Science Applications
