Abstract
Motivated by recent advances in DNA-based data storage, we study a communication system in which information is conveyed over many sequences in parallel. The receiver cannot control access to these sequences; it can only draw from them, without knowing which sequence has been drawn. Further, the drawn sequences are susceptible to errors. In this paper, we analyze a channel model that captures this input-output relationship and compute its information capacity for a wide range of parameters and a general class of drawing distributions. This generalizes previous results for the noiseless case and for specific drawing distributions. The analysis can guide future DNA-based data storage experiments by establishing theoretical limits on achievable information rates and by proposing decoding techniques that can be useful in practical decoder implementations.
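As a rough illustration of the input-output relationship described in the abstract, the following Python sketch simulates one possible instance of such a channel. It is not the paper's model: uniform drawing with replacement, independent substitution errors, and the names `noisy_drawing_channel`, `num_draws`, and `sub_prob` are all assumptions made here for illustration, whereas the paper treats a general class of drawing distributions.

```python
import random

ALPHABET = "ACGT"  # assumed DNA alphabet


def noisy_drawing_channel(stored, num_draws, sub_prob, rng):
    """Return `num_draws` noisy reads of the stored sequences.

    Assumptions (illustrative only): each read is drawn uniformly at random
    with replacement, and each symbol is independently replaced by a
    different symbol with probability `sub_prob`.
    """
    reads = []
    for _ in range(num_draws):
        seq = rng.choice(stored)  # the drawn index is never revealed
        noisy = "".join(
            rng.choice([s for s in ALPHABET if s != base])
            if rng.random() < sub_prob
            else base
            for base in seq
        )
        reads.append(noisy)
    rng.shuffle(reads)  # the receiver sees the reads in no meaningful order
    return reads


rng = random.Random(0)
# Four stored sequences of length 12, chosen at random for the demo.
stored = ["".join(rng.choice(ALPHABET) for _ in range(12)) for _ in range(4)]
reads = noisy_drawing_channel(stored, num_draws=6, sub_prob=0.05, rng=rng)
```

With `sub_prob=0` the sketch reduces to a noiseless drawing channel, in which every read is an exact copy of some stored sequence, but some stored sequences may never be drawn while others are drawn several times.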
Original language | English |
---|---|
Pages (from-to) | 2757-2778 |
Number of pages | 22 |
Journal | IEEE Transactions on Information Theory |
Volume | 69 |
Issue number | 5 |
State | Published - 1 May 2023 |
Keywords
- Biological information theory
- DNA storage
- Channel capacity
- Data storage
- Error correction codes
All Science Journal Classification (ASJC) codes
- Information Systems
- Library and Information Sciences
- Computer Science Applications