Abstract
This paper introduces a new solution to DNA storage that integrates all three steps of retrieval, namely clustering, reconstruction, and error correction. DNA-correcting codes are presented as a unique solution to the problem of ensuring that the output of the storage system is unique for any valid set of input strands. To this end, we introduce a novel distance metric to capture the unique behavior of the DNA storage system and provide necessary and sufficient conditions for DNA-correcting codes. We also establish bounds and constructions for these codes, including an exploration of the ℓ∞ distance applied to permutations. Here, instead of interpreting permutation elements as numerical values and assessing absolute differences, we treat them as vectors and consider the Hamming distance to better model the DNA Storage System.
Original language | English |
---|---|
Pages (from-to) | 4214-4227 |
Number of pages | 14 |
Journal | IEEE Transactions on Information Theory |
Volume | 71 |
Issue number | 6 |
DOIs | |
State | Published - 2025 |
Keywords
- DNA data storage
- coding theory
- error correction codes
- indexes
- reconstruction algorithms
All Science Journal Classification (ASJC) codes
- Information Systems
- Computer Science Applications
- Library and Information Sciences