Abstract
We consider the problem of reconstructing a sequence of independent and identically distributed symbols from a set of equal-size, consecutive, fragments, as well as a dependent reference sequence. First, in the regime in which the fragments are relatively long and typically no fragment appears more than once, we determine the scaling of the failure probability of the maximum-likelihood reconstruction algorithm for a perfect reconstruction, and bound it for a partial reconstruction. Second, we characterize the regime in which the fragments are relatively short and repeating fragments abound. We state a trade-off between the fraction of fragments that cannot be adequately reconstructed vs. the distortion level allowed for the reconstruction of each fragment, while still allowing vanishing failure probability.
| Original language | English |
|---|---|
| Article number | 10504879 |
| Pages (from-to) | 1 |
| Number of pages | 1 |
| Journal | IEEE Transactions on Information Theory |
| Volume | 70 |
| Issue number | 7 |
| DOIs | |
| State | Published - 1 Jul 2024 |
Keywords
- DNA sequencing
- Distortion
- Encoding
- Fragment reordering
- Genomics
- Noise measurement
- Reconstruction algorithms
- Sequential analysis
- Vectors
- bee-identification problem
- permutation reconstruction
- reference sequence
- sequence reconstruction
- side information
- sliced sequences
All Science Journal Classification (ASJC) codes
- Information Systems
- Library and Information Sciences
- Computer Science Applications
Fingerprint
Dive into the research topics of 'Fundamental Limits of Reference-Based Sequence Reordering'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver