TY - GEN
T1 - Duplication-correcting codes for data storage in the DNA of living organisms
AU - Jain, Siddharth
AU - Farnoud, Farzad
AU - Schwartz, Moshe
AU - Bruck, Jehoshua
N1 - Publisher Copyright: © 2016 IEEE.
PY - 2016/8/10
Y1 - 2016/8/10
N2 - The ability to store data in the DNA of a living organism has applications in a variety of areas including synthetic biology and watermarking of patented genetically-modified organisms. Data stored in this medium is subject to errors arising from various mutations, such as point mutations, indels, and tandem duplications, which need to be corrected to maintain data integrity. In this paper, we provide error-correcting codes for errors caused by tandem duplications, which create a copy of a block of the sequence and insert it in a tandem manner, i.e., next to the original. In particular, we present a family of codes for correcting errors due to tandem duplications of a fixed length and any number of errors. We also study codes for correcting tandem duplications of length up to a given constant k, where we are primarily focused on the cases of k = 2, 3.
AB - The ability to store data in the DNA of a living organism has applications in a variety of areas including synthetic biology and watermarking of patented genetically-modified organisms. Data stored in this medium is subject to errors arising from various mutations, such as point mutations, indels, and tandem duplications, which need to be corrected to maintain data integrity. In this paper, we provide error-correcting codes for errors caused by tandem duplications, which create a copy of a block of the sequence and insert it in a tandem manner, i.e., next to the original. In particular, we present a family of codes for correcting errors due to tandem duplications of a fixed length and any number of errors. We also study codes for correcting tandem duplications of length up to a given constant k, where we are primarily focused on the cases of k = 2, 3.
UR - http://www.scopus.com/inward/record.url?scp=84985896561&partnerID=8YFLogxK
U2 - 10.1109/ISIT.2016.7541455
DO - 10.1109/ISIT.2016.7541455
M3 - Conference contribution
T3 - IEEE International Symposium on Information Theory - Proceedings
SP - 1028
EP - 1032
BT - Proceedings - ISIT 2016; 2016 IEEE International Symposium on Information Theory
T2 - 2016 IEEE International Symposium on Information Theory, ISIT 2016
Y2 - 10 July 2016 through 15 July 2016
ER -