TY - GEN
T1 - A stochastic model for genomic interspersed duplication
AU - Farnoud, Farzad
AU - Schwartz, Moshe
AU - Bruck, Jehoshua
N1 - Publisher Copyright: © 2015 IEEE.
PY - 2015/9/28
Y1 - 2015/9/28
N2 - Mutation processes such as point mutation, insertion, deletion, and duplication (including tandem and interspersed duplication) have an important role in evolution, as they lead to genomic diversity, and thus to phenotypic variation. In this work, we study the expressive power of interspersed duplication, i.e., its ability to generate diversity, via a simple but fundamental stochastic model, where the length and the location of the substring that is duplicated and the point of insertion of the copy are chosen randomly. We investigate the properties of the set of high-probability sequences in these stochastic systems. In particular we provide results regarding the asymptotic behavior of frequencies of symbols and strings in a sequence evolving through interspersed duplication. The study of such systems is an important step towards the design and analysis of more realistic and sophisticated models of genomic mutation processes.
AB - Mutation processes such as point mutation, insertion, deletion, and duplication (including tandem and interspersed duplication) have an important role in evolution, as they lead to genomic diversity, and thus to phenotypic variation. In this work, we study the expressive power of interspersed duplication, i.e., its ability to generate diversity, via a simple but fundamental stochastic model, where the length and the location of the substring that is duplicated and the point of insertion of the copy are chosen randomly. We investigate the properties of the set of high-probability sequences in these stochastic systems. In particular we provide results regarding the asymptotic behavior of frequencies of symbols and strings in a sequence evolving through interspersed duplication. The study of such systems is an important step towards the design and analysis of more realistic and sophisticated models of genomic mutation processes.
UR - http://www.scopus.com/inward/record.url?scp=84969795603&partnerID=8YFLogxK
U2 - https://doi.org/10.1109/ISIT.2015.7282586
DO - https://doi.org/10.1109/ISIT.2015.7282586
M3 - Conference contribution
T3 - IEEE International Symposium on Information Theory - Proceedings
SP - 904
EP - 908
BT - Proceedings - 2015 IEEE International Symposium on Information Theory, ISIT 2015
T2 - IEEE International Symposium on Information Theory, ISIT 2015
Y2 - 14 June 2015 through 19 June 2015
ER -