TY - JOUR
T1 - Drosophila Evolution over Space and Time (DEST)
T2 - A New Population Genomics Resource
AU - Kapun, Martin
AU - Nunez, Joaquin C.B.
AU - Bogaerts-Márquez, María
AU - Murga-Moreno, Jesús
AU - Paris, Margot
AU - Outten, Joseph
AU - Coronado-Zamora, Marta
AU - Tern, Courtney
AU - Rota-Stabelli, Omar
AU - García Guerreiro, Maria P.
AU - Casillas, Sònia
AU - Orengo, Dorcas J.
AU - Puerma, Eva
AU - Kankare, Maaria
AU - Ometto, Lino
AU - Loeschcke, Volker
AU - Onder, Banu S.
AU - Abbott, Jessica K.
AU - Schaeffer, Stephen W.
AU - Rajpurohit, Subhash
AU - Behrman, Emily L.
AU - Schou, Mads F.
AU - Merritt, Thomas J.S.
AU - Lazzaro, Brian P.
AU - Glaser-Schmitt, Amanda
AU - Argyridou, Eliza
AU - Staubach, Fabian
AU - Wang, Yun
AU - Tauber, Eran
AU - Serga, Svitlana V.
AU - Fabian, Daniel K.
AU - Dyer, Kelly A.
AU - Wheat, Christopher W.
AU - Parsch, John
AU - Grath, Sonja
AU - Veselinovic, Marija Savic
AU - Stamenkovic-Radak, Marina
AU - Jelic, Mihailo
AU - Buendía-Ruíz, Antonio J.
AU - Gómez-Julián, Maria Josefa
AU - Espinosa-Jimenez, Maria Luisa
AU - Gallardo-Jiménez, Francisco D.
AU - Patenkovic, Aleksandra
AU - Eric, Katarina
AU - Tanaskovic, Marija
AU - Ullastres, Anna
AU - Guio, Lain
AU - Merenciano, Miriam
AU - Guirao-Rico, Sara
AU - Horváth, Vivien
AU - Obbard, Darren J.
AU - Pasyukova, Elena
AU - Alatortsev, Vladimir E.
AU - Vieira, Cristina P.
AU - Vieira, Jorge
AU - Torres, Jorge Roberto
AU - Kozeretska, Iryna
AU - Maistrenko, Oleksandr M.
AU - Montchamp-Moreau, Catherine
AU - Mukha, Dmitry V.
AU - Machado, Heather E.
AU - Lamb, Keric
AU - Paulo, Tânia
AU - Yusuf, Leeban
AU - Barbadilla, Antonio
AU - Petrov, Dmitri
AU - Schmidt, Paul
AU - Gonzalez, Josefa
AU - Flatt, Thomas
AU - Bergland, Alan O.
N1 - Funding Information: We thank four reviewers and the handling editor for helpful comments on previous versions of our manuscript. We are grateful to the members of the DrosEU and DrosRTEC consortia for their long-standing support, collaboration, and for discussion. DrosEU was funded by a Special Topic Networks (STN) grant from the European Society for Evolutionary Biology (ESEB). M.K. was supported by the Austrian Science Foundation (grant no. FWF P32275); J.G. by the European Research Council (ERC) under the European Union?s Horizon 2020 research and innovation programme (H2020-ERC-2014-CoG-647900) and by the Spanish Ministry of Science and Innovation (BFU-2011-24397); T.F. by the Swiss National Science Foundation (SNSF grants PP00P3_133641, PP00P3_165836, and 31003A_182262) and a Mercator Fellowship from the German Research Foundation (DFG), held as a EvoPAD Visiting Professor at the Institute for Evolution and Biodiversity, University of M?nster; AOB by the National Institutes of Health (R35 GM119686); M.K. by Academy of Finland grant 322980; V.L. by Danish Natural Science Research Council (FNU) (grant no. 4002-00113B); FS Deutsche Forschungsgemeinschaft (DFG) (grant no. STA1154/4-1), Project 408908608; J.P. by the Deutsche Forschungsgemeinschaft Projects 274388701 and 347368302; A.U. by FPI fellowship (BES-2012-052999); ET Israel Science Foundation (ISF) (grant no. 1737/17); M.S.V., M.S.R. and M.J. by a grant from the Ministry of Education, Science and Technological Development of the Republic of Serbia (451-03-68/2020-14/200178); A.P., K.E. and M.T. by a grant from the Ministry of Education, Science and Technological Development of the Republic of Serbia (451-03-68/2020-14/200007); and TM NSERC grant RGPIN-2018-05551. The authors acknowledge Research Computing at The University of Virginia for providing computational resources and technical support that have contributed to the results reported within this publication (https://rc.virginia.edu, last accessed September 6, 2021). Funding Information: We thank four reviewers and the handling editor for helpful comments on previous versions of our manuscript. We are grateful to the members of the DrosEU and DrosRTEC consortia for their long-standing support, collaboration, and for discussion. DrosEU was funded by a Special Topic Networks (STN) grant from the European Society for Evolutionary Biology (ESEB). M.K. was supported by the Austrian Science Foundation (grant no. FWF P32275); J.G. by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (H2020-ERC-2014-CoG-647900) and by the Spanish Ministry of Science and Innovation (BFU-2011-24397); T.F. by the Swiss National Science Foundation (SNSF grants PP00P3_133641, PP00P3_165836, and 31003A_182262) and a Mercator Fellowship from the German Research Foundation (DFG), held as a EvoPAD Visiting Professor at the Institute for Evolution and Biodiversity, University of Münster; AOB by the National Institutes of Health (R35 GM119686); M.K. by Academy of Finland grant 322980; V.L. by Danish Natural Science Research Council (FNU) (grant no. 4002-00113B); FS Deutsche Forschungsgemeinschaft (DFG) (grant no. STA1154/4-1), Project 408908608; J.P. by the Deutsche Forschungsgemeinschaft Projects 274388701 and 347368302; A.U. by FPI fellowship (BES-2012-052999); ET Israel Science Foundation (ISF) (grant no. 1737/17); M.S.V., M.S.R. and M.J. by a grant from the Ministry of Education, Science and Technological Development of the Republic of Serbia (451-03-68/2020-14/200178); A.P., K.E. and M.T. by a grant from the Ministry of Education, Science and Technological Development of the Republic of Serbia (451-03-68/2020-14/200007); and TM NSERC grant RGPIN-2018-05551. The authors acknowledge Research Computing at The University of Virginia for providing computational resources and technical support that have contributed to the results reported within this publication (https://rc.virginia.edu, last accessed September 6, 2021). Publisher Copyright: © The Author(s) 2021.
PY - 2021/12/9
Y1 - 2021/12/9
N2 - Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome data sets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate data sets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in >20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This data set, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental metadata. A web-based genome browser and web portal provide easy access to the SNP data set. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan data set. Our resource will enable population geneticists to analyze spatiotemporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.
AB - Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome data sets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate data sets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in >20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This data set, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental metadata. A web-based genome browser and web portal provide easy access to the SNP data set. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan data set. Our resource will enable population geneticists to analyze spatiotemporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.
KW - Adaptation
KW - Demography
KW - Drosophila melanogaster
KW - Evolution
KW - Population genomics
KW - SNPs
UR - http://www.scopus.com/inward/record.url?scp=85123463299&partnerID=8YFLogxK
U2 - https://doi.org/10.1093/molbev/msab259
DO - https://doi.org/10.1093/molbev/msab259
M3 - Article
C2 - 34469576
SN - 0737-4038
VL - 38
SP - 5782
EP - 5805
JO - Molecular Biology and Evolution
JF - Molecular Biology and Evolution
IS - 12
ER -