TY - JOUR
T1 - Transcriptomic signatures across human tissues identify functional rare genetic variation
AU - Aguet, François
AU - Barbeira, Alvaro N.
AU - Bonazzola, Rodrigo
AU - Brown, Andrew
AU - Castel, Stephane E.
AU - Jo, Brian
AU - Kasela, Silva
AU - Kim-Hellmuth, Sarah
AU - Liang, Yanyu
AU - Oliva, Meritxell
AU - Flynn, Elise D.
AU - Parsana, Princy
AU - Fresard, Laure
AU - Gamazon, Eric R.
AU - Hamel, Andrew R.
AU - He, Yuan
AU - Hormozdiari, Farhad
AU - Mohammadi, Pejman
AU - Muñoz-Aguirre, Manuel
AU - Park, Yo Son
AU - Saha, Ashis
AU - Segrè, Ayellet V.
AU - Strober, Benjamin J.
AU - Wen, Xiaoquan
AU - Wucher, Valentin
AU - Ardlie, Kristin G.
AU - Battle, Alexis
AU - Brown, Christopher D.
AU - Cox, Nancy
AU - Das, Sayantan
AU - Dermitzakis, Emmanouil T.
AU - Engelhardt, Barbara E.
AU - Garrido-Martín, Diego
AU - Gay, Nicole R.
AU - Getz, Gad A.
AU - Guigó, Roderic
AU - Handsaker, Robert E.
AU - Hoffman, Paul J.
AU - Im, Hae Kyung
AU - Kashin, Seva
AU - Kwong, Alan
AU - Lappalainen, Tuuli
AU - Li, Xiao
AU - MacArthur, Daniel G.
AU - Montgomery, Stephen B.
AU - Rouhana, John M.
AU - Stephens, Matthew
AU - Stranger, Barbara E.
AU - Todres, Ellen
AU - Yeger-Lotem, Esti
AU - Ferraro, Nicole M.
AU - Einson, Jonah
AU - Abell, Nathan S.
AU - Brandt, Margot
AU - Bucan, Maja
AU - Davis, Joe R.
AU - Greenwald, Emily
AU - Hess, Gaelen T.
AU - Hilliard, Austin T.
AU - Kember, Rachel L.
AU - Kotis, Bence
AU - Peloso, Gina
AU - Ramdas, Shweta
AU - Scott, Alexandra J.
AU - Smail, Craig
AU - Tsang, Emily K.
AU - Zekavat, Seyedeh M.
AU - Ziosi, Marcello
AU - Aradhana, null
AU - Assimes, Themistocles L.
AU - Bassik, Michael C.
AU - Correa, Adolfo
AU - Hall, Ira
AU - Li, Xin
AU - Natarajan, Pradeep
AU - Anand, Shankara
AU - Gabriel, Stacey
AU - Graubert, Aaron
AU - Hadley, Kane
AU - Huang, Katherine H.
AU - Meier, Samuel R.
AU - Nedzel, Jared L.
N1 - Publisher Copyright: © 2020 American Association for the Advancement of Science. All rights reserved.
PY - 2020/9/1
Y1 - 2020/9/1
N2 - INTRODUCTION: The human genome contains tens of thousands of rare (minor allele frequency <1%) variants, some of which contribute to disease risk. Using 838 samples with whole-genome and multitissue transcriptome sequencing data in the Genotype-Tissue Expression (GTEx) project version 8, we assessed how rare genetic variants contribute to extreme patterns in gene expression (eOutliers), allelic expression (aseOutliers), and alternative splicing (sOutliers). We integrated these three signals across 49 tissues with genomic annotations to prioritize high-impact rare variants (RVs) that associate with human traits. RATIONALE: Outlier gene expression aids in identifying functional RVs. Transcriptome sequencing provides diverse measurements beyond gene expression, including allele-specific expression and alternative splicing, which can provide additional insight into RV functional effects. RESULTS: After identifying multitissue eOutliers, aseOutliers, and sOutliers, we found that outlier individuals of each type were significantly more likely to carry an RV near the corresponding gene. Among eOutliers, we observed strong enrichment of rare structural variants. sOutliers were particularly enriched for RVs that disrupted or created a splicing consensus sequence. aseOutliers provided the strongest enrichment signal when evaluated from just a single tissue. We developed Watershed, a probabilistic model for personal genome interpretation that improves over standard genomic annotation–based methods for scoring RVs by integrating these three transcriptomic signals from the same individual and replicates in an independent cohort. To assess whether outlier RVs identified in GTEx associate with traits, we evaluated these variants for association with diverse traits in the UK Biobank, the Million Veterans Program, and the Jackson Heart Study. We found that transcriptome-assisted prioritization identified RVs with larger trait effect sizes and were better predictors of effect size than genomic annotation alone. CONCLUSION: With >800 genomes matched with transcriptomes across 49 tissues, we were able to study RVs that underlie extreme changes in the transcriptome. To capture the diversity of these extreme changes, we developed and integrated approaches to identify expression, allele-specific expression, and alternative splicing outliers, and characterized the RV landscape underlying each outlier signal. We demonstrate that personal genome interpretation and RV discovery is enhanced by using these signals. This approach provides a new means to integrate a richer set of functional RVs into models of genetic burden, improve disease gene identification, and enable the delivery of precision genomics.
AB - INTRODUCTION: The human genome contains tens of thousands of rare (minor allele frequency <1%) variants, some of which contribute to disease risk. Using 838 samples with whole-genome and multitissue transcriptome sequencing data in the Genotype-Tissue Expression (GTEx) project version 8, we assessed how rare genetic variants contribute to extreme patterns in gene expression (eOutliers), allelic expression (aseOutliers), and alternative splicing (sOutliers). We integrated these three signals across 49 tissues with genomic annotations to prioritize high-impact rare variants (RVs) that associate with human traits. RATIONALE: Outlier gene expression aids in identifying functional RVs. Transcriptome sequencing provides diverse measurements beyond gene expression, including allele-specific expression and alternative splicing, which can provide additional insight into RV functional effects. RESULTS: After identifying multitissue eOutliers, aseOutliers, and sOutliers, we found that outlier individuals of each type were significantly more likely to carry an RV near the corresponding gene. Among eOutliers, we observed strong enrichment of rare structural variants. sOutliers were particularly enriched for RVs that disrupted or created a splicing consensus sequence. aseOutliers provided the strongest enrichment signal when evaluated from just a single tissue. We developed Watershed, a probabilistic model for personal genome interpretation that improves over standard genomic annotation–based methods for scoring RVs by integrating these three transcriptomic signals from the same individual and replicates in an independent cohort. To assess whether outlier RVs identified in GTEx associate with traits, we evaluated these variants for association with diverse traits in the UK Biobank, the Million Veterans Program, and the Jackson Heart Study. We found that transcriptome-assisted prioritization identified RVs with larger trait effect sizes and were better predictors of effect size than genomic annotation alone. CONCLUSION: With >800 genomes matched with transcriptomes across 49 tissues, we were able to study RVs that underlie extreme changes in the transcriptome. To capture the diversity of these extreme changes, we developed and integrated approaches to identify expression, allele-specific expression, and alternative splicing outliers, and characterized the RV landscape underlying each outlier signal. We demonstrate that personal genome interpretation and RV discovery is enhanced by using these signals. This approach provides a new means to integrate a richer set of functional RVs into models of genetic burden, improve disease gene identification, and enable the delivery of precision genomics.
UR - http://www.scopus.com/inward/record.url?scp=85096430793&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/record.url?scp=85090818075&partnerID=8YFLogxK
U2 - https://doi.org/10.1126/SCIENCE.AAZ5900
DO - https://doi.org/10.1126/SCIENCE.AAZ5900
M3 - Article
C2 - 32913073
SN - 0036-8075
VL - 369
JO - Science
JF - Science
IS - 6509
M1 - 1334
ER -