Identifying Systematic Variation at the Single-Cell Level by Leveraging Low-Resolution Population-Level Data

Elior Rahmani, Michael I. Jordan, Nir Yosef

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A major limitation in single-cell genomics is a lack of ability to conduct cost-effective population-level studies. As a result, much of the current research in single-cell genomics focuses on biological processes that are broadly conserved across individuals, such as cellular organization and tissue development. This limitation prevents us from studying the etiology of experimental or clinical conditions that may be inconsistent across individuals owing to molecular variation and a wide range of effects in the population. In order to address this gap, we developed “kernel of integrated single cells” (Keris), a novel model-based framework to inform the analysis of single-cell gene expression data with population-level effects of a condition of interest. By inferring cell-type-specific moments and their variation across conditions using large tissue-level bulk data representing a population, Keris allows us to generate testable hypotheses at the single-cell level that would otherwise require collecting single-cell data from a large number of donors. Within the Keris framework, we show how the combination of low-resolution, large bulk data with small but high-resolution single-cell data enables the identification of changes in cell-subtype compositions and the characterization of subpopulations of cells that are affected by a condition of interest. Using Keris we estimate linear and non-linear age-associated changes in cell-type expression in large bulk peripheral blood mononuclear cells (PBMC) data. Combining with three independent single-cell PBMC datasets, we demonstrate that Keris can identify changes in cell-subtype composition with age and capture cell-type-specific subpopulations of senescent cells. This demonstrates the promise of enhancing single-cell data with population-level information to study compositional changes and to profile condition-affected subpopulations of cells, and provides a potential resource of targets for future clinical interventions.

Original languageEnglish
Title of host publicationResearch in Computational Molecular Biology - 26th Annual International Conference, RECOMB 2022, Proceedings
EditorsItsik Pe’er
PublisherSpringer Science and Business Media B.V.
Pages371
Number of pages1
Volume13278
Edition1
ISBN (Electronic)978-3-031-04749-7
ISBN (Print)9783031047480
DOIs
StatePublished - 12 May 2022
Externally publishedYes
Event26th International Conference on Research in Computational Molecular Biology, RECOMB 2022 - San Diego, United States
Duration: 22 May 202225 May 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13278 LNBI

Conference

Conference26th International Conference on Research in Computational Molecular Biology, RECOMB 2022
Country/TerritoryUnited States
CitySan Diego
Period22/05/2225/05/22

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Identifying Systematic Variation at the Single-Cell Level by Leveraging Low-Resolution Population-Level Data'. Together they form a unique fingerprint.

Cite this