An empirical Bayes method for differential expression analysis of single cells with deep generative models

Pierre Boyeau, Jeffrey Regier, Adam Gayoso, Michael I. Jordan, Romain Lopez, Nir Yosef

Research output: Contribution to journalArticlepeer-review

Abstract

Detecting differentially expressed genes is important for characterizing subpopulations of cells. In scRNA-seq data, however, nuisance variation due to technical factors like sequencing depth andRNAcapture efficiency obscures the underlying biological signal. Deep generative models have been extensively applied to scRNA-seq data, with a special focus on embedding cells into a low-dimensional latent space and correcting for batch effects. However, little attention has been paid to the problem of utilizing the uncertainty from the deep generative model for differential expression (DE). Furthermore, the existing approaches do not allow for controlling for effect size or the false discovery rate (FDR). Here, we present lvm-DE, a generic Bayesian approach for performing DE predictions from a fitted deep generative model, while controlling the FDR. We apply the lvm-DE framework to scVI and scSphere, two deep generative models. The resulting approaches outperform state-of-the-art methods at estimating the log fold change in gene expression levels as well as detecting differentially expressed genes between subpopulations of cells.

Original languageEnglish
Article numbere2209124120
JournalProceedings of the National Academy of Sciences of the United States of America
Volume120
Issue number21
DOIs
StatePublished - 16 May 2023

All Science Journal Classification (ASJC) codes

  • General

Fingerprint

Dive into the research topics of 'An empirical Bayes method for differential expression analysis of single cells with deep generative models'. Together they form a unique fingerprint.

Cite this