scDRS: single-cell disease relevance score
Posted on (Update: ) 0 Comments
single-cell disease relevance score (scDRS)
links scRNA-seq with polygeneic disease risk at single-cell resolution, independent of annotated cell types
- scDRS identifies cells exhibiting excess expression across disease-associated genes impacted by GWASs
-
applies to 74 diseases/traits and 1.3 million single-cell gene-expression profiles across 31 tissues/organs
- $n_{cell}$ cells
- $n_{gene}$ genes
- cell-gene matrix $\bfX \in \IR^{n_{cell}\times n_{gene}}$
- $X_{cg}$ represents the expression level of cell $c$ and gene $g$
- $\bfX$ is size-factor-normalized and log-transformed from the original raw count matrix
- regress the covariates out from the normalized data
given a disease GWAS and an scRNA-seq data set
- compute a p-value for each individual cell for association with the disease
- output cell-level normalized disease scores and B sets of normalized control scores that can be used for data visualization and MC-based statistical inference
consists of three steps
- construct a set of putative disease genes from the GWAS summary statistics
- compute a raw disease score and B MC samples of raw control scores for each cell
- after gene set-wise and cell-wise normalization, scDRS computes an association p-value for each cell by comparing its normalized disease score to the empirical distribution of the pooled normalized control scores across all control gene sets and all cells