Wei-Lin Qiu
@613weilin.bsky.social
60 followers 18 following 12 posts
Postdoctoral Researcher in Robin Andersson lab at University of Copenhagen
Posts Media Videos Starter Packs
613weilin.bsky.social
Many thanks to our amazing team, especially co-first author @mayayayas.bsky.social, and supervisors @randersson.bsky.social, @jengreitz.bsky.social

12/12
613weilin.bsky.social
You can run scE2G on your data today using our pipeline here: github.com/EngreitzLab/...

We are looking forward to hearing your feedback on how scE2G works on your dataset!

11/12
GitHub - EngreitzLab/scE2G at v1.0
Pipeline to run scE2G. Contribute to EngreitzLab/scE2G development by creating an account on GitHub.
github.com
613weilin.bsky.social
We also integrate scE2G predictions with orthogonal information to prioritize causal genes and cell types for noncoding variants associated with complex traits.

For example, here we nominate regulatory interactions linking INPP4B and IL15 to lymphocyte counts in T cells.

9/12
613weilin.bsky.social
For example, here we identify cell-type specific links for SPTA1 in erythroblasts and normoblasts.

8/12
613weilin.bsky.social
We applied scE2G to over 40 cell types from PBMCs, BMMCs, and pancreatic islets, validating that scE2G predictions reflect expected patterns of cell-type specificity.

7/12
613weilin.bsky.social
We show that scE2G has robust performance for cell types with at least 2 million total ATAC fragments and 1 million RNA UMIs – about 200-400 cells from a typical 10x Multiome experiment in a tissue.

6/12
613weilin.bsky.social
Key features in scE2G include 1) ABC score, 2) Kendall correlation between peak accessibility and gene expression, and 3) whether the gene is “ubiquitously-expressed”.

Notably, the Kendall correlation improves long-range predictions and appears to detect stochastic transcriptional bursting.

5/12
613weilin.bsky.social
In systematic benchmarking against CRISPR perturbations (below), fine-mapped eQTLs, and GWAS variant-gene associations, scE2G models outperforms existing single-cell models and distance-derived baselines.

We applied and extended ENCODE benchmarking pipelines: www.biorxiv.org/content/10.1...

4/12
613weilin.bsky.social
Using a gold-standard CRISPR perturbation dataset, we trained two logistic regression models: scE2G (ATAC) and scE2G (Multiome), that use single-cell ATAC-seq or single-cell multiomic ATAC and RNA-seq data, respectively.

3/12
613weilin.bsky.social
scE2G tackles the challenge of building cell-type-specific maps of enhancer-gene regulation.

If we can do this well, we can build enhancer maps across thousands of human cell types from emerging single-cell atlases to interpret genetic variants and understand gene regulation.

2/12