Angli Xue
@anglixue.bsky.social
140 followers 56 following 16 posts
Postdoc Scientist at Garvan Institute of Medical Research Statistical genetics | Single-cell multi-omics | Complex diseases
Posts Media Videos Starter Packs
Reposted by Angli Xue
slavov-n.bsky.social
Some proteins are primarily regulated by one mechanism: RNA abundance, translation, or clearance.

The regulation of most proteins is dominated by different regulatory mechanisms across cell types.

Gratifyingly, this complex regulation defines simple rules ⬇️

www.biorxiv.org/content/10.1...
Reposted by Angli Xue
alberthenry.bsky.social
1. 🚨New preprint: tinyurl.com/tenk10k-causal.
We explored causal effects of gene expression in immune cell types on complex traits and diseases by combining single-cell expression quantitative trait loci (sc-eQTL) mapping in 5M+ cells from 1,925 donors in TenK10K study and GWAS. 🧵
Single-cell genetics identifies cell type-specific causal mechanisms in complex traits and diseases
Genome-wide association studies (GWAS) have been instrumental in uncovering the genetic basis of complex traits. When integrated with expression quantitative trait loci (eQTL) mapping, they can elucid...
tinyurl.com
anglixue.bsky.social
You are welcome to explore other TenK10K studies for different biological questions:
tinyurl.com/tenk10k-flag... led by
@annasecuomo.bsky.social
tinyurl.com/tenk10k-repeat led by
@htanudisastro.bsky.social
tinyurl.com/tenk10k-causal led by
@alberthenry.bsky.social & Anne Senabouth (14/n)
anglixue.bsky.social
@htanudisastro.bsky.social, @zhenqiao.bsky.social and many others! (12/n)
A special shout out to the core contributors to the TenK10K cohort, Rachael McCloy, Venessa Chin, Katie de Lange, Gemma Figtree, Alex Hewitt, @dgmacarthur.bsky.social,
@drjosephpowell.bsky.social (13/n)
anglixue.bsky.social
A big thanks to all co-authors, especially my supervisor
@drjosephpowell.bsky.social , & TenK10K team, Jayden Fan, Oscar Dong, @lawrencehuang.bsky.social , @petercallen.bsky.social
, Ellie Spenceley, Eszter Sagi-Zsigmond, @blakebowen.bsky.social, @annasecuomo.bsky.social, @alberthenry.bsky.social
anglixue.bsky.social
.. using paired multiome data without QTL information. This improvement further enhanced gene regulatory network inference, leading to the identification of 128 additional transcription factor (TF)–target gene pairs (a 22% increase), some of which show druggable potential. (11/n)
anglixue.bsky.social
scATAC-seq and caQTL signals also boost the gene regulatory network inference, especially when using unpaired multiome data. We inferred peak-to-gene relationships from unpaired multiome data by incorporating caQTL and eQTL, achieving up to 80% higher accuracy compared to (10/n)
anglixue.bsky.social
The genetic impact on chromatin accessibility not only shows cell type-specific patterns but also varies across cell states. We further detected 3,080 caQTLs whose allelic effects showed significant interaction with epi-genetic age. (9/n)
anglixue.bsky.social
Integrating caQTLs with GWAS+eQTL improves fine-mapping of causal variants. We pinpointed 671 credible sets for inflammatory bowel disease, 428 of which are single-variant sets, and replicated a causal variant for ETS2 in monocytes recently reported in Stankey et al. 2024. (8/n)
anglixue.bsky.social
Next, we ask why do GWAS hits often miss eQTLs? We integrated 60 GWAS from disease and blood traits with eQTLs and caQTLs and found caQTL integration yields 9.8–30% more colocalizations than eQTLs alone, particularly at distal elements or loci with multiple causal variants. (7/n)
anglixue.bsky.social
We highlight an interesting example where a chromatin peak chr10:45592479-45592785 shows a negative effect on the gene expression level of MARCH8 in NK cells but a positive effect in Conventional Dendritic Cell 2 (cDC2). (6/n)
anglixue.bsky.social
Integrating caQTL results with eQTLs from scRNA-seq of 1,925 donors and 5.4M cells revealed over 70,000 colocalized signals, including 25,280 candidate cis-regulatory elements (cCREs) further supported by causal inference using Mendelian randomization (MR). (5/n)
anglixue.bsky.social
More than half of caQTLs show cell type–specific patterns. For example, the chromatin peak chr13:24670806–24672096 contains caQTLs in CD4 TCM and CD14 monocytes, and their top variants (13:24671328:T:C in CD4 TCM and 13:24570579:C:A in CD14 Mono) are independent (LD ≈ 0). (4/n)
anglixue.bsky.social
We curated one of the largest population-level (n = 1,042) scATAC-seq data from peripheral blood with WGS data, which enabled us to characterize 440,996 chromatin peaks across 28 immune cell types and mapped 243,273 caQTLs. (3/n)
anglixue.bsky.social
Chromatin accessibility QTLs (caQTLs) directly capture the impact of non-coding variants on elements like enhancer and promoter, yet existing maps lack scale and diversity. Our study delivers a significant cell type–resolved caQTL resource in blood and demonstrates its translational utility. (2/n)
anglixue.bsky.social
New preprint alert: tinyurl.com/tenk10k-multiome. Excited to share our analysis on the impact of genetic variants on single-cell chromatin accessibility in blood, using scATAC-seq and WGS from over 1,000 donors and 3.5M nuclei as part of TenK10K phase 1 🧬
🧵👇 (1/n)
Genetic regulation of cell type-specific chromatin accessibility shapes immune function and disease risk
Understanding how genetic variation influences gene regulation at the single-cell level is crucial for elucidating the mechanisms underlying complex diseases. However, limited large-scale single-cell multi-omics data have constrained our understanding of the regulatory pathways that link variants to cell type-specific gene expression. Here we present chromatin accessibility profiles from 3.5 million peripheral blood mononuclear cells (PBMCs) across 1,042 donors, generated using single-cell ATAC-seq and multiome (RNA+ATAC) sequencing, with matched whole-genome sequencing, generated as part of the TenK10K program. We characterized 440,996 chromatin peaks across 28 immune cell types and mapped 243,273 chromatin accessibility quantitative trait loci (caQTLs), 60% of which are cell type-specific. Integration with TenK10K scRNA-seq data (5.4 million PBMCs) identified 31,688 candidate cis-regulatory elements colocalized with eQTLs; over half (52.5%) show evidence of causal effects mediated via chromatin accessibility. Integrating caQTLs with GWAS summary statistics for 16 diseases and 44 blood traits uncovered 9.8% - 30.0% more colocalized signals compared with using eQTLs alone, many of which have not been reported in prior studies. We demonstrate cell type-specific mechanisms, such as a regulatory effect on IRGM acting through altered promoter chromatin accessibility in CD8 effector memory T cells but not in naive cells. Using a graph neural network, we inferred peak-to-gene relationships from unpaired multiome data by incorporating caQTL and eQTL signals, achieving up to 80% higher accuracy compared to using paired multiome data without QTL information. This improvement further enhanced gene regulatory network inference, leading to the identification of 128 additional transcription factor (TF)-target gene pairs (a 22% increase). These findings provide an unprecedented single-cell map of chromatin accessibility and genetic variation in human circulating immune cells, establishing a powerful resource for dissecting cell type-specific regulation and advancing our understanding of genetic risk for complex diseases. ### Competing Interest Statement L.C., E.B.D., and K.K.H.F. are employed at Illumina Inc. D.G.M. is a paid advisor to Insitro and GSK, and receives research funding from Google and Microsoft, unrelated to the work described in this manuscript. G.A.F reports grants from National Health and Medical Research Council (Australia), grants from Abbott Diagnostic, Sanofi, Janssen Pharmaceuticals, and NSW Health. G.A.F reports honorarium from CSL, CPC Clinical Research, Sanofi, Boehringer-Ingelheim, Heart Foundation, and Abbott. G.A.F serves as Board Director for the Australian Cardiovascular Alliance (past President), Executive Committee Member for CPC Clinical Research, Founding Director and CMO for Prokardia and Kardiomics, and Executive Committee member for the CAD Frontiers A2D2 Consortium. In addition, G.A.F serves as CMO for the non-profit, CAD Frontiers, with industry partners including, Novartis, Amgen, Siemens Healthineers, ELUCID, Foresite Labs LLC, HeartFlow, Canon, Cleerly, Caristo, Genentech, Artyra, and Bitterroot Bio, Novo Nordisk and Allelica. In addition, G.A.F has the following patents: "Patent Biomarkers and Oxidative Stress" awarded USA May 2017 (US9638699B2) issued to Northern Sydney Local Health District, "Use of P2X7R antagonists in cardiovascular disease" PCT/AU2018/050905 licensed to Prokardia, "Methods for treatment and prevention of vascular disease" PCT/AU2015/000548 issued to The University of Sydney/Northern Sydney Local Health District, "Methods for predicting coronary artery disease" AU202290266 issued to The University of Sydney, and the patent "Novel P2X7 Receptor Antagonists" PCT/AU2022/051400 (23.11.2022), International App No: WO/2023/092175 (01.06.2023), issued to The University of Sydney. ### Funding Statement A.X. is supported by NHMRC Investigator grant 2033018. J.E.P. is supported by NHMRC Investigator grant 2034556, and a Fok Family Fellowship; D.G.M. is supported by an NHMRC investigator grant (2009982). G.A.F. and the BioHEART Study have been supported by NHMRC Investigator Grant, NSW Health Office of Health and Medical Research, and the NSW Health Statewide Biobank scheme. ### Author Declarations I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes The details of the IRB/oversight body that provided approval or exemption for the research described are given below: The Human Research Ethics Committee of St Vincent's Hospital gave ethical approval for this work. The National Statement on Ethical Conduct in Human Research of the National Health and Medical Research Council gave ethical approval for this work. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes Raw caQTL summary statistics will be available at Zenodo website prior to acceptance. [https://github.com/powellgenomicslab/tenk10k\_phase1\_multiome][1] [1]: https://github.com/powellgenomicslab/tenk10k_phase1_multiome
tinyurl.com
Reposted by Angli Xue
tuuliel.bsky.social
Our new contribution to the quest to find causal GWAS genes! Sam Ghatan from my lab at @nygenome.org led a systematic comparison of eQTLs and CRISPRi+scRNA-seq screens. TL;DR: they provide highly complementary insights, with ortogonal pros and cons. 🧵👇
www.biorxiv.org/content/10.1...
Reposted by Angli Xue
anshulkundaje.bsky.social
Very excited to announce that the single cell/nuc. RNA/ATAC/multi-ome resource from ENCODE4 is now officially public. This includes raw data, processed data, annotations and pseudobulk products. Covers many human & mouse tissues. 1/

www.encodeproject.org/single-cell/...
Single cell – ENCODEHomo sapiens clickable body map
www.encodeproject.org
anglixue.bsky.social
Hello bluesky world 👋