@kjaganatha.bsky.social
68 followers 150 following 11 posts
Posts Media Videos Starter Packs
Reposted
gnovakovsky.bsky.social
Excited to share my first contribution here at Illumina! We developed PromoterAI, a deep neural network that accurately identifies non-coding promoter variants that disrupt gene expression.🧵 (1/)
kjaganatha.bsky.social
Acknowledging that all benchmarks and related comments are based on expression effects of promoter variants. Should’ve clarified that upfront — my apologies!
kjaganatha.bsky.social
The PromoterAI source code is available at github.com/Illumina/Pro.... Precomputed scores for all promoter SNVs are freely available for academic and non-commercial research. (9/)
GitHub - Illumina/PromoterAI
Contribute to Illumina/PromoterAI development by creating an account on GitHub.
github.com
kjaganatha.bsky.social
In the @genomicsengland.bsky.social cohort, variants prioritized by PromoterAI are enriched in clinically relevant genes. These variants account for 6% of the genetic burden and, when combined with SpliceAI and PrimateAI-3D, match the genetic burden of protein-truncating variants. (8/)
kjaganatha.bsky.social
PromoterAI effectively identifies pathogenic ClinVar variants that disrupt diverse regulatory motifs — often conserved and supported by ChIP-seq evidence. (7/)
kjaganatha.bsky.social
In the @ukbiobank.bsky.social cohort, PromoterAI predictions correlate strongly with protein levels and quantitative traits, suggesting that promoter variants contribute meaningfully to phenotypic variation. (6/)
kjaganatha.bsky.social
PromoterAI’s internal representations reveal three promoter categories: ubiquitously active, bivalent chromatin, and enhancer-like. The enhancer-like category, enriched for TATA boxes, may represent enhancers co-opted as promoters. (5/)
kjaganatha.bsky.social
PromoterAI achieves the best performance across diverse benchmarks spanning RNA, proteins, QTLs, and MPRA. Basenji2 < Enformer < Borzoi @drkbio.bsky.social. ChromBPNet @anshulkundaje.bsky.social (trained on accessibility) does well in MPRA but not in tissues. Evo2 lags far behind. (4/)
kjaganatha.bsky.social
Fine-tuning was done using twin networks that contrasted observed expression between ref and alt alleles, enabling the model to attribute differences to the variant rather than unrelated confounders. This was key to generalizing across unseen genes and datasets. (3/)
kjaganatha.bsky.social
We first trained PromoterAI to predict epigenetic and expression profiles at nucleotide resolution, then fine-tuned it on rare promoter variants associated with aberrant expression. (2/)
kjaganatha.bsky.social
We're thrilled to introduce PromoterAI — a tool for accurately identifying promoter variants that impact gene expression. 🧵 (1/)