John Butts
@j-c-butts.bsky.social
52 followers 52 following 11 posts
Biomedical Sciences PhD Candidate at The University of Maine and The Jackson Laboratory Genetics. Music. Tennis. (Not necessarily in that order)
Posts Media Videos Starter Packs
j-c-butts.bsky.social
We have produced a comprehensive catalog of non-coding variant effects and make these predictions available to the community. We encourage researchers interested in gene regulation across fields to explore our precomputed predictions or generate their own to guide their future experiments!
j-c-butts.bsky.social
Lastly we investigate all human promoters by saturation mutagenesis, identifying canonical promoter TFs and linking non-coding variant effect size to coding constraint (LoEUF), bridging the gap between coding and non-coding function.
j-c-butts.bsky.social
MPAC can scale to predict 514M gnomAD variant effects and we quantify the relationship between allele frequency or evolutionary conservation with predicted skew at an unprecedented level. Notably, we find that variants causing high skew are under greater constraint than those with small effects.
j-c-butts.bsky.social
In COSMIC we identify known non-coding driver mutations (TERT) and by combining variant recurrence, regulatory element annotations, and cancer-associated promoters we nominate 1,892 emVars as putative non-coding drivers.
j-c-butts.bsky.social
Many clinically identified non-coding variants lack clear effects, using MPAC we can predict the impact of all ClinVar non-coding variants and observe enrichments in pathogenic alleles for highly disruptive variants (emVars).
j-c-butts.bsky.social
MPAC predictions distinguish causal variants from the UK Biobank, Biobank Japan and eQTLs from GTEx with experimental accuracy but without experimental overhead!
j-c-butts.bsky.social
Trained on MPRA data from a large-scale study of human trait and eQTL variants (www.biorxiv.org/content/10.1...) and extending the Malinois model architecture (www.nature.com/articles/s41...) MPAC predicts variant effects with high accuracy.
j-c-butts.bsky.social
Massively Parallel Reporter Assays (MPRAs) quantify the activity of 10-100s of thousands of sequences, however, it is not feasible to test all known variation. Modeling MPRA can increase scale and lead to better understanding of complex traits, somatic and germline diseases, and population genetics.
j-c-butts.bsky.social
Excited to share our MPAC preprint, a scalable ensemble of ML models for genome-wide non-coding variant effect prediction and our findings from 575M predictions across databases including @ukbiobank.bsky.social, GTEx, ClinVar, COSMIC, and @gnomad-project.bsky.social
www.biorxiv.org/content/10.1...
| bioRxiv
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
https://www.biorxiv.org/content/10.1101/2025.04.16.648420v1\