Karthik Guruvayurappan
somet3000.bsky.social
Karthik Guruvayurappan
@somet3000.bsky.social
Comp Bio PhD Student in Kushal Dey and Thomas Norman labs at MSK Cancer Center. Interested in gene regulation and sports
Reposted by Karthik Guruvayurappan
Which mutations rewire function of regulatory DNA?

Excited to share SEAM: Systematic Explanation of Attribtuion-based Mechanisms. SEAM is an explainable AI method that dissects cis-regulatory mechanisms learned by seq2fun genomic deep learning models.

Led by @EESetiz

1/N 🧵👇
October 9, 2025 at 12:03 PM
Reposted by Karthik Guruvayurappan
Introducing Spacelink, a unified framework for identifying and prioritizing spatially variable gene programs at tissue and cell type resolution and linking them to disease GWAS studies. Led by Hanbyul Lee & Haochen Sun www.biorxiv.org/content/10.1...
September 25, 2025 at 5:04 PM
Reposted by Karthik Guruvayurappan
I'm really happy to announce that my two main postdoctoral works have been published in the same issue of Developmental Cell! First, the final form of the mouse cortical organoid protocol (tinyurl.com/bddfmx8n), which I recently presented at Development presents (tinyurl.com/36udcpme) and…
tinyurl.com
August 27, 2025 at 4:25 PM
Reposted by Karthik Guruvayurappan
Excited to share this preprint from first author Jon Rosen, a postdoctoral fellow in the @klmohlke.bsky.social lab and my lab. We examine eQTL study sample size and how this affects signal discovery and rates of colocalization with GWAS.

www.biorxiv.org/content/10.1...
Higher eQTL power reveals signals that boost GWAS colocalization
Expression quantitative trait locus (eQTL) studies in human cohorts typically detect at least one regulatory signal per gene, and have been proposed as a way to explain mechanisms of genetic liability...
www.biorxiv.org
August 18, 2025 at 12:18 PM
Reposted by Karthik Guruvayurappan
Our paper is now out in final form at Nature Genetics! For those who missed the preprint, we used large-scale Perturb-seq targeting transcription factors to push primary fibroblasts into diverse transcriptional states, including those observed in cell atlas studies.
August 6, 2025 at 3:14 PM
Reposted by Karthik Guruvayurappan
Excited to share a new preprint from the lab with @ryandhindsa.bsky.social ! www.biorxiv.org/content/10.1...

Led by @sherrynyeo.bsky.social, @erinmayc.bsky.social, and friends, we continue our journey to find viral DNA in our favorite place-- the overlooked and discarded reads in existing data! 1/
July 22, 2025 at 9:59 PM
Reposted by Karthik Guruvayurappan
Excited to share our new preprint on detecting foldback artifacts in long reads with my advisors Matthew Meyerson and @lh3lh3.bsky.social ! Stop by poster C-180 on Wednesday at ISMB/ECCB2025 to learn more and chat!
July 21, 2025 at 2:26 PM
Reposted by Karthik Guruvayurappan
Our work on "Evaluating the representational power of pre-trained DNA language models for regulatory genomics" led by @AmberZqt with help from @NiraliSomia & @stevenyuyy is finally published in Genome Biology! Check it out!

genomebiology.biomedcentral.com/articles/10....
Evaluating the representational power of pre-trained DNA language models for regulatory genomics - Genome Biology
Background The emergence of genomic language models (gLMs) offers an unsupervised approach to learning a wide diversity of cis-regulatory patterns in the non-coding genome without requiring labels of ...
genomebiology.biomedcentral.com
July 16, 2025 at 12:12 PM
Reposted by Karthik Guruvayurappan
We are excited to introduce mRNABench, a comprehensive benchmarking suite that we used to evaluate the representational capabilities of 18 families of nucleotide foundation models on mature mRNA specific tasks.

Paper: doi.org/10.1101/2025...
Code: github.com/morrislab/mR...

A 🧵
mRNABench: A curated benchmark for mature mRNA property and function prediction
Messenger RNA (mRNA) is central in gene expression, and its half-life, localization, and translation efficiency drive phenotypic diversity in eukaryotic cells. While supervised learning has widely bee...
doi.org
July 15, 2025 at 7:09 PM
Reposted by Karthik Guruvayurappan
I recently ran a session at CSAMA with Laurent Gatto and this is the way. "Good enough" with knowledge of the tools that can help you get there. Here's some of the recs from the paper.
July 13, 2025 at 5:13 AM
Reposted by Karthik Guruvayurappan
Celebrating 10 years of our lab with a new preprint:
www.biorxiv.org/content/10.1...
How does enhancer location within a TAD control transcriptional bursts from a cognate promoter?
Experiments by Jana Tünnermann and modelling by Gregory Roth
March 29, 2025 at 12:46 PM
Reposted by Karthik Guruvayurappan
Excited to share our latest by my postdoc Ben KS: we use statistical physics & Bayesian inference to model genome-wide perturbation outcomes. Remarkably, perturbation responses are encoded in gene "chatter" even before the perturbation–a fundamental insight with broad implications
shorturl.at/2LHbw
July 6, 2025 at 10:11 PM
Reposted by Karthik Guruvayurappan
I am very excited to share our latest work where we describe a new method to profile genome-wide chromatin transitions over time in single cells. Great collaborative effort with the van Oudenaarden group @hubrechtinstitute.bsky.social @oncodeinstitute.bsky.social www.nature.com/articles/s41....
Retrospective and multifactorial single-cell profiling reveals sequential chromatin reorganization during X inactivation - Nature Cell Biology
Kefalopoulou, Rullens et al. develop Dam&ChIC to assay chromatin state at two different time points in the same cell. The method was used to study the reorganization of LADs during cell division a...
www.nature.com
July 10, 2025 at 2:36 PM
Reposted by Karthik Guruvayurappan
📣New today!
📄Haplotype analysis reveals pleiotropic disease associations in the HLA region
🧑‍🤝‍🧑 @jkpritch.bsky.social @finngen.bsky.social & colleagues
Haplotype analysis reveals pleiotropic disease associations in the HLA region
The HLA region plays an important role in human health but is often excluded in large-scale GWASs. We performed haplotype-based associations from genotype data to investigate pleiotropy across thousan...
www.cell.com
July 10, 2025 at 4:51 PM
Reposted by Karthik Guruvayurappan
Neuron programming! Pro-neural TFs + 480 morphogen conditions + scRNA-seq --> Diverse iN subtypes of forebrain, midbrain, hindbrain, spinal cord, and PNS. @hsiuchuanlin.bsky.social@jasperjanssens.bsky.social‬ and Treutlein Lab! @science.org www.science.org/doi/10.1126/... #NGN2 #ASCL1
July 11, 2025 at 8:59 PM
Reposted by Karthik Guruvayurappan
Finally out! 🥳 Our paper showing how a transposable element (TE) insertion can cause developmental phenotypes is now published @natgenet.nature.com 🧬🦠🐁
Below is a brief description of the major findings. Check the full version of the paper for more details: www.nature.com/articles/s41588-025-02248-5
Enhancer adoption by an LTR retrotransposon generates viral-like particles, causing developmental limb phenotypes - Nature Genetics
Activation of an LTR retrotransposon inserted upstream of the Fgf8 gene produces viral-like particles in the mouse developing limb, triggering apoptosis and causing limb malformation. This phenotype c...
www.nature.com
July 9, 2025 at 10:05 AM
Reposted by Karthik Guruvayurappan
Delighted to share the first preprint of my postdoc in the @carlanderson.bsky.social lab www.medrxiv.org/content/10.1...! 🚨 A super exciting study I co-led with the very talented @tobioinformatics.bsky.social . Stay tuned to see what we learned about genetic susceptibility to complex disease. 🧬🧵 1/
Cell-type-resolved genetic regulatory variation shapes inflammatory bowel disease risk
Most genetic variants associated with complex diseases lie in non-coding regions, complicating efforts to identify effector genes and relevant cell types. Here, we map cis-eQTLs across 2.2 million sin...
www.medrxiv.org
July 8, 2025 at 8:51 AM
Reposted by Karthik Guruvayurappan
🚨New preprint just dropped 🚨
medrxiv.org/content/10.1101/2025.06.24.25330216
The main output from my PhD is finally public and we’re SUPER excited about the findings! If you’re interested in what we learnt about IBD with a massive 700+ sample sc-eQTL dataset of the gut, read on!
July 8, 2025 at 8:52 AM
Reposted by Karthik Guruvayurappan
New paper:
More than 2700 human 3′UTRs are highly conserved. These 3′UTRs are essential components in mRNA templates, as their deletion decreases protein activity without changing protein abundance. Highly conserved 3′UTRs help the folding of proteins with long IDRs.

www.biorxiv.org/content/10.1...
mRNA 3′UTRs chaperone intrinsically disordered regions to control protein activity
More than 2,700 human mRNA 3′UTRs have hundreds of highly conserved (HC) nucleotides, but their biological roles are unclear. Here, we show that mRNAs with HC 3′UTRs mostly encode proteins with long intrinsically disordered regions (IDRs), including MYC, UTX, and JMJD3. These proteins are only fully active when translated from mRNA templates that include their 3′UTRs, raising the possibility of functional interactions between 3′UTRs and IDRs. Rather than affecting protein abundance or localization, we find that HC 3′UTRs control transcriptional or histone demethylase activity through co-translationally determined protein oligomerization states that are kinetically stable. 3′UTR-dependent changes in protein folding require mRNA-IDR interactions, suggesting that mRNAs act as IDR chaperones. These mRNAs are multivalent, a biophysical RNA feature that enables their translation in network-like condensates, which provide favorable folding environments for proteins with long IDRs. These data indicate that the coding sequence is insufficient for the biogenesis of biologically active conformations of IDR-containing proteins and that RNA can catalyze protein folding. ### Competing Interest Statement The authors have declared no competing interest. Pershing Square Foundation, https://ror.org/04tce9s05 G. Harold & Leila Y. Mathers Foundation National Institutes of Health, DP1GM123454, R35GM144046 Memorial Sloan Kettering Cancer Center, https://ror.org/02yrq0923, P30 CA008748
www.biorxiv.org
July 7, 2025 at 2:28 PM
Reposted by Karthik Guruvayurappan
New work from the lab trying to wrap our heads around the massive complexity of the human transcriptome revealed by long-read RNA-seq! Fun collab with Gloria Sheynkman. www.biorxiv.org/content/10.1...
Perplexity as a Metric for Isoform Diversity in the Human Transcriptome
Long-read sequencing (LRS) has revealed a far greater diversity of RNA isoforms than earlier technologies, increasing the critical need to determine which, and how many, isoforms per gene are biologic...
www.biorxiv.org
July 2, 2025 at 11:46 PM
Reposted by Karthik Guruvayurappan
🧬🚨PREPRINT🚨- The first placental isoform reference from long-read seq & supporting data! We demonstrate how tissue-specific assembly improves transcript quantification with short-read data from multi-ancestry birth studies & reveals placenta-mediated effects of gestational diabetes on birth weight🫄
Long-read transcriptome assembly reveals vast isoform diversity in the placenta associated with metabolic and endocrine function
The placenta plays a critical role in fetal development and mediates maternal metabolic effects on offspring health outcomes. Despite its importance, the placenta remains understudied in large-scale g...
doi.org
July 2, 2025 at 1:08 PM
Reposted by Karthik Guruvayurappan
This a really exciting leap forward for genomic sequence to activity gene regulation models. It is a genuine improvement over pretty much all SOTA models spanning a wide range of regulatory, transcriptional and post-transcriptional processes. 1/
Excited to launch our AlphaGenome API goo.gle/3ZPUeFX along with the preprint goo.gle/45AkUyc describing and evaluating our latest DNA sequence model powering the API. Looking forward to seeing how scientists use it! @googledeepmind
June 25, 2025 at 4:18 PM
Reposted by Karthik Guruvayurappan
How do non-coding variants in enhancers lead to disease? Happy to share our recent work, led by @ewholling.bsky.social, in which we discovered that poised chromatin sensitizes enhancers to aberrant activation by non-coding mutations, contributing to disease. www.biorxiv.org/content/10.1... 1/
June 23, 2025 at 12:57 PM
Reposted by Karthik Guruvayurappan
Textbooks: “Enhancers are just a bunch of TFBSs”

But how do they REALLY work?

New paper with many contributors here @berkeleylab.lbl.gov, @anshulkundaje.bsky.social, @anusri.bsky.social

A 🧵 (1/n)

Free access link: rdcu.be/erD22
June 18, 2025 at 5:56 PM
Reposted by Karthik Guruvayurappan
Discovery of NANOG enhancers and their essential roles in self-renewal and differentiation in human embryonic stem cells: Stem Cell Reports www.cell.com/stem-cell-re...
Discovery of NANOG enhancers and their essential roles in self-renewal and differentiation in human embryonic stem cells
Huangfu and colleagues identified two NANOG enhancers from a CRISPR interference (CRISPRi) screen in hESCs. Heterozygous deletions of either enhancer significantly lowered NANOG expression, reduced hE...
www.cell.com
June 2, 2025 at 11:22 PM