Veera Rajagopal
@doctorveera.bsky.social
1.8K followers 410 following 230 posts
MBBS, MD, PhD | GWAS storyteller | Scientist at Regeneron | Human genetics & drug discovery in Neuroscience & Psychiatry
Posts Media Videos Starter Packs
doctorveera.bsky.social
There seems to be another companion manuscript with this one discussing the long read dataset in a much broader context. The current one is focussed on TRs. This data release marks exciting progress in human genetics in 2025. 9/
doctorveera.bsky.social
Interestingly, these extremely polymorphic repeat loci identified in humans are monomorphic in apes based on available data, suggesting that human-specific expansion. 8/
doctorveera.bsky.social
This observation goes against the concept of mutation constraint: disease loci are usually depleted of variations. It seems for repeat loci it's more complex. 7/
doctorveera.bsky.social
The second most polymorphic repeat at this same locus--CAG repeat in THAP11--is also disease-causing. 6/
doctorveera.bsky.social
For e.g., last year GGC exonic repeat in ZFHX3 was discovered (after nearly 25 years) to be the causal variant in SCA4 locus. The authors find that around the SCA4 locus, GGC repeat is the 5th most polymorphic repeat. 5/
doctorveera.bsky.social
The authors suggest we can actually use this logic to identify causal repeat variants at unresolved disease loci. Just search for the most variable repeats within the linkage region and most likely one of them is the culprit. 4/
doctorveera.bsky.social
Many interesting results in the preprint. One surprising revelation is the authors find that known pathogenic repeat loci are the most variable in the genome (of course, within the benign range). 3/
doctorveera.bsky.social
I remember hearing about this dataset first time at ASHG 2022. Great to see the data out finally. Using 1027 long read genomes, the authors have identified >3.6 billion TR alleles (!) at 1.7 million loci. 2/
doctorveera.bsky.social
Fascinating discovery and may have implications in psychiatric and sleep related disorders. I wonder how many more PTMs are there waiting to be discovered.
doctorveera.bsky.social
Addition and removal of histamine to histones in the neurons in brain circadian rhythm center seems to control the sleep wake cycle.
doctorveera.bsky.social
This adds to the list of histone monoaminylations recently discovered (serotonylation in 2019, dopaminylation in 2020 by the same group of authors).
doctorveera.bsky.social
Another great discovery before we close the year, highlighting the underappreciated role of retroelements in human disease and traits. 5/
doctorveera.bsky.social
This also would mean the target population for ASPA gene therapy (currently in trials by Myrtelle Inc.) might be much larger than expected. 4/
doctorveera.bsky.social
Unlike the recurrent mutations previously found, which were mostly restricted to Ashkenazi Jewish ancestry, the SVA insertion seems to be occurring across all ancestries, making it the most common cause worldwide. 3/
doctorveera.bsky.social
Due to the limitations of short read sequencing, this mutation has remained hidden for more than 25 years since ASPA mutations were first characterized in individuals Canavan disease. 2/
Reposted by Veera Rajagopal
emilymoin.com
Can't comment on the genetics but agree this is a huge "so what?" for clinical practice – we already use patient-specific trends for diagnosis and management and have for decades. The specific examples given in the article (like HCT and CKD) are... amusingly obvious.
Reposted by Veera Rajagopal
jamespirruccello.com
Arguably the reason that GWAS for blood traits have ever worked is that we are proxying the individual set point. So yes the clinical discussion is neat. The genetic implications seem nil.
doctorveera.bsky.social
Likewise, the GWAS of blood count setpoints is almost the same as past GWASs of blood counts, just the phenotypes are a little more precise leading to slightly increased genetic signals. 5/
doctorveera.bsky.social
The extra thing here is the blood counts are measured in terms of "setpoints" (a fancy term for the mean of multiple measurements per individual, excluding outliers) instead of the simple arithmetic mean of all measurements per individuals (which is what we typically do in the UK Biobank). 4/