Kaitlin Samocha
@ksamocha.bsky.social
1.5K followers 230 following 58 posts
Assistant Investigator @ MGH / Broad / HMS. Focus on human genomics and modeling rare variation. She/her
Posts Media Videos Starter Packs
Reposted by Kaitlin Samocha
pragskore.bsky.social
📃 We’re excited to share our latest work, now published in Nature Communications — a major update to the Genome Aggregation Database (gnomAD) that improves allele frequency resolution for two gnomAD-defined genetic ancestry groups using local ancestry inference (LAI).
Improved allele frequencies in gnomAD through local ancestry inference - Nature Communications
This study incorporates local ancestry into the Genome Aggregation Database (gnomAD) to improve allele frequency estimates for admixed populations, enhancing variant interpretation and enabling more accurate and equitable genomic research and clinical care.
www.nature.com
Reposted by Kaitlin Samocha
nickywhiffin.bsky.social
📣 We are recruiting! Please share!!

Are you a bioinformatician / computational scientist who wants to apply your skills to understanding regulatory biology and improving rare disease diagnosis and treatment? 🧠 💻 🧬 🩺

We have two roles available 👇

🧵 1/4
Image of an old building in Oxford with the heading 'postdoc opportunities' and the text 'computational approaches to improve rare disease diagnosis and treatment' and 'Big Data Institute, University of Oxford'
Reposted by Kaitlin Samocha
nickywhiffin.bsky.social
Isn't genetics cool???

Within only 145 nucleotides(!) of a non-coding RNA (RNU4-2) - different variants in distinct regions / structures cause three distinct disorders!!! (all discovered within the last 18 months)

🤯🤓🧬❤️
Schematic of the U4 and U6 snRNAs with coloured annotations to note nucleotides linked to different disorders:
- Teal in the T-loop and Stem III for ReNU syndrome (Chen et al. Nature 2024 and Greene et al. Nature Medicine 2024)
- Red for variants causing a recessive NDD in Stem II, the k-turn and Sm protein binding sites (De Jonghe et al. medRxiv 2025 and Rius & Blakes medRxiv 2025)
- Yellow for the central loop and Retinitis pigmentosa (Quinodoz et al. medRxiv 2025)
Reposted by Kaitlin Samocha
jamesfasham.bsky.social
🗣️ Quote of #ESHG2025 (so far)

"Who licks bone !?!" 🦴
- Johannes Krause

Anyone have that on your bingo card?

Well apparently archeologists do, to distinguish bone from stones and it causes problems in DNA sequencing. 🤔
ksamocha.bsky.social
We are just wrapping up day 1 at #ESGH2025 in beautiful Milan. For those who want some extra fun while listening to the great science, you can play bingo.👇

I know multiple of these have already occurred.
ahoischen.bsky.social
Getting ready for #eshg2025 …and our postdoc @lydiasagath.bsky.social made a nice BINGO card again.
Paying a lot of attention to the entire event will pay off!
Reposted by Kaitlin Samocha
ahoischen.bsky.social
Buongiorno Milano! Ready for a great day 1 of #eshg2025?
Packed program of excellent science 8.30am-8.00pm - plus networking event till 9.30pm to meet many friends, colleagues and collaborators! …andiamo @eshg.bsky.social @eshgyoung.bsky.social
Reposted by Kaitlin Samocha
deciphergenomics.bsky.social
Human Developmental Cell Atlas (HDCA) expression data is now displayed. Expression is displayed in 12 sections of a 6-7 post-conception week human embryo, alongside a sagittal view which displays the region of the embryo represented by each section @mhaniffa.bsky.social
Reposted by Kaitlin Samocha
ksamocha.bsky.social
Yes, I've done this once before for a paper (listing in the system only first and last authors). It is easy to get those handful of people to approve additional authors in the future if the journal requires it.

Not sure if recommended, but it saved time.
Reposted by Kaitlin Samocha
eventswcs.bsky.social
Join leading experts working in #RareDisease research at our #GRD25 conference.

📅 Dates: 9-11 April 2025
💭 Share insights in person

Explore the latest #genomics advances accelerating improvements in clinical care for rare disorders, globally.

⏰Secure your place by 11 March: bit.ly/3BpAe44
Genomics of Rare Disease (hybrid conference)
Dates: 9-11 April 2025
Location: Hinxton Hall Conference Centre, Wellcome Genome Campus, UK and online 

In-person registration deadline: 11 March 2025
Virtual registration deadline: 1 April 2025
ksamocha.bsky.social
Projects like this can’t be completed without many others: thanks to Mark Daly for continued mentorship across the years; critical support and work from @anneotation.bsky.social, @konradjk.bsky.social, Predrag Radivojac; the Hail team; and everyone associated with @gnomad-project.bsky.social.

10/10
ksamocha.bsky.social
If this work feels familiar, it is because it is building off older work from our team originally released for ExAC.

I view this iteration as more of a franchise reboot instead of a sequel – we have new leads, but similar themes.

9/10
ksamocha.bsky.social
As with all gnomAD-led projects, we’ve already shared the data and code. Regions are displayed for v2 on the gnomAD browser, the code can be seen on Github (github.com/broadinstitu...), and MPC scores are available for download.

8/10
ksamocha.bsky.social
There is still much more to learn: using 125k exomes, our median region size is ~450bp and we see a relationship between transcript length and the number of regions we can identify due to statistical power.

7/10
ksamocha.bsky.social
Finally, missense constraint information was incorporated into a deleteriousness metric named MPC (Missense deleteriousness Prediction by Constraint), which separates case from control de novo missense variants well with similar performance to ML models like AlphaMissense.

6/10
ksamocha.bsky.social
In collaboration with Predrag Radivojac and team, we demonstrated that coding bases with < 20% of their expected missense variation achieve moderate support for pathogenicity (PM1) following ACMG/AMP guidelines that can be used for clinical classification.

5/10
ksamocha.bsky.social
Missense depleted regions show an enrichment of (1) de novo missense variants in neurodevelopmental disorder cases compared to controls, (2) partitioned common variant heritability for >260 independent traits from the UK Biobank, and (3) ClinVar pathogenic (P/LP) variants.

4/10
ksamocha.bsky.social
Why try to find subgenic regions that are specifically missense constrained? Splitting up genes reveals patterns of negative and neutral selection that are obscured when looking gene-wide, including highlighting regions that have a large number of known pathogenic variants.

3/10
ksamocha.bsky.social
Co-led by the fabulous Katherine Chao and Lily Wang, we used gnomAD v2 and a recursive search to identify ~28% of canonical transcripts that were split into multiple missense constraint regions (measured by variable missense depletion in gnomAD).

2/10
ksamocha.bsky.social
Recently out on #bioRxiv: our updated approach to identify regional variability in missense mutation intolerance (“constraint”) in protein-coding genes using the gnomAD database.

www.biorxiv.org/content/10.1...

1/10
ksamocha.bsky.social
Some updated guidance on our gnomAD v4 constraint scores: gnomad.broadinstitute.org/news/2024-03...

The @gnomad-project.bsky.social team is hard at work on v4.1 and improvements across the board, so expect more updates.

Thanks to Katherine Chao for spearheading this blogpost.
ksamocha.bsky.social
A freely accessible version of the paper can be found here: rdcu.be/dsVXx
ksamocha.bsky.social
Our data are freely available on the gnomAD browser (gnomad.broadinstitute.org).

A huge thank you to Mike Guo, Laurent Francioli, Sarah Stenton, and Julia Goodrich for their hard work on this. This, of course, wouldn't be possible without everyone involved with @gnomad-project.bsky.social.