Daniel Seaton
@danieldseaton.bsky.social
380 followers 330 following 4 posts
Human genetics and genomics in drug discovery.
Posts Media Videos Starter Packs
Reposted by Daniel Seaton
delalamo.xyz
An interesting "what have we been doing all these years?" result from this paper is how sub-optimal the widely-used uniform sampling scheme can be (cluster all @50%, sample from all clusters equally). In contrast, strategies that account for the relative differences in cluster size improve val loss
(c) Diversity of PPA-1 data distributions as measured by the CDF of 50% ID cluster sizes.
(d) Validation losses of 1.4B parameter models trained on 80B tokens from different data distributions.
Reposted by Daniel Seaton
anshulkundaje.bsky.social
Single task, lightweight, short-context bp res. profile models often perform on par or outperform current large, multi task, long context models on counterfactual prediction. Much to do to improve.

Bonus: robust, efficient interpretation of syntax

Great collab with @jengreitz.bsky.social lab.
michaeltmont.bsky.social
Through this analysis, we found that chromatin accessibility models outperformed the corresponding expression models for predicting effects on gene expression, similar to previous analysis of data from plasmid reporter assays and natural genetic variation.
Reposted by Daniel Seaton
annasecuomo.bsky.social
📢 new preprint alert: So so excited to share our analysis on the impact of common and rare variants on single-cell gene expression in blood, using WGS and scRNA-seq data from nearly 2,000 individuals and 5.4m cells as part of TenK10K phase 1 🧬 www.medrxiv.org/content/10.1...
🧵👇 (1/n)
danieldseaton.bsky.social
Are you a postgraduate student interested in protein modelling and drug discovery?

We have an exciting opportunity to join our team at GSK for a 6-9 months internship, working on an ambitious cross-department research project. Apply before March 14th!

www.linkedin.com/jobs/view/41...
GSK hiring Computational Biologist in Stevenage, England, United Kingdom | LinkedIn
Posted 11:13:48 PM. Site Name: UK - Hertfordshire - Stevenage, Heidelberg - OfficePosted Date: Feb 28 2025We create a…See this and similar jobs on LinkedIn.
www.linkedin.com
Reposted by Daniel Seaton
danieldseaton.bsky.social
I think the regulatory model (looping etc) first is good with examples that are less extreme and unusual than fto or limb/shh. I think hmgcr and cholesterol is a proximal regulatory effect.
Reposted by Daniel Seaton
anshulkundaje.bsky.social
Our ChromBPNet preprint out!

www.biorxiv.org/content/10.1...

Huge congrats to Anusri! This was quite a slog (for both of us) but we r very proud of this one! It is a long read but worth it IMHO. Methods r in the supp. materials. Bluetorial coming soon below 1/
Reposted by Daniel Seaton
jeffspence.github.io
What do GWAS and rare variant burden tests discover, and why?

Do these studies find the most IMPORTANT genes? If not, how DO they rank genes?

Here we present a surprising result: these studies actually test for SPECIFICITY! A 🧵on what this means... (🧪🧬)

www.biorxiv.org/content/10.1...
Specificity, length, and luck: How genes are prioritized by rare and common variant association studies
Standard genome-wide association studies (GWAS) and rare variant burden tests are essential tools for identifying trait-relevant genes. Although these methods are conceptually similar, we show by anal...
www.biorxiv.org
Reposted by Daniel Seaton
steglelab.bsky.social
We are the Stegle Lab: A bioinformatics group advancing computational methods to study molecular variations and their impact on phenotypes. We are jointly hosted at the German Cancer Research Center (@dkfz.bsky.social) and the European Molecular Biology Laboratory (@embl.org) in Heidelberg, Germany.
Reposted by Daniel Seaton
why.bsky.team
Why @why.bsky.team · Nov 20
To be clear, we do have plans for scaling, we just kinda expected more than a couple days notice before getting blasted with a million new users a day.
The team is rapidly deploying fixes and new software to adapt. More servers in the mail.
Reposted by Daniel Seaton
hilarycmartin.bsky.social
My group's work dissecting the contribution of common variants to rare neurodevelopmental conditions is now out at nature.com/articles/s41..., led by co-first authors Qinqin Huang (not yet on blue sky) and @emiliewigdor.bsky.social . See below for Emilie's tweetorial.
Reposted by Daniel Seaton
ebi.embl.org
Not enough bioinformatics in your Bluesky feed? We’ve got you covered. Follow us for our latest news, exciting life science research, updates from our data resources, new tools and training resources.

Haven't heard of EMBL-EBI? Take a look at what we’re working on. www.ebi.ac.uk/about/our-im...
Our impact
We provide open data that helps scientists understand life and that informs solutions to real-world problems, such as infectious diseases, climate change and food security.
www.ebi.ac.uk
Reposted by Daniel Seaton
jbuenrostro.bsky.social
Hey, a question for the genetics community. Does genetic fine-mapping work well? How often does it miss?

We usually find that most fine-mapped variants do not fall within coding or regulatory regions. Is it a limitation of epigenomics or a limitation of fine-mapping? Please share your thoughts!
Reposted by Daniel Seaton
axelvisel.bsky.social
REX - a mammalian "range extender" element that can turn short-distance enhancers into long-distance enhancers.

New preprint from a collaboration led by Grace Bower and Evgeny Kvon.

doi.org/10.1101/2024...
Schematic overview of the proposed mode of action of the newly discovered REX element. Top: An enhancer can activate a gene at a short distance, but not at at long range. Middle: Presence of (C/T)AATTA motifs within an enhancer enable it to act over long distances. Bottom: Coupling a short-range enhancer to the REX element containing the same motifs turns it into a long-range enhancer.
danieldseaton.bsky.social
Yes, and also how it informs selection of training datasets. Selecting English language text for English language comprehension, augmenting image datasets by including rotated versions of the same images for image recognition.
danieldseaton.bsky.social
I think it already has some physics (eg covalent bond lengths), just less than you might expect (no force field concept). Similar, llms have some knowledge of language (eg the concept of words), just less than you might expect (no verb/noun concept).
Reposted by Daniel Seaton
annasecuomo.bsky.social
Sad to be missing #ASHG23, but check out the talk by the brilliant Wei Zhou talk on Saturday on our new scalable & efficient method for single-cell eQTL mapping!
Reposted by Daniel Seaton
jeffbarrett.eu
This is tremendous news, and the quote that "UK Biobank is the world’s most significant resource for health research" is not an exaggeration. Very happy to see it continue to be sustained, especially as we're doing a workshop tomorrow at #ASHG23 on how to use these data!
www.gov.uk/government/n...
Reposted by Daniel Seaton
kauralasoo.bsky.social
Unfortunately I have to miss #ASHG23 this year, but if you are interested in our group's work, do check out these two posters from Ralf Tambets and Krista Freimann:
Reposted by Daniel Seaton
sashagusevposts.bsky.social
Presentations from our group at ASHG next week: