Samuel Sledzieski
@samsl.io
730 followers 220 following 53 posts
Research Fellow @flatironinstitute.org @simonsfoundation.org Formerly @csail.mit.edu @msftresearch.bsky.social @uconn.bsky.social Computational systems x structure biology | he/him | https://samsl.io | 👨🏼‍💻
Posts Media Videos Starter Packs
Pinned
samsl.io
New preprint 🚨--protein language models + MD training ➡️ allosteric networks!

@sonyahanson.bsky.social and I are developing RocketSHP 🚀 for rapid genome-scale inference of local+correlated fluctuations + structure token distributions!

📄: www.biorxiv.org/content/10.1...
💻: github.com/flatironinst...
www.biorxiv.org
Reposted by Samuel Sledzieski
lindorfflarsen.bsky.social
Integrative modelling of biomolecular dynamics

Time-dependent and -resolved experiments combined with computation provide a view on molecular dynamics beyond that available from static, ensemble-averaged experiments

Review w @dariagusew.bsky.social & Carl G Henning Hansen
doi.org/10.48550/arX...
Figure 1 from the review. Caption: Comparison of a schematic example showing static, time-dependent, and time-resolved experiments illustrated by a protein folding process. (a) A static experiment measuring the observable O$_{\text{exp}}$ is shown, which can be modelled as a distribution of simulated values, O$_{\text{calc}}$, representing a conformational ensemble of folded and unfolded states. (b) Shows a time-dependent experiment, where the equilibrium dynamics of reversible folding gives rise to measured transition times $\tau_1$ and $\tau_2$. These can be modelled as equilibrium dynamics, illustrated by a free energy (FE) surface along a chosen degree of freedom (D.O.F.) (c) A time-resolved experiment probes a non-equilibrium process, where the system begins at $t_{0}$ in the folded state. During the observation time $t$ the protein unfolds until $t_{\text{max}}$. At each time point, a distinct ensemble average, O$_{\text{exp}}$, can be observed, reflecting the proteins changing structure. This evolution can be modelled as distributions of O$_{\text{calc}}$ at each time point. These are shown together with a FE surface.
Reposted by Samuel Sledzieski
rodger.bsky.social
These are the quarterbacks since 2010 who complete passes at roughly the rate Microsoft Excel’s AI mode accurately edits spreadsheets
Jake Locker
Davis Webb
Tom Savage
Rex Grossman
Garrett Gilbert
Kellen Clemens
Mark Sanchez
Logan Woodside
Zach Wilson
David Blough
Derek Anderson
Johnny Manziel
samsl.io
I’ll be at MLCB today and tomorrow, and talking about our latest work on modeling protein dynamics 🚀 tomorrow afternoon!
pkoo562.bsky.social
2025 Machine Learning in Computational Biology (#MLCB) meeting starts TODAY (9/10) at 9:30a (EST) at the NY Genome Center in NYC!

We have a great lineup of keynotes, contributed talks, and posters today and tomorrow

Schedule: mlcb.org/schedule

Join for free via livestream: m.youtube.com/@mlcbconf
MLCB - Schedule
The in-person component will be held at the New York Genome Center, 101 6th Ave, New York, NY 10013. All times below are Eastern Time.
mlcb.org
Reposted by Samuel Sledzieski
pkoo562.bsky.social
2025 Machine Learning in Computational Biology (#MLCB) meeting starts TODAY (9/10) at 9:30a (EST) at the NY Genome Center in NYC!

We have a great lineup of keynotes, contributed talks, and posters today and tomorrow

Schedule: mlcb.org/schedule

Join for free via livestream: m.youtube.com/@mlcbconf
MLCB - Schedule
The in-person component will be held at the New York Genome Center, 101 6th Ave, New York, NY 10013. All times below are Eastern Time.
mlcb.org
Reposted by Samuel Sledzieski
bergerlab.bsky.social
Excited to share our recent work: Sparse autoencoders uncover biologically interpretable features in protein language model representations now in PNAS. Thread below 🧵
Reposted by Samuel Sledzieski
diffuseproject.bsky.social
Announcing the Diffuse Project! We're unlocking protein dynamics through diffuse X-ray scattering - the overlooked signal that could revolutionize how we understand protein motion. seemay.substack.com/p/from-syste...
From systems operators to systems architects
Going up a level from data generation to think about the data systems we design and embed
seemay.substack.com
Reposted by Samuel Sledzieski
iscb.bsky.social
The results of the ISCB Leadership vote are in! 

Terms for the ISCB President-Elect and ISCB Vice President will start on January 21, 2026.
Reposted by Samuel Sledzieski
drglam.bsky.social
I just got the notice that all the FlyBase people at Harvard, including me, will be laid off on October 12. I'm devastated.
Reposted by Samuel Sledzieski
Reposted by Samuel Sledzieski
workshopmlsb.bsky.social
Stay tuned for details on the 6th edition of MLSB, officially happening this December in downtown San Diego, CA!
ginaelnesr.bsky.social
The MLSB workshop will be in San Diego, CA (co-located with NeurIPS) this year for its 6th edition in December 🧬🔬

Stay tuned @workshopmlsb.bsky.social as we share details about the stellar lineup of speakers, the official call for papers, and other announcements!🌟
Reposted by Samuel Sledzieski
miroastore.bsky.social
That's a wrap! The results of the first #cryoEM heterogeneity challenge are up on biorxiv!
biorxiv.org/content/10.110
biorxiv.org
Reposted by Samuel Sledzieski
dr-alphalyrae.bsky.social
Everyone working in a STEM field should read this - ‘writing is thinking’
www.nature.com/articles/s44...
Reposted by Samuel Sledzieski
flatironinstitute.org
#FlatironCCB scientists Natalie Sauerwald, Olga Troyanskaya and colleagues leveraged @sparkforautism.bsky.social data to reveal four distinct groups that link autism-related traits with underlying genetics. Read more: www.simonsfoundation.org/2025/07/09/n... #science #biology #autismresearch
New Study Reveals Subclasses of Autism by Linking Traits to Genetics
New Study Reveals Subclasses of Autism by Linking Traits to Genetics on Simons Foundation
www.simonsfoundation.org
samsl.io
We hope this update will significantly expand the types of researchers + compute systems that can use D-SCRIPT! Manuscript with benchmarks + implementation details coming hopefully soon, and thanks again so much to Daniel for his hard work pushing this out!
samsl.io
Along with a bunch of other quality-of-life updates (including a mode specifically designed for host x pathogen/symbiont interactions! 🦠), v0.3.0 will power the sorts of comparative genomics and network analyses that we imagined when we first designed D-SCRIPT.
samsl.io
BMPI makes D-SCRIPT fully parallel in both embedding loading and inference, and can distribute inference to an arbitrary number of available GPUs. At the same time, the blocked inference procedure means that memory usage can be decreased arbitrarily low (with a small trade-off in speed).
samsl.io
In v0.3.0, we solve *both* of these issues, enabling efficient inference on both personal computers and multi-GPU HPC systems. The secret? Our new Blocked Multi-GPU Parallel Inference (BMPI) procedure, led by Daniel Schaffer (github.com/schafferde).
samsl.io
This ends up being extremely wasteful, since D-SCRIPT still made predictions in serial on a single GPU; inference was not possible on personal systems, but much of the compute capacity of these large machines was sitting unused. Whole-genome all-by-all prediction could still take upwards of a week!
samsl.io
Despite this, one of the most common user issues we heard about was that pre-loading embeddings achieved speed by trading off massive memory usage, restricting D-SCRIPT inference at whole-genome scale only to large HPC-like machines.
samsl.io
We designed D-SCRIPT to be extremely high-throughput, and the most common use case we've seen is whole-proteome all-by-all prediction (this use case is at the core of our recent 🎼philharmonic work!)