Grant Rotskoff
@grant.rotskoff.cc
980 followers 110 following 19 posts
Statistical mechanic working on generative models for biophysics and beyond. Assistant professor at Stanford. https://statmech.stanford.edu
Posts Media Videos Starter Packs
Reposted by Grant Rotskoff
Reposted by Grant Rotskoff
spmontecarlo.bsky.social
Big fan of this perspective:
Reposted by Grant Rotskoff
andrew.diffuse.one
The plan at FutureHouse has been to build scientific agents for discoveries. We’ve spent the last year researching the best way to make agents. We’ve made a ton of progress and now we’ve engineered them to be used at scale, by anyone. Free and on API.
grant.rotskoff.cc
What an incredibly cool paper! While knot theory strictly applies to closed curves, Tommy, @smnlssn.bsky.social , and @paulrobustelli.bsky.social show that writhe, a knot "non-invariant" that changes with smooth deformations, provides a meaningful descriptor for flexible conformations.
paulrobustelli.bsky.social
Presenting one of my favorite manuscripts I've ever worked on:

"Characterizing structural and kinetic ensembles of intrinsically disordered proteins using writhe"

www.biorxiv.org/content/10.1...

by Tommy Sisk, with a generative modeling component done in collaboration with @smnlssn.bsky.social
Reposted by Grant Rotskoff
lindorfflarsen.bsky.social
Our review on machine learning methods to study sequence–ensemble–function relationships in disordered proteins is now out in COSB

authors.elsevier.com/sd/article/S...
Led by @sobuelow.bsky.social and Giulio Tesei
Figure from the paper illustrating sequence–ensemble–function relationships for disordered proteins. ML prediction (black) and design (orange) approaches are highlighted on the connecting arrows. Prediction of properties/functions from sequence (or vice versa, design) can include biophysics approaches via structural ensembles, or bioinformatics approaches via other hetero- geneous sources. The lower panels show examples of properties and functions of IDRs for predictions or design targets. ML, machine learning; IDRs, intrinsically disordered proteins and regions.
grant.rotskoff.cc
Amazingly, this trick works. Due to improved algorithms for learning the score coming from the generative modeling, the applicability of this approach is very broad. I had spent several years making false starts on implementing the Malliavin calculus, but in the end, we found a route around it ;)
grant.rotskoff.cc
Jérémie Klinger found a simple trick back to Girsanov: you take the perturbation in the diffusion at the level of the Fokker-Planck equation and rewrite it to be included in the drift. The resulting drift then has a term proportional to \nabla \log \rho(x,t), what machine learners call the score!
grant.rotskoff.cc
The “classical” strategy for diffusion sensitivities comes from financial mathematics and is called the Malliavin calculus, it’s very explicit for simple models like Black Scholes but for a general Langevin equation, it is no easy feat to compute the sensitivity.
grant.rotskoff.cc
In many biological and active systems, diffusivity is highly spatially dependent, and the theory for perturbations in such cases is rather limited, largely based on beautiful work by Leticia Cugliandolo and work by Falasco and Baeisi, among many others.
grant.rotskoff.cc
Far from equilibrium, it is not so easy: one needs to understand the dynamics, and this requires working with dynamical trajectories and their associated path measures. Classically, we do this using the Girsanov theorem, which constructs a “relative path measure” as we perturb the drift term.
grant.rotskoff.cc
Computing response functions or “sensitives” requires understanding how an external perturbation drives the change in some observable. For equilibrium systems, Onsager taught us that this can be understood with correlation functions.
grant.rotskoff.cc
Excited to see our paper “Computing Nonequilibrium Responses with Score-Shifted Stochastic Differential Equations” in Physical Review Letters this morning as an Editor’s Suggestion! We uses ideas from generative modeling to unravel a rather technical problem. 🧵 journals.aps.org/prl/abstract...
Reposted by Grant Rotskoff
sgrodriques.bsky.social
Applications for the FutureHouse Independent Postdoctoral Fellowship are due in two weeks! $125k annual stipend, full access to our resources, be coadvised by world class professors and apply our AI science agents to make new discoveries. Apply!

Details here: www.futurehouse.org/fellowship
Reposted by Grant Rotskoff
lindorfflarsen.bsky.social
Ten simple rules for developing good reading habits during graduate school and beyond

To me, the most important are:
Read often, read broadly (incl. older papers and outside your field), and learn to read some papers in detail and others more superficially (and quickly)
Ten simple rules for developing good reading habits during graduate school and beyond by Marcos Méndez
1: Develop the habit of reading on a daily basis
2: Read thoroughly to build a sound background understanding of your topic
3: Do not ignore the pillars of your discipline; read the classics
4: If you have to get familiar with a new topic, consider reading in chronological order
5: Avoid narrow-mindedness by reading beyond your discipline
6: Create a list of relevant journals
7: Not all interesting stuff will appear in articles; read books
8: Use a reference manager to keep track of your literature
9: Keep a long-term review for your own use as a way to remember what you read
10: Build your own library to make yourself independent and inspire others
grant.rotskoff.cc
I am hiring a postdoctoral scholar with a start date summer or fall 2025. Projects will be focused on thermodynamically consistent generative models, broadly defined. If you’re interested, please send a CV and one paragraph about why you think you’d be a good fit to [email protected]
grant.rotskoff.cc
Lot of cool stuff in here. Consistent with my working hypothesis that the main scientific utility of LLMs at the moment is plain old NLP
handle.invalid
🎅🏼 A small early Christmas present from our team.

To celebrate the publication of our data extraction tutorial in Chem Soc Rev, we made it easy to run it — without any installation — on a JupyterHub of the Base4NFDI.

🎥 Video intro to the JupyterHub deployment: youtu.be/l-5QNUo1fcU
From text to insight: large language models for chemical data extraction
The vast majority of chemical knowledge exists in unstructured natural language, yet structured data is crucial for innovative and systematic materials design. Traditionally, the field has relied on m...
pubs.rsc.org
grant.rotskoff.cc
Really cool opportunity via futurehouse. Come work with them and collaborate with us at Stanford!
sgrodriques.bsky.social
FutureHouse is launching an independent postdoctoral fellowship program for exceptional researchers who want to apply our automated science tools to specific problems in biology and biochemistry, in collaboration with world-leading academic labs. 1/
grant.rotskoff.cc
If you didn't see our poster at NeurIPS on how to make diffusion model inference fast, you can always read the paper here: arxiv.org/abs/2405.15986
grant.rotskoff.cc
Can verify that the code works, too :)
grant.rotskoff.cc
@franknoe.bsky.social presented this very impressive work at a fantastic @cecamevents.bsky.social workshop this week. I’m very excited to take a deep dive into the details this weekend!
franknoe.bsky.social
Super excited to preprint our work on developing a Biomolecular Emulator (BioEmu): Scalable emulation of protein equilibrium ensembles with generative deep learning from @msftresearch.bsky.social ch AI for Science.

www.biorxiv.org/content/10.1...
grant.rotskoff.cc
If you're at NeurIPS next week come see our spotlight poster led by Yinuo Ren and Haoxuan Chen! We use the parallel sampling technique to rigorously establish a big acceleration for diffusion model inference! neurips.cc/virtual/2024...
NeurIPS Poster Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time ComplexityNeurIPS 2024
neurips.cc
grant.rotskoff.cc
There's no easy way to do this in general, but computing the stationary distribution for a nonequilibrium dynamics might be a possible in some low dimensional system or systems with special structure ( ones where you can represent the distribution with tensor networks). Simulations otherwise...