Kieran Didi
@kdidi.bsky.social
1.1K followers 200 following 14 posts
🧪 Research Scientist @nvidia and PhD student @Oxford staring at proteins all-day 🧑‍💻 Website/Blog: https://kdidi.netlify.app/ 🤖 GitHub: https://github.com/kierandidi 📚 Prev. Cambridge/Heidelberg
Posts Media Videos Starter Packs
Reposted by Kieran Didi
martinsteinegger.bsky.social
MMseqs2-GPU sets new standards in single query search speed, allows near instant search of big databases, scales to multiple GPUs and is fast beyond VRAM. It enables ColabFold MSA generation in seconds and sub-second Foldseek search against AFDB50. 1/n
📄 www.nature.com/articles/s41...
💿 mmseqs.com
GPU-accelerated homology search with MMseqs2 - Nature Methods
Graphics processing unit-accelerated MMseqs2 offers tremendous speedups for homology retrieval from metagenomic databases, query-centered multiple sequence alignment generation for structure predictio...
www.nature.com
kdidi.bsky.social
For more details read the thread by the man himself: bsky.app/profile/ncor...
ncorley.bsky.social
(1/7)
Training biomolecular foundation models shouldn't be so hard. And open-source structure prediction is important. So today we're releasing two software packages: AtomWorks and RosettaFold3 (RF3)

[https://www.biorxiv.org/content/10.1101/2025.08.14.670328v2](www.biorxiv.org/content/10.1...)
Accelerating Biomolecular Modeling with AtomWorks and RF3
Deep learning methods trained on protein structure databases have revolutionized biomolecular structure prediction, but developing and training new models remains a considerable challenge. To facilita...
www.biorxiv.org
kdidi.bsky.social
An incredible project to witness, led by the most incredible dreamteam @ncorley.bsky.social , @simonmathis.bsky.social and Rohith Krishna with an amazing team inside and outside the Baker lab. Check it out and let us know what you think/contribute to the codebase! 6/6
kdidi.bsky.social
The preprint shows how atomworks leads to better reference conformers (and better predictions!), enables advanced features in RF3 like chirality-aware training or ligand templating and narrows the performance gap to closed-source models. 5/6
kdidi.bsky.social
`atomworks.ml` on the other hand offers advanced dataset featurization and sampling for deep learning workflows, all operating on the canonical AtomArray object from @biotite_python so that all transforms are traceable and generalizable between models. 4/6
kdidi.bsky.social
AtomWorks has two main components: atomworks.io takes a file (cif, sdf, ...) and does parsing, cleaning and more. You can also look at your structures in a notebook or via PyMol thanks to pymol-remote, so you can directly inspect if your code does what you want! 3/6
kdidi.bsky.social
In the past, every BioML model had its own data pipeline, creating loads of overhead. With AtomWorks, >80% of code is shared between models like ProteinMPNN, RF3 or design models. 2/6
kdidi.bsky.social
AtomWorks is out! Building upon @biotite_python, we built a toolkit for all things biomolecules and trained RF3 with it. All open-source, test it via `pip install atomworks`!

AtomWorks: github.com/RosettaCommo...
RF3: github.com/RosettaCommo...
Paper: tinyurl.com/y2w4z65b

1/6
Reposted by Kieran Didi
chaitjo.bsky.social
RosettaFold 3 is here! 🧬🚀

AtomWorks (the foundational data pipeline powering it) is perhaps the really most exciting part of this release!

Congratulations @simonmathis.bsky.social and team!!! ❤️

bioRxiv preprint: www.biorxiv.org/content/10.1...
Reposted by Kieran Didi
ncorley.bsky.social
(1/7)
Training biomolecular foundation models shouldn't be so hard. And open-source structure prediction is important. So today we're releasing two software packages: AtomWorks and RosettaFold3 (RF3)

[https://www.biorxiv.org/content/10.1101/2025.08.14.670328v2](www.biorxiv.org/content/10.1...)
Accelerating Biomolecular Modeling with AtomWorks and RF3
Deep learning methods trained on protein structure databases have revolutionized biomolecular structure prediction, but developing and training new models remains a considerable challenge. To facilita...
www.biorxiv.org
kdidi.bsky.social
Very excited about our latest all-atom generative model proteina, check out the project page (research.nvidia.com/labs/genair/...) and stay tuned for the code release soon!
kdidi.bsky.social
Excited to present my first paper officially as a PhD student now as an ICLR Oral this week! Super fun work with the GenAIR team at NVIDIA.

Talk: Fr 10:54 - 11:06 (Oral Session 3B, Garnet 213-215)
Poster: Fr 15:00-17:30 (Hall 3 + Hall 2B #5)

Come by the poster/reach out to chat
kdidi.bsky.social
Such a fun project to work on with a stellar team! Stay tuned for other things to come here, and see you all in Singapore!
karstenkreis.bsky.social
📢📢 "Proteina: Scaling Flow-based Protein Structure Generative Models"

#ICLR2025 (Oral Presentation)

🔥 Project page: research.nvidia.com/labs/genair/...
📜 Paper: arxiv.org/abs/2503.00710
🛠️ Code and weights: github.com/NVIDIA-Digit...

🧵Details in thread...

(1/n)
Reposted by Kieran Didi
anshulkundaje.bsky.social
Yet another story of issues with benchmarks and evaluations in ML4bio + a much stronger and fair benchmark #bioMLeval
briantrippe.bsky.social
🔥 Benchmark Alert! MotifBench sets a new standard for evaluating protein design methods in motif scaffolding.
Why does this matter? Reproducibility & fair comparison have been lacking—until now.
Paper: arxiv.org/abs/2502.12479 | Repo: github.com/blt2114/Moti...
A thread ⬇️
kdidi.bsky.social
Have a look at our shiny new benchmark for motif-scaffolding in computational protein design! New (and harder) tasks, including a reproducible evaluation pipeline
briantrippe.bsky.social
🔥 Benchmark Alert! MotifBench sets a new standard for evaluating protein design methods in motif scaffolding.
Why does this matter? Reproducibility & fair comparison have been lacking—until now.
Paper: arxiv.org/abs/2502.12479 | Repo: github.com/blt2114/Moti...
A thread ⬇️
Reposted by Kieran Didi
lindorfflarsen.bsky.social
This!

Also well put in this editorial in PLOS Comp Biol:
Putting benchmarks in their rightful place: The heart of computational biology
doi.org/10.1371/jour...
Screenshot from paper that says:
"Developing good and comprehensive benchmarks, in which the performance metrics of each tool reflect its real-world utility, requires a significant effort. For highly competitive and established fields, such as protein structure predictions, community experiments evaluating the methods have been held periodically to provide blinded assessments of prediction performance. These blinded assessments are perhaps the gold standard on how benchmarks should be run. However, in most areas of computational biology, no such regular blinded contests are available. Instead, many tool developers end up generating their own benchmarks, which they publish alongside a newly developed tool to show its improved performance. The downside of this approach is that, if a new approach is developed in parallel to assembly of the benchmark on which it is evaluated, there is a strong selection bias encouraging the authors to report tool development approaches performing well against the benchmark compared to previous tools. This reporting bias makes most benchmarks that accompany newly developed tools questionable. Even if the authors are aware of this problem and take conscious steps to separate benchmark, evaluation method, and method development, subconscious bias may persist and affect the final outcome."
Reposted by Kieran Didi
rne.bsky.social
Our paper on computational design of chemically induced protein interactions is out in @natureportfolio.bsky.social. Big thanks to all co-authors, especially Anthony Marchand, Stephen Buckley and Bruno Correia!

t.co/vtYlhi8aQm
kdidi.bsky.social
So excited for you and what is to come! Onwards and Upwards;)
Reposted by Kieran Didi
rory.bio
Thanks Jascha 🫶

We’re working hard to create sustainable funding mechanisms for open source scientific tooling - understanding the challenge landscape is a key first step!
achterbrain.bsky.social
Rory @rory.bio is working on a large project to accelerate science 🧪 through high-quality & open-source software.

To steer the project it would be amazing to hear from scientists across fields about problems in the scientific process *you* want to see solved! Tell Rory here flywhl-ideas.notion.site
Notion – The all-in-one workspace for your notes, tasks, wikis, and databases.
A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team
flywhl-ideas.notion.site
kdidi.bsky.social
Love PyMOL Remote, one of these tools that does one thing and does it well!
simonmathis.bsky.social
For some more guidance on how to use this, Martin Buttenschön wrote a nice blogpost: www.blopig.com/blog/2024/11...
Reposted by Kieran Didi
kdidi.bsky.social
MSAs go brrr with MMseqs2-GPU! Super fun project, happy to work with and learn from a stellar team of engineers and scientists. Try it out and stay tuned!

📄 Preprint: www.biorxiv.org/content/10.1...
💾 Code: mmseqs.com
🗞️ Blog: developer.nvidia.com/blog/boost-a...