Diego del Alamo
@delalamo.xyz
3.2K followers 2K following 1.8K posts
Computational protein engineering & synthetic biochemistry Opinions my own https://linktr.ee/ddelalamo
Posts Media Videos Starter Packs
delalamo.xyz
The entire J-gene of PDB 7czv (3.3 Å, shown in salmon) seems to have fallen victim to a register/misassignment error, with the C-terminal valines (right) and tryptophan (center-left) on the wrong side of the beta sheet compared to where they would be expected to be
delalamo.xyz
The worst offender I've found so far is PDB 5OMM chain C on Sabdab, which has many unassigned residues despite being 1.7 Å resolution. For example its C-terminal VTVSS starts at IMGT pos 115 instead of pos 124. Probably AbNum/ANARCI struggling w/ the gaps

opig.stats.ox.ac.uk/webapps/sabd...
delalamo.xyz
I've put together a tool to renumber antibody structures using only main chain atomic coords. No sequence info is used at all. In the process I found a few numbering mistakes in Sabdab/PDB, which is numbered by sequence & thus sensitive to gaps/register errors (deleted prev post due to bugs)
GitHub - delalamo/SAbR: Structure-based antibody renumbering
Structure-based antibody renumbering. Contribute to delalamo/SAbR development by creating an account on GitHub.
github.com
delalamo.xyz
I flirted with doing something like this a year or two ago - prepending a mAb-specific structure prediction NN to the AF2 template input for mAb design by hallucination - but was stopped by the fact that every Bio/ML NN is in PyTorch except AF2 which is written in freaking JAX
aixbiobot.bsky.social
mBER: Controllable de novo antibody design with million-scale experimental screening [new]
Ab design via seq/struct conditioning designs VHHs. Million-scale screens show success.
mBER: Controllable de novo antibody design with million-scale experimental screening Figure 1 Figure 2 Figure 3
Reposted by Diego del Alamo
nboyd.bsky.social
It's a very cool result but IMO there are caveats. Inference is (mostly) slower. There is existing work on faster models (e.g. MiniFold or protenix mini), and also existing work on ensemble prediction. I doubt this works without training on AFDB, which bakes in inductive bias from triangle layers
delalamo.xyz
Oh interesting, what’s the TLDR for where these mutations are located? Will read later
delalamo.xyz
Still boggles my mind that we can leave frameworks as-is and only edit CDRs and still reliably get nM binders. Complete opposite of what I would have expected given how hard it is to graft CDRs from one FW to another (such as humanizing mouse mAbs). First RFantibody and now this
arcinstitute.org
In another preprint from the @brianhie.bsky.social Lab and @synbiogaolab.bsky.social, they introduce Germinal, a generative AI system for de novo antibody design.

Germinal produces functional nanobodies in just dozens of tests, making custom antibody design more accessible than ever before.
Reposted by Diego del Alamo
franknoe.bsky.social
Postdoc position available in the BioEmu team at @msftresearch.bsky.social AI for Science - Berlin DE or Cambridge UK. Looking for candidates with backgrounds in #MachineLearning #AI Biophysics or Bioinformatics

jobs.careers.microsoft.com/global/en/jo...
delalamo.xyz
Apple now has a protein folding NN?...

arxiv.org/pdf/2509.18480
delalamo.xyz
How to request your swiss criminal record:
1. Fill out a form online
2. Pay a processing fee, also online
3. Print form and put it in an envelope and MAIL it
4. Receive criminal record by email
delalamo.xyz
It’s a bit goofy. I think RosettaFold2 is also still in peer review and RosettaFold3 is out?
delalamo.xyz
This upcoming week is my last week at GSK. I’m going to move on to greener pastures but first a few weeks off to … close on a house! Go for some bike rides! Paint 40k models! Etc.!
Reposted by Diego del Alamo
delalamo.xyz
Yep. Reminds me of the bar plot error bars that were just capital Ts
delalamo.xyz
Yep. Reminds me of the bar plot error bars that were just capital Ts
delalamo.xyz
Protein language models can predict epistasis (non-additivity of multiple point mutations) if predictions are nonlinearly transformed first www.biorxiv.org/content/10.1...
delalamo.xyz
I posted this on the other site and will repost here: something is fishy with Fig 2F results on inverse folding performance (left). Variance is way too low. Compare that to my results preprinted earlier this year (right). Definitely causing me to scrutinize other results in this paper more closely
delalamo.xyz
Does anyone know of an NN-based drop-in replacement for protein side chain repacking (e.g., Rosetta repack/SCWRL)? Ideally something lightweight
Reposted by Diego del Alamo
lindorfflarsen.bsky.social
New review on computational design of intrinsically disordered proteins 🖥️🍝 by @giuliotesei.bsky.social @fpesce.bsky.social & 👴

doi.org/10.48550/arX...
Figure 3 from the paper with the caption: "Role of machine learning in de novo design of IDRs. (A) Machine-learning models can be trained on diverse data sources, from molecular dynamics simulations to annotations of cellular localization and protein structures from the Protein Data Bank. (B) Often implemented as neural networks using sequence-encoded features as input, these models can initially be trained on a limited region of sequence space as surrogate models. Through active learning, additional simulations are performed during the design campaign to generate new data, and the surrogate model is retrained on the expanded dataset to progressively improve its accuracy. (C) Machine-learning models have been developed to predict biophysical observables, biological annotations, and protein structures. When combined, machine-learning models can be used to identify a set of sequences that strike a trade-off between multiple design objectives, defining a Pareto front."
delalamo.xyz
This was a great read! If it were up to me, this specific part of rule #4 would be expanded into a full Opinion piece, because proteins for structural characterization are definitely not sampled IID from uniprot/wherever
Reposted by Diego del Alamo
stephanieaw.bsky.social
Structural bioinformatics is incredibly powerful on its own or when paired with theory or experiment. One of the PDB's superpowers isn’t from one structure, but comparing many to uncover folds, binding sites, and subtle conformational shifts. chemrxiv.org/engage/chemr...
10 Rules for a Structural Bioinformatic Analysis
The Protein Data Bank (PDB) is one of the richest open‑source repositories in biology, housing over 277,000 macromolecular structural models alongside much of the experimental data that underpins thes...
chemrxiv.org
delalamo.xyz
Time to test their claim that diffusion is competitive with hallucination in generating realistic backbones of huge (>800 AA) proteins...
Reposted by Diego del Alamo
delalamo.xyz
Can anyone familiar with the economics of this situation explain how UK salaries can be 1/3 of US salaries, yet the UK is still described as being more unfriendly for R&D than the US, even when the quality of the work being done is largely the same?