James Lingford
@jameslingford.bsky.social
160 followers 230 following 60 posts
PhD student in structural biology with @greening.bsky.social and @knottrna.bsky.social at Monash Uni. (he/him) Interested in hydrogenases, evolution, protein design. 💻 https://www.jameslingford.com/
Posts Media Videos Starter Packs
jameslingford.bsky.social
Switched from using pandas to @pola.rs and it has been a total life saver. Handling dataframes is way more intuitive with the method chaining syntax. Prettier terminal printing, better error messages, lightning fast, fewer headaches.
jameslingford.bsky.social
Recent advances in the inference of deep viral
evolutionary history journals.asm.org/doi/10.1128/...

A review on structural phylogenetics utility and limitations
Reposted by James Lingford
ikalvet.bsky.social
RFdiffusion2 is now live!
github.com/RosettaCommo...

You can now design proteins, and in particular enzymes from just partially defined amino acid side chains, and without defining their sequence position or order!
jameslingford.bsky.social
• a big concern I have with these LLMs is how they might end up changing our attitudes around productivity. The promise is instant results & faster publications. But the cost will end up being garbage results that are only caught when it's too late
jameslingford.bsky.social
Just to piggyback on this critique of LLMs for coding with some of my own critiques:

• it's a bad idea to install a tool on your machine that has vast read+write permissions
• code that works but is really outputting false info is a real danger and hard to catch. LLMs make that danger worse
wcratcliff.bsky.social
Not sure how many scientists here have tried Claude Code or similar command line coding assistants. I had a complicated family property tax problem that was best solved by a brute force Monte Carlo simulation approach, so I spent a few days coding up and analyzing a model with Claude Code.
Reposted by James Lingford
martinsteinegger.bsky.social
MMseqs2 v18 is out
- SIMD FW/BW alignment (preprint soon!)
- Sub. Mat. λ calculator by Eric Dawson
- Faster ARM SW by Alexander Nesterovskiy
- MSA-Pairformer’s proximity-based pairing for multimer prediction (www.biorxiv.org/content/10.1...; avail. in ColabFold API)
💾 github.com/soedinglab/M... & 🐍
Reposted by James Lingford
macwiatrak.bsky.social
💥 Excited to introduce Bacformer 🦠 - the first foundation model for bacterial genomics. Bacformer represents genomes as sequences of ordered proteins, learning the “grammar” of how genes are arranged, interact and evolve.

Preprint 📝: biorxiv.org/content/10.1...

🧵 1/n
Reposted by James Lingford
kassipan.bsky.social
Excited to share our work on WitChi! 🛠️🖥️
We tested it on the GTDB r220 archaeal supermatrix (5,869 taxa & 10,101 cols) removing 55% of sites in <2h.

The phylogeny showed several interesting groupings with overall improved branch support:
#phylogenetics #ArchaeaSky #MSA #opensource #MEvoSky #MicroSky
Reposted by James Lingford
zeqianli.bsky.social
New tgv release: local cache!

tgv download hg38

Download UCSC reference genomes to a local sqlite db for much faster browsing. Awesome Rust tools (twobit, bigtools) made this simple.

github.com/zeqianli/tgv
GitHub - zeqianli/tgv: Explore 5,000+ genomes in the terminal. Light, blazing fast 🚀, vim-motion.
Explore 5,000+ genomes in the terminal. Light, blazing fast 🚀, vim-motion. - zeqianli/tgv
github.com
Reposted by James Lingford
benjwoodcroft.bsky.social
Out in @natbiotech.nature.com: Metagenome taxonomy profilers usually ignore unknown species. SingleM is an accurate profiler which doesn't, even detecting phyla with no MAGs. Profiles of 700,000 metagenomes at sandpiper.qut.edu.au. A 🧵
Logo for the Sandpiper website
jameslingford.bsky.social
Got this setup working where I can now run .ipynb notebooks right from inside the terminal with a combination of neovim, quarto, kitty, and this neovim plugin called molten: github.com/benlubas/mol...
Never have to abandon my precious vim setup again
jameslingford.bsky.social
Been eagerly awaiting this one. Amazing work
martinsteinegger.bsky.social
Folddisco finds similar (dis)continuous 3D motifs in large protein structure databases. Its efficient index enables fast uncharacterized active site annotation, protein conformational state analysis and PPI interface comparison. 1/9🧶🧬
📄 www.biorxiv.org/content/10.1...
🌐 search.foldseek.com/folddisco
Reposted by James Lingford
martinpacesa.bsky.social
We have written up a tutorial on how to run BindCraft, how to prepare your input PDB, how to select hotspots, and various other tips and tricks to get the most out of binder design!

github.com/martinpacesa...
jameslingford.bsky.social
Some matplotlib work in progress
Reposted by James Lingford
wcratcliff.bsky.social
1/27 We have a new paper out! Turns out that snowflake yeast have been hiding a secret from us - they've evolved a (very!) crude circulatory system. Not with blood vessels or a heart, but through spontaneous fluid flows powered by their metabolism. 🧪🔬

www.science.org/doi/full/10....
jameslingford.bsky.social
Closer... I think at this point the solution lies in manually making a list of the hex codes, but that's for another day.
jameslingford.bsky.social
Revisiting this topic now that I've forced myself to use PyMOL. Using this script to install the viridis family of colour palettes: github.com/smsaladi/pym... and running '"spectrum count, palette=magma, MODEL_NAME". The palette is not there still. They must modify the magma palette somehow