Dongwook Kim
@dongwookkim.bsky.social
160 followers 92 following 4 posts
Developing fast and easy methods for #phylogenetics and #bioinformatics | PhD in Bioinformatics | Postdoc @ Comparative Genomics Lab, UNIL/SIB🇨🇭| Formerly @ Steinegger Lab, SNU🇰🇷 | he/him
Posts Media Videos Starter Packs
Pinned
dongwookkim.bsky.social
Unicore is now published on GBE 🚀
Unicore rapidly identifies structural single-copy core genes from input species proteomes for phylogenetic analysis. Powered by Foldseek and ProstT5, Unicore enables linear-scale structure-based phylogeny of any given set of taxa. 🧵1/n
📃 doi.org/10.1093/gbe/evaf109
Reposted by Dongwook Kim
lauriebelch.bsky.social
OrthoFinder just dropped a major update

It’s faster, more accurate, and ready for thousands of genomes

Let’s break it down (1/10)

github.com/OrthoFinder/...
www.biorxiv.org/content/10.1...
Reposted by Dongwook Kim
martinsteinegger.bsky.social
Folddisco finds similar (dis)continuous 3D motifs in large protein structure databases. Its efficient index enables fast uncharacterized active site annotation, protein conformational state analysis and PPI interface comparison. 1/9🧶🧬
📄 www.biorxiv.org/content/10.1...
🌐 search.foldseek.com/folddisco
Reposted by Dongwook Kim
dongwookkim.bsky.social
This work was done by talented @sukhwanpark.bsky.social and me, supervised by amazing @martinsteinegger.bsky.social !

Try Unicore now 👉 conda install -c bioconda unicore
Code and tutorial: 🌐 github.com/steineggerlab/unicore
Manuscript: 🌐 doi.org/10.1093/gbe/evaf109
dongwookkim.bsky.social
Unicore is fast, accurate, and universal. Unicore reconstructed consistent phylogeny of bacterial/fungal species, while maintaining linear time scale over the input size. Besides, Unicore works with any given taxa, presenting scalable and universal method for structure-based phylogeny. 🧵3/n
dongwookkim.bsky.social
With Unicore, we identified 13 structural core genes from 166 species across the Tree of Life, where 8 of them could only be defined using structures. Projected on the Tree of Life reconstructed with Unicore, you can see the universally conserved structure of one of the structural core genes. 🧵2/n
dongwookkim.bsky.social
Unicore is now published on GBE 🚀
Unicore rapidly identifies structural single-copy core genes from input species proteomes for phylogenetic analysis. Powered by Foldseek and ProstT5, Unicore enables linear-scale structure-based phylogeny of any given set of taxa. 🧵1/n
📃 doi.org/10.1093/gbe/evaf109
Reposted by Dongwook Kim
martinsteinegger.bsky.social
AFESM: a metagenomic guide through the protein structure universe! We clustered 821M structures (AFDB&ESMatlas) into 5.12M groups; revealing biome-specific groups, only 1 new fold even after AlphaFold2 re-prediction & many novel domain combos. 🧵
🌐 afesm.foldseek.com
📄 www.biorxiv.org/content/10.1...
Reposted by Dongwook Kim
martinsteinegger.bsky.social
Visit our posters at #RECOMB2025 for:

Structural: MSAs, Virus DB, Core Genes, Motif Discovery, Multimer Clustering & Search, pLM Foldseek, Environmental analysis

Metagenomics: Classification & Metabuli App

GPU-based & RNA search, Proteome clustering, Novel Ribozyme discovery

& get Marv stickers!
Reposted by Dongwook Kim
3rdreviewer.bsky.social
Not really my announcement to make--I am but a lesser co-author--but IQ-TREE 3 has just been released!

(Most credit to Minh Bui and @roblanfear.bsky.social and their labs)

ecoevorxiv.org/repository/v...
IQ-TREE 3: Phylogenomic Inference Software using Complex Evolutionary Models
ecoevorxiv.org
Reposted by Dongwook Kim
ebi.embl.org
🚀 #AlphaFold Database update

AlphaFold DB now integrates The Encyclopedia of Domains (TED) – a resource designed to systematically identify & classify structural domains within AlphaFold-predicted protein structures.

www.ebi.ac.uk/about/news/u...

@pdbeurope.bsky.social
Reposted by Dongwook Kim
dessimoz.bsky.social
The PAN-GO paper is a remarkable milestone. It not only provides the most comprehensive picture of human gene function to date, but also carefully maps this knowledge across the tree of life! Congratulations @marcfeuermann.bsky.social, Pascale Gaudet & collaborators!

www.sib.swiss/news/sib-hel...
Reposted by Dongwook Kim
martinsteinegger.bsky.social
In our latest review, we explore 12 deep-learning tools for metagenomic analysis, covering their strengths, limitations, and key applications. We hope it serves as both a resource and inspiration for new ways to analyze metagenomic data. Great work by Eli Levy Karin!
📄 doi.org/10.1093/nsr/...
Reposted by Dongwook Kim
sinamajidian.bsky.social
FastOMA is out now in Nature Methods 🎉: nature.com/articles/s41592-024-02552-8 A new orthology inference algorithm that scales linearly and is highly accurate. FastOMA can process all >2000 eukaryotic UniProt ref proteomes <24 hours 🚀. Try it out github.com/DessimozLab/fastoma @dessimoz.bsky.social
FastOMA retains OMA’s high precision accuracy and even improves upon it in terms of recall, positioning it on the Pareto frontier of orthology inference methods. 
FastOMA is not only fast but also accurate. a, QfO benchmar, agreement with SwissTree reference phylogeny covering manually curated gene trees. The error bars indicate 95% confidence intervals comparing FastOMA with EnsemblCompara, Domainoid, OrthoMCL, Ortholnspector, sonicparanoid, PANTHER, OrthoFinder, Hieranoid26 and the OMA family including OMA pairs, OMA groups and OMA GETHOGs (graph-based efficient technique for HOGs).

c) A computation time comparison of FastOMA and state-of-the-art alternatives.
https://www.nature.com/articles/s41592-024-02552-8
Reposted by Dongwook Kim
martinsteinegger.bsky.social
Unicore identifies single-copy protein structures across genomes using Foldseek, bypassing slow structure predictions by utilizing 3Di predictions from ProstT5, enabling rapid phylogenetic inference at the tree-of-life scale. 1/n
📄 www.biorxiv.org/content/10.1...
💾 github.com/steineggerla...
Reposted by Dongwook Kim
biorxiv-bioinfo.bsky.social
Unicore enables scalable and accurate phylogenetic reconstruction with structural core genes https://www.biorxiv.org/content/10.1101/2024.12.22.629535v1
Reposted by Dongwook Kim
bsky.app
Bluesky @bsky.app · Dec 3
Scientists, academics, researchers: We’re excited to share that @altmetric.com is now tracking mentions of your research on Bluesky! 🧪
altmetric.com
There are already many articles for which there is more attention on Bluesky than on other comparable micro-blogging sites, meaning the academic community and the general public have clearly adopted Bluesky as one of its core places to disseminate and discuss new research.

A Place of Joy.
‘A place of joy’: why scientists are joining the rush to Bluesky
Researchers say the social-media platform — an alternative to X — offers more control over the content they see and the people they engage with.: https://www.nature.com/articles/d41586-024-03784-6
Reposted by Dongwook Kim
adamjschwarz.bsky.social
South Korean citizens helped lawmakers scale the National Assembly walls so they could bypass military barricades and vote against martial law.
Reposted by Dongwook Kim
martinsteinegger.bsky.social
Interested in bioinformatics method development for proteins, structures or metagenomic analysis? Please check out my lab’s starter pack!
🔗 go.bsky.app/VJhXcSs
Reposted by Dongwook Kim
milot.bsky.social
MMseqs2 Release 16 Highlights: GPU-accelerated search📄, ORF or new 6-frame translated search modes, contig taxonomy always keeps the longest ORF, bug fixes (reduced memory and higher sensitivity) and relicensed as MIT
📄 biorxiv.org/content/10.1...
💾 mmseqs.com and 🐍Bioconda 🖥️🧬🧶
Promotional logo for MMseqs2 16 with the MMseqs2 Rocket mascot as a smart phone like App logo
Reposted by Dongwook Kim
roliroberts.bsky.social
What did the Last Eukaryotic Common Ancestor (#LECA) look like? Consensus View in #PLOSBiology; massive authorship including @AncestralState, @lauraeme.bsky.social, John Archbald, @andrewjroger.bsky.social, @dackslabecb.bsky.social, Jeremy Wideman. plos.io/4g0alq4
Reposted by Dongwook Kim
martinsteinegger.bsky.social
Our Big Fantastic Virus Database (BFVD) is now published NAR! It contains protein structure predictions of major viral clades, enhanced by petabase-scale homology search and it's explorable on the web.
🌐 bfvd.foldseek.com
💾 bfvd.steineggerlab.workers.dev
📄 academic.oup.com/nar/advance-...