Antonio Camargo
@apcamargo.bsky.social
740 followers 120 following 41 posts
Posts Media Videos Starter Packs
apcamargo.bsky.social
A BLAST update adding support for compressed files and csv output with headers is a Good Friday night surprise!

blast.ncbi.nlm.nih.gov/doc/blast-ne...
2025 BLAST NEWS — BlastNews 0.1.1 documentation
blast.ncbi.nlm.nih.gov
Reposted by Antonio Camargo
rayanchikhi.bsky.social
🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵

Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.

doi.org/10.1101/2024...
Reposted by Antonio Camargo
benjwoodcroft.bsky.social
Out in @natbiotech.nature.com: Metagenome taxonomy profilers usually ignore unknown species. SingleM is an accurate profiler which doesn't, even detecting phyla with no MAGs. Profiles of 700,000 metagenomes at sandpiper.qut.edu.au. A 🧵
Logo for the Sandpiper website
Reposted by Antonio Camargo
bonsaiseqbioinfo.bsky.social
Preprint alert from the group 🚨 super fast grep-like sequence selection
npmalfoy.bsky.social
A common approach is to use k-mer indexes to identify datasets that share a significant number of k-mers with a query sequence.
This problem has been studied extensively, and I recommend the survey of @camillemrcht.bsky.social, a great introduction to the subject
genome.cshlp.org/content/31/1...
Data structures based on k-mers for querying large collections of sequencing data sets
An international, peer-reviewed genome sciences journal featuring outstanding original research that offers novel insights into the biology of all organisms
genome.cshlp.org
apcamargo.bsky.social
I came across their work by chance too but never really checked out the library. I'm building a Python library for internal use and evaluating backends. I'll probably go with Needletail because PyO3 makes things really easy
apcamargo.bsky.social
Interesting. Did you found it to be substantially faster than needletail?
apcamargo.bsky.social
HMMER. It's everywhere in bioinformatics, not particularly fast, and its development was recently put on hold. It's kind of wild that we still don't have a solid alternative for HMM-to-protein alignment or searching.
apcamargo.bsky.social
Amazing work! I'm looking forward to more articles like these!
Reposted by Antonio Camargo
robert.bio
Excited to announce our first interactive article on sandbox.bio, about genomic ranges: sandbox.bio/concepts/gen...

Move & resize the ranges to see how that affects bedtools operations like merge and intersect in real time!
Reposted by Antonio Camargo
simrouxvirus.bsky.social
New pre-print out \o/ All about CRISPR, metagenomes, and what you learn when you collect (a lot of) spacers from natural communities, with @apcamargo.bsky.social @urineri.bsky.social @lhug.bsky.social but also Uri Gophna, Nikhil George (not on Bsky I think) & others at JGI doi.org/10.1101/2025...
doi.org
Reposted by Antonio Camargo
tsbschm.bsky.social
The Koonin Law of Computatoinal Biology:

Whenever you think you have a great idea in computational or evolutionary biology, it will already have been published by Eugene Koonin in the mid 90ies.
Reposted by Antonio Camargo
microyunha.bsky.social
At Tatta Bio, we have been thinking deeply about the sequence-to-function problem. We believe that before AI can power functional prediction, we first need to rethink how we curate, manage, and share sequence data. Here, we share our initial ideas on what we are building next:
Today's sequence data infrastructure is set up for failure in the age of AI.
Building an open and collaborative sequence platform for both Human and AI scientists.
tattabio.substack.com
Reposted by Antonio Camargo
jimshaw.bsky.social
Announcing myloasm, a new long-read (ONT R10/PacBio) metagenome assembler that I've been working on during my postdoc in the Heng Li lab (@lh3lh3.bsky.social).

myloasm-docs.github.io
myloasm - metagenomic assembly with (noisy) long reads
myloasm-docs.github.io
Reposted by Antonio Camargo
hunemeier.bsky.social
Our new paper is out in @science.org! By exploring the rich genetic diversity of Brazil, we show how fine-scale genomic analyses reveal that this diversity, rooted in Indigenous ancestry and centuries of complex demographic history, plays a key role in population health.
Reposted by Antonio Camargo
Reposted by Antonio Camargo
oschwengers.bsky.social
We happily present: “Bakta Web – rapid and standardized genome annotation on scalable infrastructures” @OxUniPress NAR’s Web Server issue
doi.org/10.1093/nar/...

Easy to use, no registration, fast, scalable, various visualizations, in sync with Bakta CLI:
bakta.computational.bio
(1/5)
Bakta Web – rapid and standardized genome annotation on scalable infrastructures
Abstract. The Bakta command line application is widely used and one of the most established tools for bacterial genome annotation. It balances comprehensiv
doi.org
apcamargo.bsky.social
I don't think so. It was the only assembler that generated clearly wrong assemblies. That said, it may have improved since then
Reposted by Antonio Camargo
acritschristoph.bsky.social
Unique investigation of some errors in long read assemblers.

In particular these remarkably chimeric contigs are 😱, if rare....

Improving long read assemblers is definitely the space to be in when it comes to the future of metagenomics, as short reads won't be part of it 😉
apcamargo.bsky.social
My own experience lines up with what the pre-print quantified. I’ve seen a lot of concatemers in my MetaMDBG assemblies