Giulio Ermanno Pibiri
@jermp.bsky.social
Associate Prof. of CS at Ca' Foscari University of Venice. Indexing, Data Compression, Algorithms.
jermp.bsky.social
what conf are you attending?
jermp.bsky.social
Very strange that arXiv did not accept this review paper. I also recently (some months ago) uploaded a survey to arXiv and the process went smoothly. I suspect there must be other reasons for their decision…
jermp.bsky.social
If I remember correctly, this idea was already used in Cortex or McCortex indices by Z. Iqbal et al.
jermp.bsky.social
Very interesting opportunity!
bacpop.org
The EMBL PhD programme is open until 13th October (entry ~Sep 2026):
www.embl.org/about/info/e...

We have three positions in microbial genomics at EMBL-EBI, including one in my group. Please do apply, or if you know anyone who would be interested, pass it on to them.
EMBL International PhD Programme – Unique in the world and waiting for you!
www.embl.org
jermp.bsky.social
Ah yeah interesting! I remember I also wanted to try something...and reordering by minimizer content came immediately to mind.
jermp.bsky.social
Yes, I agree. Without consulting online forums -- and now, asking ChatGPT -- it's impossible to remember even how to do a basic thing. CMake generates make files. Can we develop a tool that generates the CMake files that generate the make files?! :D
jermp.bsky.social
Oh right, this is the SPIRE week! Enjoy SPIRE everyone and congrats to all contributors :)
camillemrcht.bsky.social
Yohan Hernandez-Courbevoie chaired by Maxime Crochemore just before presenting REINDEER2 at #SPIRE2025 in London
Reposted by Giulio Ermanno Pibiri
robp.bsky.social
Hashing vs. sorting; interesting! reiner.org/hashed-sorting. Also I wonder if, depending on your use case, semi-sorting provides an even greater benefit? 🧬🖥️
Hashed sorting is typically faster than hash tables
Benchmarks and theoretical explanation of why and when hashed radix sort beats hash tables.
reiner.org
jermp.bsky.social
Wow, fantastic guys! Congratulations and fingers crossed! :)
Reposted by Giulio Ermanno Pibiri
curiouscoding.nl
Paraseq 0.4 is out now! With double the throughput for processing paired-end input :)

github.com/noamteyssier...
jermp.bsky.social
After requests from the community, SSHash lands on the Bioconda package index: bioconda.github.io/recipes/ssha.... Cool! Thanks @robp.bsky.social for all the help and support.
Package Recipe 'sshash' — Bioconda documentation
bioconda.github.io
jermp.bsky.social
As in previous years, the workshop will be free to attend, but *registration will be required* in order to participate. The call for abstracts will be announced around mid-November, and registrations will open shortly thereafter. Can't wait to meet you all in Venice :)
jermp.bsky.social
DSB is an annual scientific meeting at the crossroads of computer science and biology. It is a unique forum to discuss compact data structures and their applications for processing data from life sciences.
jermp.bsky.social
We are glad to announce that the next workshop “Data Structures in Bioinformatics” (DSB 2026) will take place in Venice, Italy, on *February 18-19*, 2026. dsb-meeting.github.io/DSB2026/ Book the dates! #DSB26
DSB 2026 Venice - February 18-19
Workshop Data Structures in Bioinformatics
dsb-meeting.github.io
Reposted by Giulio Ermanno Pibiri
robp.bsky.social
Without our own users, I have no idea when I would have learned about this extremely subtle limitation of the minimap2 serialized index github.com/COMBINE-lab/... (reference names > 255 bytes don't roundtrip). Users push your software in ways you might not even think to!
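A minimal sketch of how a 255-byte limit like this can arise, assuming the serializer writes each name's length as a single unsigned byte before the name itself. That layout is an assumption for illustration only, not a confirmed description of the minimap2 index format, and `write_name`, `read_name`, and `roundtrips` are hypothetical helpers:

```python
import struct


def write_name(name: bytes) -> bytes:
    """Serialize a name with a 1-byte length prefix (hypothetical format)."""
    # Keeping only the low 8 bits mimics assigning a length to an
    # 8-bit unsigned integer in C: values above 255 silently wrap.
    length = len(name) & 0xFF
    # Only `length` bytes of the name are written after the prefix.
    return struct.pack("B", length) + name[:length]


def read_name(data: bytes) -> bytes:
    """Read back a name written by write_name."""
    (length,) = struct.unpack_from("B", data, 0)
    return data[1:1 + length]


def roundtrips(name: bytes) -> bool:
    """True if the name survives a write followed by a read."""
    return read_name(write_name(name)) == name
```

With this layout, names of up to 255 bytes survive a write and read unchanged, while longer names come back truncated without any error being raised, which is why a limit of this kind is easy to miss until a user hits it.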
jermp.bsky.social
Congratulations Ragnar!! 🎉
jermp.bsky.social
I only see an improvement for k=63; storing minimizer positions doesn’t give a similar improvement for k=31. I don’t see any immediate explanation, especially given that the extension rate and other query statistics are identical… for a potential paper, we should find an explanation 😌
jermp.bsky.social
I'm about to release SSHash v4.0.0. More details in a future post. Among other things, indexes now store the positions of minimizers instead of super-kmers. Compared to the latest post (last June), this somehow had a very positive effect on streaming queries for larger k (e.g., 63)! ☺️ CC @robp.bsky.social