Paul Medvedev
@pashadag.bsky.social
1.8K followers 160 following 80 posts
Algorithmic Bioinformatics Researcher and Teacher. Posts about research results and educational/mentorship topics (for details, see http://bit.ly/380vX22).
Posts Media Videos Starter Packs
Reposted by Paul Medvedev
benlangmead.bsky.social
I've added 7 videos to my Burrows-Wheeler indexing playlist (www.youtube.com/playlist?lis...), rounding out the r-index series and adding a 5-part series on the move structure. Now 27 videos in that playlist. I aim to add videos on prefix-free parsing, PBWT, Wheeler languages/automata in the future.
Burrows-Wheeler Indexing - YouTube
Videos on : (a) the Burrows-Wheeler Transform (BWT), (b) the FM Index, which uses the BWT to construct a full-text index, (c) Wheeler graphs, (d) r-index, an...
www.youtube.com
Sounds like someone is trying to solve a bidirected flow problem..
i've let the person in charge know
There seems to be a self-contradiction within the CFP, since it also says: "Submissions to peer-reviewed journals other than the partnering ones are also allowed.."
Reposted by Paul Medvedev
biorxiv-bioinfo.bsky.social
Alice: fast and haplotype-aware assembly of high-fidelity reads based on MSR sketching https://www.biorxiv.org/content/10.1101/2025.09.29.679204v1
It could be. Or it could be that the decision process is not consistent? Hard to tell...
I see. Do you know if the list of papers that are posted there get disseminated somehow through mail lists or social media?
They do, but they did not accept our paper. From what we understood, it was because it was a review paper and not novel research
Reposted by Paul Medvedev
recombconf.bsky.social
#RECOMB2026 will be in Thessaloniki, Greece on May 26-29, 2026. Satellites on May 24-25. Save the date!

Το συνέδριο #RECOMB2026 θα πραγματοποιηθεί στη Θεσσαλονίκη, στις 26-29 Μαΐου 2026. Οι δορυφορικές εκδηλώσεις θα διεξαχθούν στις 24-25 Μαΐου 2026. Σημειώστε την ημερομηνία!
Hi Gaurav, I'm not sure what you mean. (But it sounds like you are asking for a library with all these implemented in one place? That would be quite an undertaking! As these things are always evolving, I'd guess it would also not age well.
I guess that if all one wants is to just have a doi for the pdf, there are various options (zenodo, HAL). But if one is looking to have the title "advertised" broadly (as happens with a biorxiv or arxiv preprint), then that's the hard part
I've seen it used for storing datasets but I haven't seen it used for pre-prints. If you have any examples, let me know!
I hadn't heard of it before, but looking at their webpage: "Effective 8/25/2025, we will be suspending submissions to this generalist server hosted by OSF Preprints."
I can appreciate the perspective of bioRxiv about not taking reviews (arXiv is not transparent about their policy). But in the end of the day, the community needs some way to disseminate pre-print reviews that are not just putting them in a shared dropbox folder :(
If you're wondering why we're hosting the pre-print via dropbox, its because arXiv (and bioRxiv) did not accept it (because it is a review). Its a bit disconcerting, because a review is precisely the type of paper that would benefit a lot from pre-publication dissemination and feedback.
Thank you folks for your feedback on our survey about Hash functions in genomic sequence analysis. We've updated the paper and you can see the new version here: tinyurl.com/4kk9ccmt.
Thank you folks for your feedback on our survey about Hash functions in genomic sequence analysis. We've updated the paper and you can see the new version here: tinyurl.com/4kk9ccmt.
Are you referring to the randomness of the *location*? If yes, you could plot the distribution of distances between adjacent errors and overlay it with what would be expected under a Poisson model
Reposted by Paul Medvedev
sinamajidian.bsky.social
Excited to share our EvANI benchmarking workflow, published in Briefings in Bioinformatics doi.org/10.1093/bib/...
Computing average nucleotide identity (ANI) is neither conceptually nor computationally trivial. Its definition has evolved over years, with different meanings and assumptions (1/5)
Figure 1(A) ANI quantifies the similarity between two genomes. ANI can be defined as the number of aligned positions where the two aligned bases are identical, divided by the total number of aligned bases. Historically, ANI was calculated using a single gene family for multiple sequence alignment. Another approach finds orthologous genes between two genomes and reports the average similarity between their CDSs. This method was later extended to whole-genome alignment by identifying local alignments and excluding supplementary alignments with lower similarity. (B) Different ANI tools employ various approaches in calculating ANI values. ANIm, OrthoANI, and FastANI use aligners to identify homologous regions, whereas Mash uses k-mer hashing to estimate similarities. Only alignments with higher similarity represented by green arrows are included in ANI calculations, while red arrows, corresponding to paralogs, are excluded. (C) The proposed benchmarking method evaluates the performance of different tools using both real and simulated data. It assumes that more distantly related species on the phylogenetic tree should have lower ANI similarities. This is measured by calculating the statistics of Spearman rank correlation. We expect a negative correlation between ANI and the tree distance (scatter plot on the right).
https://academic.oup.com/bib/article/doi/10.1093/bib/bbaf267/8160681
Reposted by Paul Medvedev
jimshaw.bsky.social
Preprint out for myloasm, our new nanopore / HiFi metagenome assembler!

Nanopore's getting accurate, but

1. Can this lead to better metagenome assemblies?
2. How, algorithmically, to leverage them?

with co-author Max Marin @mgmarin.bsky.social, supervised by Heng Li @lh3lh3.bsky.social

1 / N
biorxiv-bioinfo.bsky.social
High-resolution metagenome assembly for modern long reads with myloasm https://www.biorxiv.org/content/10.1101/2025.09.05.674543v1
congratulations to you both
Reposted by Paul Medvedev
rayanchikhi.bsky.social
🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵

Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.

doi.org/10.1101/2024...
Reposted by Paul Medvedev