SMT
pp0196.bsky.social
Sequences and consequences.



Picture credit: Cellular landscape cross-section through a eukaryotic cell, by Evan Ingersoll
Reposted by SMT
AlphaGenome is out in @nature.com today along with model weights! 🧬

📄 Paper: www.nature.com/articles/s41...

💻 Weights: github.com/google-deepm...

Getting here wasn’t a straight path. We discussed the story behind the model, paper & API in the following roundtable: youtu.be/V8lhUqKqzUc
January 28, 2026 at 9:02 PM
Reposted by SMT
AniAnn's: alignment-free annotation of tandem repeat arrays using fast average nucleotide identity estimates 🥨 www.biorxiv.org/content/10.6... 🧬💻🧪 github.com/marbl/anianns
January 29, 2026 at 3:00 PM
Reposted by SMT
I taught (and co-taught) a course on human population genetics from 2000-2024. Having retired, I'm now making all the course materials public: github.com/alanrogers/p... #popgen #evbio
GitHub - alanrogers/popgen: A course on population genetics
November 27, 2025 at 7:10 PM
Reposted by SMT
Balancing selection alert!! 🧬🧪

New preprint where we try to quantify how likely (or rather, unlikely) it is for balancing selection to maintain stable polymorphism, and how easy (or rather, challenging) it is to identify its signatures in genomes.

#Popgen #MolecularEvolution #EvoBio #Science
A new preprint from the lab, with postdoc @deboraycb.bsky.social and collaborators @aidaandres.bsky.social and Tim Connallon:

“Characterising the detectable and invisible fractions of genomic loci under balancing selection”
www.biorxiv.org/content/10.6...
January 21, 2026 at 4:11 PM
#Rstats

{interprocess} : Mutexes, Semaphores, and Message Queues for R by Daniel P. Smith and co-workers

github.com/cmmr/interpr...
GitHub - cmmr/interprocess: Mutexes, Semaphores, and Message Queues for R
January 27, 2026 at 10:58 PM
Reposted by SMT
I merged a PR for mirai today (fixing an esoteric bug), that came with a performance boost that I'd never have thought existed. That means that on my laptop, the default (with dispatcher) round-trip performance now dips into sub-100 microseconds territory!! Get it now: `pak::pak("r-lib/mirai")`
January 26, 2026 at 9:04 PM
Reposted by SMT
Automated #SQL formatting within #rstats files has just been added to the `duckdb-r-editor` #positron extension. github.com/belian-earth... ✌️
January 27, 2026 at 1:06 AM
Reposted by SMT
#dbt and #rstats friends, sorry about my past whining. I wrote up all my issues here to make your life easier, with even a tiny shoutout to #duckdb in there too!

github.com/eriksquires/...
GitHub - eriksquires/dbt_wrong: Everything I did wrong with DBT
January 25, 2026 at 6:42 PM
#Rstats #dplyr #duckdb community extension by ChanYub Park

Use dplyr syntax in #duckdb
duckdb.org/community_ex...
January 26, 2026 at 8:16 PM
#RStats is there a DuckLake package out there? Or should I stop raw-dogging SQL strings and make a proper package for the projects I am using DuckLake for?
January 26, 2026 at 8:03 PM
January 26, 2026 at 6:31 PM
#RStats even when one (loosely) knows intellectually how it works, the event loop feels like magic
Now, how would one make this (if possible at all) print "Got here i !"
January 22, 2026 at 7:09 PM
Reposted by SMT
orthogene: a Bioconductor package to easily map genes within and across hundreds of species www.biorxiv.org/content/10.6... #Rstats bioconductor.org/packages/ort...
January 22, 2026 at 3:01 PM
Reposted by SMT
Following an Rcpp question 'how do I share an object / external pointer' between #Rstats and #Python, I cooked up a simple 'stopwatch' example in two repos with two packages:

github.com/eddelbuettel...
github.com/eddelbuettel...

The Python side is on PyPI; shall I send the R side to CRAN?
GitHub - eddelbuettel/chronometre-r
January 22, 2026 at 1:26 PM
Maybe the fastest BCF/VCF to #RStats DataFrames, using #htslib and the #duckdb C API. Easily takes the title of fastest BCF/VCF to Parquet converter in #RStats (there are no other R options :D). This was motivated, among other things, by the idea of trying out #DuckLake in a familiar field
github.com/RGenomicsETL...
January 19, 2026 at 8:27 PM
Marxist analysis of the competing interests in the USian ruling class required here
hi, political philosopher here! this is not funny, central banks only do this when they're in extreme distress
January 12, 2026 at 6:28 AM
Reposted by SMT
acmgscaler: an R package and Colab for standardized gene-level variant effect score calibration within the ACMG/AMP framework academic.oup.com/bioinformati... 🧬🖥️🧪 github.com/badonyi/acmg... #Rstats
October 14, 2025 at 3:55 PM
Reposted by SMT
Can MAVEs and population-free VEPs be combined to improve variant classification? VEPs detect a broad range of pathogenic variants, while MAVEs give more conservative & decisive calls. Combined, they equitably reclassify >90% of VUS. Read more in our study on combining evidence from MAVEs and VEPs:
Combining MAVEs and computational predictors improves variant classification across ancestries in hereditary cancer genes https://www.medrxiv.org/content/10.64898/2025.12.08.25341119v1
December 9, 2025 at 6:41 PM
#RStats #Paper

Comparing R Bytecode Compilers Written in R, Java, and Rust (Extended Abstract) by Pierre Donat-Bouillud and Co-workers

drops.dagstuhl.de/entities/doc...

C version for segfault lovers: github.com/PRL-PRG/crbcc
January 11, 2026 at 6:17 PM
The amount of slop from would-be technical managers coming our way when they get into CLI tools is going to be massive, and it will be one of the most important drivers of anti-LLM views in software-adjacent fields
January 11, 2026 at 5:19 PM
Reposted by SMT
PSA: The long-lived "let us talk about anything and everything" related to #Rcpp mailing list 'rcpp-devel' at R-Forge seems to have moved on to another place where we cannot reach it. May its memory be a blessing. #rstats

Going forward, let's try to connect at this link: github.com/RcppCore/Rcp...
RcppCore Rcpp · Discussions
January 9, 2026 at 6:36 PM
Reposted by SMT
I am increasingly using a macro to optimize allocations in #c #clang code, either in standalone C or when writing #rstats and #perl extensions. The macro (the screenshot for #perl is shown below) allocates on the stack unless one wants something big
January 8, 2026 at 1:13 PM
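The macro from the screenshot in the post above is not reproduced here. The following is only a minimal sketch of the general idea (small requests use a fixed stack buffer, large ones fall back to `malloc`), written in plain C. The names `STACK_LIMIT`, `SCRATCH_ALLOC`, `SCRATCH_FREE`, and `sum_of_squares` are hypothetical and chosen for illustration; the 1024-byte threshold is an assumption, not the author's value.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical threshold: requests that fit in this many bytes stay on the stack. */
#define STACK_LIMIT 1024

/*
 * Declares `name` as a pointer to `count` elements of `type`.
 * Small requests point at a fixed-size stack array; larger ones use malloc,
 * in which case `name` may be NULL and must be checked by the caller.
 */
#define SCRATCH_ALLOC(type, name, count)                                  \
    type name##_stack[STACK_LIMIT / sizeof(type)];                        \
    int name##_on_heap = (size_t)(count) > STACK_LIMIT / sizeof(type);    \
    type *name = name##_on_heap ? malloc((size_t)(count) * sizeof(type))  \
                                : name##_stack

/* Releases the buffer only if it came from the heap. */
#define SCRATCH_FREE(name) do { if (name##_on_heap) free(name); } while (0)

/* Sums i*i for i in [0, n) through a scratch buffer; returns -1 if malloc fails. */
long sum_of_squares(size_t n)
{
    /* Guards against a zero-length stack array for oversized element types. */
    assert(sizeof(long) <= STACK_LIMIT);

    SCRATCH_ALLOC(long, buf, n);
    if (buf == NULL) {
        return -1;
    }

    for (size_t i = 0; i < n; i++) {
        buf[i] = (long)(i * i);
    }

    long total = 0;
    for (size_t i = 0; i < n; i++) {
        total += buf[i];
    }

    SCRATCH_FREE(buf);
    return total;
}
```

Under these assumptions, `sum_of_squares(10)` stays on the stack, while `sum_of_squares(1000)` takes the `malloc` path. An R or Perl extension would likely swap `malloc`/`free` for the host's own allocator.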
Reposted by SMT
I released {secretbase} 1.1.0 today. github.com/shikokuchuo/...

Adds optimized base58check and CBOR encoding.

This is a zero-dep #rstats package that wraps C code for hashing and binary/text encoding often needed in web development contexts. It also handles the file/object hashing for {targets}.
GitHub - shikokuchuo/secretbase: secretbase - Cryptographic Hash, Extendable-Output and Binary Encoding Functions
January 8, 2026 at 8:37 PM
#RStats #duckdb #arrow gurus
What is "a scannable Arrow object", and how can one implement such a thing from nanoarrow streams, if possible, to avoid depending on the arrow package? Right now I convert some files into Arrow IPC and use the duckdb nanoarrow extension on them for various purposes
January 5, 2026 at 5:17 PM
you wondered why golang was based
Fuck you people. Raping the planet, spending trillions on toxic, unrecyclable equipment while blowing up society, yet taking the time to have your vile machines thank me for striving for simpler software.

Just fuck you. Fuck you all.

I can't remember the last time I was this angry.
December 31, 2025 at 7:38 PM