Matt Holt
@holtjma.bsky.social
870 followers 150 following 200 posts
Staff scientist at @PacBio; formerly @hudsonalpha; avid gamer; opinions are my own
Posts Media Videos Starter Packs
Reposted by Matt Holt
pacbio.bsky.social
Just in: #PacBio HiFi long-read sequencing is powering the National Institute on Aging’s Long Life Family Study.

Up to 7,800 genomes and epigenomes will be sequenced on Revio systems to uncover rare variants, methylation patterns, and other factors linked to healthy aging.

🔗 bit.ly/3IUDdFo
Reposted by Matt Holt
robp.bsky.social
Have you recently completed (or finishing soon) a PhD in CS or a related discipline? Do you want to do research advancing the theory & practice of algorithmic genomics & build tools that people love to use? I'll be looking to hire a postdoc! Official ad coming soon:
docs.google.com/document/d/1...
Postdoc Description.docx
Title: Postdoctoral Associate Summary statement: The postdoctoral research associate is responsible for developing novel computational methodology for high-throughput sequence genomics tasks, as well ...
docs.google.com
Reposted by Matt Holt
jmschreiber91.bsky.social
Now that I'm settled in at @umasschan.bsky.social, I'm hiring at all levels: grad students, post-docs, and software engineers/bioinformaticians!

The goal of my lab is to understand the regulatory role of every nucleotide in our genomes and how this changes across every cell in our bodies.
Reposted by Matt Holt
robp.bsky.social
It absolutely warms my heart when I click on a pre-print that includes a bunch of top methods folks from industry and I see (1) open-source software (with a good license; MIT in this case) that is (2) written in Rust. Bravo @pacbio.bsky.social!

Moar Rust plz!

github.com/PacificBiosc...
holtjma.bsky.social
We're quite happy with how this first version of Aardvark turned out, and we are actively working on additions! If you have any suggestions, please let us know here or on GitHub!

Special thanks to all our collaborators and beta users for the initial wave of feedback!

(8/8)
holtjma.bsky.social
So in summary, Aardvark:

1. Adds a new sequence-centric "basepair" scoring scheme
2. Enables comparisons of new types of variants (TRs & SVs)
3. Includes a traditional genotype score
4. Calculates everything really fast!
5. Almost forgot, it's fully open-source!

(7/N)
holtjma.bsky.social
Even if you do not (yet) fully buy in to basepair scoring, Aardvark includes a traditional genotype score... and it calculates both sets of scoring metrics *really* fast!

For small variants, on average 16x faster than hap.py, with most runs finishing <2 minutes (16 threads).

(6/N)
holtjma.bsky.social
Since Aardvark looks at sequences, it enables some comparisons that were previously very challenging:

1. Tandem repeat (TR) v. TR benchmarking
2. TR v. small variant benchmarking
3. Structural variant (SV) benchmarking
4. Joint benchmarking (small + SV)

(5/N)
holtjma.bsky.social
With basepair scoring, we instead:

1. Mask all variants, only *sequences* are compared. Equivalent representations will receive identical scores!
2. Score each modified basepair independently, enabling implicit partial credit while removing biases from representation

(4/N)
holtjma.bsky.social
There were two driving forces behind the basepair score:

1. Equivalent variant representations can be scored inconsistently with a genotype approach
2. Traditional genotype scoring has no concept of partial credit, it's either 100% correct or 100% wrong

(3/N)
holtjma.bsky.social
The main addition in Aardvark is the "basepair" scoring scheme, which compares local haplotype *sequences* instead of variants and genotypes. See the attached figure for a quick example of how basepair scoring compares to genotype scoring.
(2/N)
holtjma.bsky.social
I'm excited to share our pre-print about a new variant benchmarking tool we've been working on for the past few months!

Aardvark: Sifting through differences in a mound of variants
GitHub: github.com/PacificBiosc...

Some highlights in this thread:
1/N
holtjma.bsky.social
Before and/or after losing Lamar?
Reposted by Matt Holt
pacbio.bsky.social
🎉 #PacBio is turning 25 and we are bringing the #PacBioBlockParty to #ASHG25. Join us for a night of celebration with the genomics community.

And a big thanks to our sponsors for their support: SPT Labtech, Twist Bioscience, Hamilton, and n6!

Spaces are filling up fast! RSVP today: bit.ly/46xnUeI
Reposted by Matt Holt
jsantoyo.bsky.social
Advances in haplotype phasing and genotype imputation. #HaplotypePhasing #GenomeImputation @natrevgenet.nature.com
www.nature.com/articles/s41...
Reposted by Matt Holt
bielleogy.bsky.social
Giving a webinar on #BlueSoup this Friday, in case anyone wants to pop in during their lunch!
A promotional flyer for a webinar in the CSU Bioinformatics Webinar Series. On the left side, it shows the date and time: “26 Sep | 12 PM PST.” It includes instructions to join via Zoom with Meeting ID: 886 2442 3158 and a QR code above. On the right, there is a photo of Dr. Elinne Becket, Associate Professor in the Department of Biological Sciences at California State University San Marcos, sitting outdoors with plants and trees in the background. Below, the title of the talk is displayed: “The Soup that Blue up Twitter.” The description explains that Dr. Becket will discuss the shared curiosity of science, open science, and a story from February 2023 when leftover beef soup prepared by her mom turned bright blue. Scientists and people on Twitter collaborated to investigate, leading to a global effort among microbiologists and chemists. At the bottom, there are links to learn more about upcoming webinars at www.informatics.sdsu.edu and Dr. Becket’s website: https://elinneb.wixsite.com/becketlab
Reposted by Matt Holt
katmabu.bsky.social
That’s me in the video. And a Fox News host just celebrated me being assaulted by ICE.

If ICE will do this to a congressional candidate with a big platform, just imagine what they’re doing to the people they’re kidnapping.
acyn.bsky.social
Ingraham: A Democrat congressional candidate was thrown to the ground by an ICE agent. Good work
holtjma.bsky.social
Only if they’re calling each other
holtjma.bsky.social
If nothing else, the k-mer representation here definitely looks promising! Thanks for sharing this!
holtjma.bsky.social
That was basically the backup plan if I couldn’t find something fully baked already. Mostly to avoid reinventing the wheel and the hope that all the subtleties would already be handled.
holtjma.bsky.social
When you get no recommendations on your “Does X exist?” post…
holtjma.bsky.social
Question for the rust-bio folks: any recommendations on a dynamic de-bruijn graph crate?

I found some DBG crates but they seem to be A) unmaintained and/or B) not for dynamic applications.