Quentin Anthony
banner
quentinanthon15.bsky.social
Quentin Anthony
@quentinanthon15.bsky.social
I make models more efficient.
Google Scholar: https://scholar.google.com/citations?user=GDm6BIAAAAAJ&hl=en
Inspired by “minimal implementation“ projects in AI such as
@karpathy.bsky.social’s nanoGPT, I worked to bring this concept to the HPC world!

I’ve built a minimal implementation of an MPI library called nanoMPI, which focuses on clarity, simplicity, and easy installation.
June 4, 2025 at 6:10 PM
We are the first to demonstrate higher training kernel throughput (both transformers and SSM hybrids) on AMD MI300X compared to H100!

- rocm.blogs.amd.com/ecosystems-a...
- www.zyphra.com/post/trainin...
December 10, 2024 at 9:35 PM
We dropped the Zamba2 and Zyda2 tech reports on arxiv!
- Zamba2 models of size 1.2B, 2.7B, 7.4B
- Zyda-2 5T token dataset
- We discuss more specifics on model arch, training process, dataset creation, etc

Links:
- Zamba2: arxiv.org/abs/2411.15242
- Zyda-2: arxiv.org/abs/2411.06068
November 26, 2024 at 8:23 PM