Lightnews — Scholar-powered news

Quentin Anthony

@quentinanthon15.bsky.social

410 followers 43 following 11 posts

I make models more efficient.
Google Scholar: https://scholar.google.com/citations?user=GDm6BIAAAAAJ&hl=en

Posts Replies Media Videos

Quentin Anthony

@quentinanthon15.bsky.social

Inspired by “minimal implementation“ projects in AI such as
@karpathy.bsky.social’s nanoGPT, I worked to bring this concept to the HPC world!

I’ve built a minimal implementation of an MPI library called nanoMPI, which focuses on clarity, simplicity, and easy installation.

June 4, 2025 at 6:10 PM

Quentin Anthony

@quentinanthon15.bsky.social

We are the first to demonstrate higher training kernel throughput (both transformers and SSM hybrids) on AMD MI300X compared to H100!

- rocm.blogs.amd.com/ecosystems-a...
- www.zyphra.com/post/trainin...

December 10, 2024 at 9:35 PM

Quentin Anthony

@quentinanthon15.bsky.social

We dropped the Zamba2 and Zyda2 tech reports on arxiv!
- Zamba2 models of size 1.2B, 2.7B, 7.4B
- Zyda-2 5T token dataset
- We discuss more specifics on model arch, training process, dataset creation, etc

Links:
- Zamba2: arxiv.org/abs/2411.15242
- Zyda-2: arxiv.org/abs/2411.06068

November 26, 2024 at 8:23 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news