Quentin Anthony
@quentinanthon15.bsky.social
I make models more efficient.
Google Scholar: https://scholar.google.com/citations?user=GDm6BIAAAAAJ&hl=en
We dropped the Zamba2 and Zyda-2 tech reports on arXiv!
- Zamba2 models of size 1.2B, 2.7B, 7.4B
- Zyda-2 5T token dataset
- We discuss more specifics on model arch, training process, dataset creation, etc.

Links:
- Zamba2: arxiv.org/abs/2411.15242
- Zyda-2: arxiv.org/abs/2411.06068
November 26, 2024 at 8:23 PM