Here's Tulu 3 405B. A holiday present from @hamishivi.bsky.social @vwxyzjn.bsky.social and team.
Here's Tulu 3 405B. A holiday present from @hamishivi.bsky.social @vwxyzjn.bsky.social and team.
Controlling who in the world can finetune on what is dumb
Controlling who in the world can finetune on what is dumb
My talk at the NeurIPS Latent Space live event (pre o3).
Slides: https://buff.ly/40hsoTx
Post: https://buff.ly/40i2rDC
YouTube: https://buff.ly/40k8GH3
My talk at the NeurIPS Latent Space live event (pre o3).
Slides: https://buff.ly/40hsoTx
Post: https://buff.ly/40i2rDC
YouTube: https://buff.ly/40k8GH3
Much like Moore's law being "over", we will continue to find other ways to continue to scale progress in these models by leveraging more, more efficient, more targeted compute.
Much like Moore's law being "over", we will continue to find other ways to continue to scale progress in these models by leveraging more, more efficient, more targeted compute.