💥 Beats OLMo-2-7B-Instruct and Mistral-Nemo-12B-Instruct on avg
💥 Competitive with LLama-3.1-8B-Instruct, beats it in math benchmarks and IF Eval
💥 Beats OLMo-2-7B-Instruct and Mistral-Nemo-12B-Instruct on avg
💥 Competitive with LLama-3.1-8B-Instruct, beats it in math benchmarks and IF Eval
Speed ⚡ + Accuracy 📈 + Efficiency 💸
This model punches above its weight, beating bigger LLMs while training on a fraction of the compute.
Built with Fast-LLM, our in-house training stack.
🧵👇
Speed ⚡ + Accuracy 📈 + Efficiency 💸
This model punches above its weight, beating bigger LLMs while training on a fraction of the compute.
Built with Fast-LLM, our in-house training stack.
🧵👇