🤖 AI tinkerer 🏗️ Building tech communities
🇪🇺 UK 🇬🇧🇩🇪🇻🇪 | Check bio: axelgarciak.com/bio
But they are great for local use cases.
It's nice that they are hybrid, i.e: thinking can be deactivated.
Qwen2.5 was already the best in its class for a while! Hopefully we'll get a Coder fine-tune!
But they are great for local use cases.
It's nice that they are hybrid, i.e: thinking can be deactivated.
Qwen2.5 was already the best in its class for a while! Hopefully we'll get a Coder fine-tune!
However, previous Gemma models were quite good relative to their size, so I'm sure Gemma 3 is really good!
However, previous Gemma models were quite good relative to their size, so I'm sure Gemma 3 is really good!
If you were to use fp4 exclusively, there is an advantage in speed from these newer GPUs.
If you use fp8 there is still an improvement but far less than advertised.
If you were to use fp4 exclusively, there is an advantage in speed from these newer GPUs.
If you use fp8 there is still an improvement but far less than advertised.