I'd only expect the cost of serving it to fall if there were an architecture change or with much better inference hardware/software.
I'd only expect the cost of serving it to fall if there were an architecture change or with much better inference hardware/software.
1. the QT
2. Artificial analysis (image)
quote: x.com/artificialan...
report: artificialanalysis.ai/evaluations/...
3. Demis Hassabis said 1-2 months ago that major version numbers indicate OOM scaling, minor is RL scaling
1. the QT
2. Artificial analysis (image)
quote: x.com/artificialan...
report: artificialanalysis.ai/evaluations/...
3. Demis Hassabis said 1-2 months ago that major version numbers indicate OOM scaling, minor is RL scaling
Here's "What is AdS/CFT correspondence?" steered toward grades 5 and 17.
Here's "What is AdS/CFT correspondence?" steered toward grades 5 and 17.
Huel: 6.31 ug/400 kcal = 15.7 ng/kcal
Sweet potato: 12.1 ug/kg / 1000 kcal/kg = 12.1 ng/kcal
Huel: 6.31 ug/400 kcal = 15.7 ng/kcal
Sweet potato: 12.1 ug/kg / 1000 kcal/kg = 12.1 ng/kcal
Anthropic is releasing it as ASL-2, unlike Sonnet 4.5/Opus 4+ which are considered ASL-3
Anthropic is releasing it as ASL-2, unlike Sonnet 4.5/Opus 4+ which are considered ASL-3
3.7: 3.45
4: 3.3
4.5: 3.5
Quite different from Anthropic's relative scores!
3.7: 3.45
4: 3.3
4.5: 3.5
Quite different from Anthropic's relative scores!
OpenAI spent ~$7 billion on compute last year. Most of this went to R&D, meaning all research, experiments, and training.
Only a minority of this R&D compute went to the final training runs of released models.