Writes http://interconnects.ai
At Ai2 via HuggingFace, Berkeley, and normal places
rlhfbook.com
Sorry, living under rocks today.
A month of SOTA releases, with (truly) open model releases left and right.
www.interconnects.ai/p/latest-ope...
"a few hours" and the model is on HuggingFace.
www.chinatalk.media/p/the-zai-pl...
Me:
> This dataset is provided for...
> Evaluating information retrieval and retrieval augmented generation (RAG) systems.
> It is not intended for: Fine-tuning language models.
??
Sorry to all who delayed releases today to get out of our way.
We're hiring.
This family of 7B and 32B models represents:
1. The best 32B base model.
2. The best 7B Western thinking & instruct models.
3. The first 32B (or larger) fully open reasoning model.
5.1 just feels weird, can't quite place it.
scholar.google.com/scholar_labs...
How the current way of training language models destroys any voice (and hope of good writing).
www.interconnects.ai/p/why-ai-wri...
Very shocking, because even the first Claude Code could do this. Am I the only one?
Funny dynamic. TBD if quality dropped at all, quality at least didn't nosedive. Hard to tell when style changes a bit.
Excited to land in print in early 2026! Lots of improvements coming soon.
Thanks for the support!
hubs.la/Q03Tc37Q0
Surely there are more people studying how to modify & steer model personality after the GPT-4o sycophancy incident.
Some new research from me!
Exploring how easy it is to craft personalities like sycophantic chatbots, and exploring how this will change as we move from chat to agents.
www.interconnects.ai/p/opening-th...
The rest of 2025 has been living through that reality with Kimi, GLM, Ant Ling, Meituan... The burden of proof is back on scaling if AI will be in the hands of a few companies.
I'm building up a much richer (and direct) understanding of Chinese AI labs. Excited to share more here soon :)
Congrats to the Moonshot AI team on the awesome open release. For close followers of Chinese AI models, this isn't shocking, but more inflection points are coming. Pressure is building on US labs with more expensive models.
www.interconnects.ai/p/kimi-k2-th...