But I am not sure that matters much. Larger models already make far fewer errors & many real-world processes are built with error-prone humans in mind.
Neural Networks" paper open source, in partnership with the Computer History Museum.
computerhistory.org/press-releas...
corca.io
Come see us on Tuesday mornings at 8am PST!
"3.5 Sonnet was not trained in any way that involved a larger or more expensive model (contrary to some rumors)."
—Dario Amodei
darioamodei.com/on-deepseek-...
simonwillison.net/2025/Jan/29/...
darioamodei.com/on-deepseek-...
DeepSeek R1 is just the tip of the iceberg of rapid progress.
People underestimate the long-term potential of “reasoning.”