How will LLMs learn to reason efficiently?
No math in this thread, ~simple words only! Let's go through the "Process Reinforcement through IMplicit REwards" (PRIME) method. 1/n
curvy-check-498.notion.site/Process-Rein...
How will LLMs learn to reason efficiently?
No math in this thread, ~simple words only! Let's go through the "Process Reinforcement through IMplicit REwards" (PRIME) method. 1/n
curvy-check-498.notion.site/Process-Rein...
How much time do we have before Artificial General Intelligence (AGI) is here, and how much time do we have afterwards? 2/n
How much time do we have before Artificial General Intelligence (AGI) is here, and how much time do we have afterwards? 2/n
A disconnect is widening between how product builders think, and how Artificial Intelligence researchers project short-term future.
But I believe it's a solvable problem (in any world where a solution exists). 1/n🧵
A disconnect is widening between how product builders think, and how Artificial Intelligence researchers project short-term future.
But I believe it's a solvable problem (in any world where a solution exists). 1/n🧵
I now realize Haibane Renmei is not just my most favorite show ever, but also the core of my moral philosophy.
I now realize Haibane Renmei is not just my most favorite show ever, but also the core of my moral philosophy.
Embedding Claude in vim means you get a fully agentic AI pair programmer in <1000 LoC. (Or a generic chatbot, or Linux co-admin.)
Unlike CLIs, you can review Claude's changes, and keep the chat history.
github.com/pasky/claude...
Embedding Claude in vim means you get a fully agentic AI pair programmer in <1000 LoC. (Or a generic chatbot, or Linux co-admin.)
Unlike CLIs, you can review Claude's changes, and keep the chat history.
github.com/pasky/claude...