Our work on Asynchronous RLHF was accepted to #ICLR2025! (I was so excited to announce it, I forgot to say I was excited)
Used by @ai2.bsky.social for OLMo-2 32B 🔥
New results show ~70% speedups for LLM + RL math and reasoning 🧠
🧵 below, or hear my DLCT talk online on March 28!