Rosie Zhao
rosieyzh.bsky.social
PhD student with the Harvard ML Foundations group.
Excited to be attending 🇸🇬#ICLR2025! Please reach out to chat about LLM reasoning/optimization/training dynamics!

Will be presenting a study on diagonal preconditioning optimizers for LLM pretraining (arxiv.org/abs/2407.07972) and SOAP (arxiv.org/abs/2409.11321).
April 17, 2025 at 3:52 PM
Reposted by Rosie Zhao
Ever looked at LLM skill emergence and thought 70B parameters was a magic number? Our new paper shows sudden breakthroughs are samples from bimodal performance distributions across seeds. Observed accuracy jumps abruptly while the underlying accuracy DISTRIBUTION changes slowly!
February 25, 2025 at 10:33 PM