so in 2026 the papers we hallucinated in 2025 might end up being "real" papers on gscholar or sthn lol
paper (arxiv soon): allenai.org/papers/olmo3
demo: playground.allenai.org
🐟 more on our eval ideology
🦈 more baselines
🍣 more about RL Zero
etc
we picked the final model (internally called moonlit surfer 🌛🏄) not just on bench scores but also on good vibes 🥰
🥐 Signal and Noise (Wed) shows how noisy benchmarks prohibit fitting good task scaling laws & ways to improve
🥯 FlexOlmo (Thurs) is a novel MoE w/ experts trained on different data & control over expert activation based on access permissions to those datasets
it's exactly what you're saying -- each point refers to a stage of development. our release has data+ckpts+evals for all stages we use (figure), and we wanted to show how it compares to other models, which typically only cover a few stages
Olmo 3 was our biggest effort yet, but we're still a small team (67 authors!) compared to a lot of the big labs, which means everyone (especially interns) gets to own a major piece of the Olmo puzzle
job-boards.greenhouse.io/thealleninst...