Jacob Springer
@jacobspringer.bsky.social
Machine Learning (the science part) | PhD student @ CMU
jacobspringer.bsky.social
For the theorists in the room: we dive deeper into why this happens using a linear transfer learning setup, revealing that incremental learning leads to catastrophic overtraining.
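(For intuition, here is one generic way a linear transfer setup can be written down. This is a sketch of the genre, with my own notation, not necessarily the paper's exact formulation: pre-train a linear predictor incrementally, then apply a fine-tuning update, and track both losses as pre-training time t grows.)

```latex
% Generic linear transfer sketch (assumed notation; not necessarily the paper's setup).
% \theta_t: weights after t steps of incremental pre-training on D_pre.
% \theta_t^{ft}: those weights after a fine-tuning update on D_task.
\begin{align*}
  \theta_{t+1} &= \theta_t - \eta_{\text{pre}}\,\nabla_\theta \mathcal{L}_{\text{pre}}(\theta_t),
  &\quad \mathcal{L}_{\text{pre}}(\theta) &= \mathbb{E}_{(x,y)\sim\mathcal{D}_{\text{pre}}}\big[(y - \theta^\top x)^2\big], \\
  \theta_t^{\text{ft}} &= \theta_t - \eta_{\text{ft}}\,\nabla_\theta \mathcal{L}_{\text{task}}(\theta_t),
  &\quad \mathcal{L}_{\text{task}}(\theta) &= \mathbb{E}_{(x,y)\sim\mathcal{D}_{\text{task}}}\big[(y - \theta^\top x)^2\big].
\end{align*}
% Catastrophic overtraining in this picture: past some t, both
% L_task(\theta_t^{ft}) and L_pre(\theta_t^{ft}) get worse as t keeps growing.
```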

9/10
jacobspringer.bsky.social
Fine-tuning behaves similarly: when we fine-tune different pre-training checkpoints with a fixed learning rate, we see eventual degradation in both task performance and web-data perplexity. This often holds even after hyperparameter tuning. Overtraining = worse fine-tuning outcomes!
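(A rough sketch of that checkpoint-sweep protocol in Python; the helper functions and names here are hypothetical placeholders, not the paper's code.)

```python
# Hypothetical sketch: fine-tune every pre-training checkpoint with the same
# fixed hyperparameters, then measure downstream performance and perplexity
# on held-out web data.
def sweep_checkpoints(checkpoints, finetune, eval_task, eval_web_ppl,
                      lr=1e-5, epochs=2):
    """checkpoints: list of (tokens_seen, base_model), ordered by tokens_seen."""
    results = []
    for tokens_seen, base_model in checkpoints:
        # Same fixed learning rate for every checkpoint.
        tuned = finetune(base_model, lr=lr, epochs=epochs)
        results.append({
            "pretrain_tokens": tokens_seen,
            "task_score": eval_task(tuned),        # e.g. instruction-following eval
            "web_perplexity": eval_web_ppl(tuned),  # perplexity on held-out web data
        })
    # Catastrophic overtraining: beyond some token budget, task_score drops and
    # web_perplexity rises even though pretrain_tokens keeps growing.
    return results
```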

8/10
jacobspringer.bsky.social
👉 Early in training: Models have low sensitivity & the base model improves quickly; performance improves 📈
👉 Late in training: Models become highly sensitive & the base model improves slowly; performance degrades! 📉

7/10
jacobspringer.bsky.social
What's happening? Beyond Gaussian perturbations, extended pre-training increases model sensitivity to all types of parameter updates 👇
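(One simple way to make "sensitivity" concrete, written generically; the paper's exact measure may differ.)

```latex
% Sensitivity of a checkpoint \theta_t to a parameter update \delta:
% the loss increase caused by applying that update.
\[
  S(\theta_t, \delta) \;=\; \mathcal{L}(\theta_t + \delta) - \mathcal{L}(\theta_t),
  \qquad \delta \sim \mathcal{N}(0, \sigma^2 I) \;\text{ (Gaussian noise), or a fine-tuning update.}
\]
```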

6/10
jacobspringer.bsky.social
🔹 Early checkpoints: Robust to parameter changes.
🔸 Later checkpoints: Highly sensitive, leading to worse performance after perturbation! (Left plot: sensitivity increases over training; right plot: final performance eventually degrades.)

5/10
jacobspringer.bsky.social
Let’s step back and consider a simpler setting: we train our own 30M-parameter models and test how adding Gaussian noise to the parameters affects performance at different pre-training stages 👇
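(A minimal sketch of this perturbation probe, assuming a PyTorch model; `eval_loss` and the checkpoint loading are hypothetical placeholders, not the paper's code.)

```python
import torch

@torch.no_grad()
def perturb_and_eval(model, eval_loss, sigma=0.01):
    """Add i.i.d. Gaussian noise with std `sigma` to every parameter,
    evaluate the loss, then restore the original weights."""
    originals = [p.detach().clone() for p in model.parameters()]
    for p in model.parameters():
        p.add_(torch.randn_like(p) * sigma)
    loss_after_noise = eval_loss(model)
    for p, orig in zip(model.parameters(), originals):
        p.copy_(orig)
    return loss_after_noise

# Repeating this for checkpoints taken at different points in pre-training
# traces how sensitivity to the same noise level changes over training.
```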

4/10
jacobspringer.bsky.social
Example: OLMo-1B trained on 3T tokens performs over 2% *worse* after instruction tuning than its 2.3T-token checkpoint, even though it saw 30% more pre-training data! We observe similar degradation across many other post-training setups.

Why does extended pre-training hurt fine-tuning performance? 🤔

3/10
jacobspringer.bsky.social
The latest language models are pre-trained on more and more tokens while holding the number of model parameters fixed—and this trend isn't slowing down!
➡️ Better base models? Yes.
➡️ Better starting point for post-training? Let’s check!

2/10
jacobspringer.bsky.social
Training with more data = better LLMs, right? 🚨

False! Scaling language models by adding more pre-training data can decrease your performance after post-training!
Introducing "catastrophic overtraining." 🥁🧵👇

arxiv.org/abs/2503.19206

1/10