Lightnews — Scholar-powered news

hochadel.bsky.social

@hochadel.bsky.social

9 followers 180 following 0 posts

Posts Replies Media Videos

Reposted by hochadel.bsky.social

Milan Weibel 🔷

@weibac.bsky.social

w h a t

"Pruning as few as a single parameter can destroy an LLM's ability to generate text -- increasing perplexity by 3 orders of magnitude and reducing zero-shot accuracy to guessing."

The Super Weight in Large Language Models

Recent works have shown a surprising result: a small fraction of Large Language Model (LLM) parameter outliers are disproportionately important to the quality of the model. LLMs contain billions of pa...

arxiv.org

May 27, 2025 at 9:11 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news