@nammuca.bsky.social
Hello, BlueSky!
Reposted
🤔 Ever wondered how prevalent certain types of web content are in LM pre-training?

In our new paper, we propose WebOrganizer which *constructs domains* based on the topic and format of CommonCrawl web pages 🌐

Key takeaway: domains help us curate better pre-training data! 🧵/N
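To make "constructing domains" concrete, here is a minimal, hypothetical sketch of grouping pages by their topic and format labels and measuring how much of a corpus each domain occupies. The Page fields, classifier outputs, and domain naming are illustrative assumptions, not the paper's actual pipeline.

```python
from collections import Counter
from dataclasses import dataclass

# Hypothetical page record; the real WebOrganizer schema and classifiers differ.
@dataclass
class Page:
    url: str
    topic: str   # assumed output of a topic classifier, e.g. "science"
    fmt: str     # assumed output of a format classifier, e.g. "tutorial"

def domain_of(page: Page) -> str:
    """A 'domain' here is simply the (topic, format) pair of a page."""
    return f"{page.topic}/{page.fmt}"

def domain_distribution(pages: list[Page]) -> dict[str, float]:
    """Fraction of the corpus occupied by each topic x format domain."""
    counts = Counter(domain_of(p) for p in pages)
    total = sum(counts.values())
    return {d: c / total for d, c in counts.items()}

# Toy corpus standing in for CommonCrawl pages.
corpus = [
    Page("a.com/1", "science", "tutorial"),
    Page("b.com/2", "science", "news"),
    Page("c.com/3", "sports", "news"),
    Page("d.com/4", "sports", "news"),
]
print(domain_distribution(corpus))
# {'science/tutorial': 0.25, 'science/news': 0.25, 'sports/news': 0.5}
```

Once pages carry domain labels like these, the mixing weights of the pre-training corpus can be adjusted per domain rather than per raw URL.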
February 18, 2025 at 12:31 PM
Reposted
WaPo editorial: Be thankful for the applications of AI in medicine.
More accurate detection of cancers (breast, prostate, skin, brain), faster diagnosis of strokes, sepsis, and heart attacks, and faster MRIs (full-body scans in 40 minutes).
Much more to come in the years ahead.
www.washingtonpost.com/opinions/202...
November 28, 2024 at 7:54 PM
Reposted
Auto-Regressive LLMs (causal, decoder-only transformer architectures) and BERT-style models (denoising auto-encoders with transformer architectures) are smashing demonstrations of the power of self-supervised (pre-)training.
But they only work for sequences of discrete symbols: language, proteins...
Also see Karpathy's take on it.
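As a rough illustration of the distinction in the post, the sketch below builds training targets for the two self-supervised objectives on a toy sequence of discrete tokens: the auto-regressive (causal) setup predicts the next symbol from a prefix, while the BERT-style denoising setup reconstructs masked symbols. The helper names and masking rate are made up for illustration; real tokenizers and models are far more elaborate.

```python
import random

# Toy vocabulary of discrete symbols (the point of the post: these objectives
# operate on token sequences, not raw continuous signals).
tokens = ["the", "cat", "sat", "on", "the", "mat"]
MASK = "[MASK]"

def causal_lm_pairs(seq):
    """Auto-regressive objective: predict each token from everything before it."""
    return [(seq[:i], seq[i]) for i in range(1, len(seq))]

def masked_lm_pairs(seq, mask_prob=0.15, seed=0):
    """BERT-style denoising objective: corrupt some tokens, predict the originals."""
    rng = random.Random(seed)
    corrupted, targets = list(seq), {}
    for i, tok in enumerate(seq):
        if rng.random() < mask_prob:
            corrupted[i] = MASK
            targets[i] = tok
    return corrupted, targets

print(causal_lm_pairs(tokens))   # (prefix, next-token) training pairs
print(masked_lm_pairs(tokens))   # (corrupted sequence, positions to reconstruct)
```

Both objectives get their supervision for free from the data itself, which is what makes self-supervised pre-training scale so well on text and other discrete sequences.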
November 24, 2024 at 9:44 PM
Reposted
I first showed this "cake" slide in 2016.
Eight years later, Yann LeCun’s cake 🍰 analogy was spot on: self-supervised > supervised > RL

> “If intelligence is a cake, the bulk of the cake is unsupervised learning, the icing on the cake is supervised learning, and the cherry on the cake is reinforcement learning (RL).”
November 24, 2024 at 9:35 PM