Backprop Blues
banner
backpropblues.bsky.social
Backprop Blues
@backpropblues.bsky.social
AI / ML - Chief Scientist at Macrodata Refinement
Pinned
I often end my random chores and DIY projects with a proud “Let’s see an AI agent do that!” … So far I’m undefeated.
Incredible new audiobook out. Highly recommended by my friend Mark S.
February 1, 2025 at 12:17 AM
Reposted by Backprop Blues
Overfitting, as it is colloquially described in data science and machine learning, doesn’t exist. www.argmin.net/p/thou-shalt...
Thou Shalt Not Overfit
Venting my spleen about the persistent inanity about overfitting.
www.argmin.net
January 30, 2025 at 3:35 PM
Reposted by Backprop Blues
Very good (technical) explainer answering "How has DeepSeek improved the Transformer architecture?". Aimed at readers already familiar with Transformers.

epoch.ai/gradient-upd...
How has DeepSeek improved the Transformer architecture?
This Gradient Updates issue goes over the major changes that went into DeepSeek’s most recent model.
epoch.ai
January 30, 2025 at 9:07 PM
Reposted by Backprop Blues
Wrangling string columns for machine learning, the new StringEncoder in @skrub-data.bsky.social gives such a good compute/prediction performance tradeoff.

It's mostly just a bunch of simple tricks, but with well-chosen defaults. This is what we aim for in skrub

skrub-data.org/stable/refer...
January 28, 2025 at 5:47 PM
Rewatching S1 before S2 drops... How is Severance even better the second time around? The attention to detail, incredible cinematography, unsettling atmosphere, Adam Scott's brilliant performance. Such a great show.
December 23, 2024 at 2:13 AM
Reposted by Backprop Blues
I'll get straight to the point.

We trained 2 new models. Like BERT, but modern. ModernBERT.

Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.

It's much faster, more accurate, longer context, and more useful. 🧵
December 19, 2024 at 4:45 PM
Reposted by Backprop Blues
Some folks are pretty good seeing things coming.
December 18, 2024 at 11:37 AM
Good morning from Cascade Canyon, CO ❄️
December 18, 2024 at 1:17 PM
Anybody know if there a way to fix this on Claude mobile in iOS?
December 13, 2024 at 3:20 AM
Interesting paper from Meta shows how letting AI models reason directly in neural space, rather than through tokens, leads to more efficient and flexible problem-solving

arxiv.org/pdf/2412.06769
December 11, 2024 at 2:15 PM
Any recommendations for tiling window managers on macOS?

Looking for something similar to i3, but I don’t need lots of customization. Just looking for reliable.
December 11, 2024 at 2:33 AM
Helpful video on Yann LeCun's JEPA as explained by Yannic Kilcher
JEPA - A Path Towards Autonomous Machine Intelligence (Paper Explained)
YouTube video by Yannic Kilcher
www.youtube.com
December 11, 2024 at 12:10 AM
Reposted by Backprop Blues
Biden has agreed to let Ukraine hit targets in Russia with US missiles. But there are more ways to shore up Ukraine in what remains an existential war. Europeans, for example, could release the $300 bln in frozen Russian assets...

More: www.theatlantic.com/internationa...
Putin Isn’t Fighting for Land in Ukraine
And Biden has mere weeks to give the Ukrainians the resources they need to fight.
www.theatlantic.com
November 21, 2024 at 9:28 AM
Reposted by Backprop Blues
I still have to finish reading this post but it’s the first time even since the transformer paper I feel like grok what “positional encoding” really is.

fleetwood.dev/posts/you-co...
November 18, 2024 at 10:50 PM
How do I find all the AI/ML researchers and engineers here in ”the good place”?

Asking for my research assistant. 📚
November 15, 2024 at 6:33 PM