epoch.ai/gradient-upd...
epoch.ai/gradient-upd...
It's mostly just a bunch of simple tricks, but with well-chosen defaults. This is what we aim for in skrub
skrub-data.org/stable/refer...
It's mostly just a bunch of simple tricks, but with well-chosen defaults. This is what we aim for in skrub
skrub-data.org/stable/refer...
We trained 2 new models. Like BERT, but modern. ModernBERT.
Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.
It's much faster, more accurate, longer context, and more useful. 🧵
We trained 2 new models. Like BERT, but modern. ModernBERT.
Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.
It's much faster, more accurate, longer context, and more useful. 🧵
arxiv.org/pdf/2412.06769
arxiv.org/pdf/2412.06769
Looking for something similar to i3, but I don’t need lots of customization. Just looking for reliable.
Looking for something similar to i3, but I don’t need lots of customization. Just looking for reliable.
More: www.theatlantic.com/internationa...
More: www.theatlantic.com/internationa...
fleetwood.dev/posts/you-co...
fleetwood.dev/posts/you-co...
Asking for my research assistant. 📚
Asking for my research assistant. 📚