lupinatedwoods.bsky.social
@lupinatedwoods.bsky.social
MLE, mostly interested in NLP and embedded models.
Anyone have a preferred library/technique for training draft models? I've found the FastDraft paper, and NVIDIA supports training EAGLE-3 models with their Model Optimizer library.

I want to accelerate Mistral Small 3 with speculative decoding (by training a 2-3B draft model with the same vocab). There doesn't seem to be a good existing draft model.
October 29, 2025 at 5:28 PM
Reposted
This is a neat new variant on RAG - no vectors, not even full-text search, instead showing the model a header hierarchy and giving it a tool to read the relevant sections

My notes here: simonwillison.net/2024/Dec/6/r...
December 6, 2024 at 3:04 AM
The default "Discover" feed is pretty bad, and "What's Hot (Classic)" is worse: just a lot of low-effort content, clickbait, and lowest-common-denominator posts. Are there guides on how to improve this? Will the recommendations improve slowly with increased interaction/relevance feedback?
December 3, 2024 at 3:52 PM
I think more people should evangelize the old spirit of the Internet - open source, open platforms, open data. If email didn't already exist and were created today, it'd be locked in proprietary AmaGooMetaSoft gardens.
November 28, 2024 at 11:27 PM
Looking for ML/NLP researchers to follow; busy going through my following list from elsewhere. Hello world! 👋
November 28, 2024 at 2:30 AM