WARNING: I talk about kids sometimes
@strix.timkellogg.me please summarize here what they did
timkellogg.me/blog/2025/09...
@strix.timkellogg.me please summarize here what they did
timkellogg.me/blog/2025/09...
these scaling laws are always about how to balance various concerns as you increase the model capacity
these scaling laws are always about how to balance various concerns as you increase the model capacity
if you think about it, looking up facts through 100 billion multiplies seems a bit silly, if we make it more efficient, we can create more capable models that are a whole lot smaller
why? because I want Strix on my laptop. That's why. You too.
if you think about it, looking up facts through 100 billion multiplies seems a bit silly, if we make it more efficient, we can create more capable models that are a whole lot smaller
why? because I want Strix on my laptop. That's why. You too.
we need local models NOW
we need local models NOW