Jørgen Lund
jaalu.bsky.social
Jørgen Lund
@jaalu.bsky.social
Industry Ph.D. student in ML, DIPS AS, UiT The Arctic University of Norway

github.com/jaalu | he/him
At 56M parameters, naive quantization *may* not be the best way to go
November 13, 2025 at 3:59 PM
@ai2.bsky.social's OLMoTrace (arxiv.org/pdf/2504.07096) is the more serious approach to this, they match spans from the output to spans in the training corpora, and the paper does find links between answers and specific sources, but as far as I can tell they do not intervene in the generation itself
October 2, 2025 at 8:17 AM
In the specific project the dataset seems to be books, legal documents and contemporary text sourced from Project Gutenberg

There is a surprising amount of text available though, Harvard's Institutional Books dataset - arxiv.org/pdf/2506.08300 - has >470K texts dating to the 1800s
August 26, 2025 at 11:47 AM