Lightnews — Scholar-powered news

Jørgen Lund

@jaalu.bsky.social

30 followers 69 following 48 posts

Industry Ph.D. student in ML, DIPS AS, UiT The Arctic University of Norway

github.com/jaalu | he/him

Posts Replies Media Videos

Jørgen Lund

@jaalu.bsky.social

At 56M parameters, naive quantization *may* not be the best way to go

A screenshot of a 4-bit quantized version of the 56M PleIAs Monad model, prompted with "Which are the first elements in the periodic table?" and responding by repeating "core facts" and "constraints" all about polymers

November 13, 2025 at 3:59 PM

Jørgen Lund

@jaalu.bsky.social

@ai2.bsky.social's OLMoTrace (arxiv.org/pdf/2504.07096) is the more serious approach to this, they match spans from the output to spans in the training corpora, and the paper does find links between answers and specific sources, but as far as I can tell they do not intervene in the generation itself

A picture of OLMoTrace showing the OLMo 2 model answering the question "How does a CPU work?", with the phrase "[the program counter] holds the location of the next instruction to be executed" highlighted and linked to a document in the training corpus

October 2, 2025 at 8:17 AM

Jørgen Lund

@jaalu.bsky.social

In the specific project the dataset seems to be books, legal documents and contemporary text sourced from Project Gutenberg

There is a surprising amount of text available though, Harvard's Institutional Books dataset - arxiv.org/pdf/2506.08300 - has >470K texts dating to the 1800s

Table App.B1 from the technical report of Harvard's Institutional Books dataset, showing that 43.73% of books in it, or 470 468 volumes, date to the 1800s

August 26, 2025 at 11:47 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news