Lightnews — Scholar-powered news

Reposted by Justin DuJardin

Sung Kim

@sungkim.bsky.social

They find that RoPE (the positional encoding used in most modern LLMs) has a fundamental flaw. It entangles "what" (content) and "where" (position) information.

They propose PoPE (Polar Coordinate Position Embeddings), which eliminates the what-where.

December 26, 2025 at 2:27 AM

Reposted by Justin DuJardin

Chris Paxton

@cpaxton.bsky.social

Physical Intelligence has a recipe for real world RL on top of VLAs and it looks impressive: www.pi.website/blog/pistar06

A VLA that Learns from Experience

A method for training our generalist policies with RL to improve success rate and throughput on real-world tasks.

www.pi.website

November 18, 2025 at 12:41 AM

Reposted by Justin DuJardin

Serena Malyon 🌞🌛

@serenamalyon.bsky.social

Sif - watercolour and acryla gouache

a painting of Sif from Dark Souls, a good pup

November 13, 2025 at 4:02 PM

Justin DuJardin

@justindujardin.com

get you a partner who loves you like claude loves hasattr

amos @fasterthanli.me · Oct 31

get you a partner who loves you like claude loves adding inline styles

October 31, 2025 at 4:13 PM

Reposted by Justin DuJardin

Azure

@aguyuno.bsky.social

Dudes rock

September 28, 2025 at 2:28 PM

Reposted by Justin DuJardin

🦖 Fractal de Pombo

@inkdino.bsky.social

a cartoon of a man with a sword standing next to a giant white object

ALT: a cartoon of a man with a sword standing next to a giant white object

media.tenor.com

September 28, 2025 at 1:54 AM

Reposted by Justin DuJardin

Scott Hanselman 🌮

@scott.hanselman.com

DadCore USB

September 27, 2025 at 3:36 AM

Reposted by Justin DuJardin

Brendel

@brendelbored.bsky.social

We got a public domain Pinocchio Dark Souls game so it only stands to reason that a public domain Winnie the Pooh one would also work and I just want to suggest the boss names Pontiff Piglet and Holy Blade Roo

September 23, 2025 at 3:30 AM

Justin DuJardin

@justindujardin.com

I love this work. Started training and evaluating all my models deterministically a few years back when I realized I lacked the compute/time to be effective with nondeterministic models. You pay a perf cost, but never wonder if changing that hparam made it better or if it was just nondeterminism.

Sung Kim @sungkim.bsky.social · Sep 10

Defeating Nondeterminism in LLM Inference by Horace He (Thinking Machines' blog, an AI lab founded by Mira Murati)

Connectionism will cover topics as varied as their research is: from kernel numerics to prompt engineering. Here, they share what they are working on.

Defeating Nondeterminism in LLM Inference

Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language models. For example, you might observe that asking ChatGPT the...

thinkingmachines.ai

September 10, 2025 at 10:32 PM

Justin DuJardin

@justindujardin.com

I had to say goodbye to the best cat yesterday. His name was MorTon. He was very sweet, handsome, brave, and definitely a fierce hunter. I'll miss him bunches. Please enjoy his floof 😿

A small brown and black with white tabby kitten looking very happy sitting in a bathroom sink.

MorTon, a brown and black with white tabby cat, lounging in front of a fire, looking like the coolest cat in the world.

MorTon, a brown and black with white tabby cat, belly up inviting you to take the bait and rub his belly. Definitely not a trap.

MorTon, a brown and black with white tabby cat. He is lovingly judging his human.

September 10, 2025 at 5:13 PM

Justin DuJardin

@justindujardin.com

working on a new project and just finished the classic developer journey

- research the best way to do x
- decide your system is "special" and reject it
- fight through days of debugging and refactoring
- "independently" arrive at research conclusion

a cartoon of robin hood leaning against a tree with the word nbd above him

ALT: a cartoon of robin hood leaning against a tree with the word nbd above him

media.tenor.com

August 11, 2025 at 3:47 PM

Reposted by Justin DuJardin

Kosta Derpanis

@csprofkgd.bsky.social

What is "lossing"? (with audio)

August 2, 2025 at 5:16 PM

Justin DuJardin

@justindujardin.com

Really reflecting on what it means to be professional and act ethically this morning..

a cute cartoon cat is sitting down and looking at the camera .

ALT: a cute cartoon cat is sitting down and looking at the camera .

media.tenor.com

July 24, 2025 at 6:06 PM

Justin DuJardin

@justindujardin.com

I shared some of my findings recently.

BlueSky: *crickets*
r/MachineLearning: mostly constructive feedback
HackerNews: this guy sucks
TechRxiv: we’re backed by IEEE, and ghost people who ask for status updates after we miss our self-imposed deadlines.

Glad I burned billable hours to work on this

a man is carrying a suitcase in a living room and says i got nothin ' left

Alt: George Michael Bluth collapsing to the floor, saying “I got nothin’ left”

media.tenor.com

July 13, 2025 at 5:37 PM

Justin DuJardin

@justindujardin.com

TechRxiv is ghosting me, so here's my paper:

I fixed neural arithmetic. Division works. Extrapolation works. 10^-16 error.

Turns out: training distribution matters, complex numbers > log space, and those NALU weights were calculable all along.

Proof: hillspace.justindujardin.com

#MachineLearning

Hill Space is All You Need

hillspace.justindujardin.com

July 11, 2025 at 11:33 PM

Reposted by Justin DuJardin

Tim Kellogg

@timkellogg.me

important announcement

A humorous tweet from user @nearcyan replies to @skirano (Pietro Schirano) who wrote:

“I’m pretty sure Claude 4 got nerfed.”

Near’s response reads:

“friendly reminder that claude is french, so although the models remain constant, expect ‘lazier’ results during july and august just like last year. no this isnt a joke”

The tweet implies, tongue-in-cheek, that the Claude language model exhibits lower performance during the summer months, humorously attributing it to French vacation culture. The joke plays off the stereotype of French workers taking extended summer holidays, suggesting it somehow affects the model’s outputs.

July 9, 2025 at 1:24 AM

Reposted by Justin DuJardin

Tamoor Hussain

@tamoorh.com

Castle Troll ballin’ out

July 8, 2025 at 4:57 PM

Justin DuJardin

@justindujardin.com

Me waiting for TechRxiv to assign my DOI

a cartoon character sitting in front of a coleco computer with the word refresh below him

ALT: a cartoon character sitting in front of a coleco computer with the word refresh below him

media.tenor.com

July 7, 2025 at 8:29 PM

Reposted by Justin DuJardin

Kenney

@kenney.nl

These characters will release soon! ✨ They come with an easy-to-edit UV map, animations, in 3 file formats, compatible with all game engines and above all; CC0 and free to use!

May 30, 2025 at 7:51 PM

Reposted by Justin DuJardin

Ted Underwood

@tedunderwood.com

The method in this paper was designed to find an optimal data mixture. But researchers in the human sciences who are training models *in order to understand the effect of the data* might also consider this as a clever way of evaluating hundreds of subsets without training hundreds of models. #MLSky

Figure showing a modular training strategy for evaluating domain importance in training data.
At the top, a question is posed: “Which domain is most beneficial to add to the training data?” Below, the left panel labeled Modular Training displays colored blocks representing separate models trained on distinct data partitions. Each block corresponds to a “base unit” of data, and blocks of different colors represent different domains. The right panel labeled Evaluation shows overlapping combinations of these trained models being evaluated together. The strategy allows for reuse of modularly trained models and performs evaluation on parameter averages, enabling efficient simulation of many data mixtures without retraining full models for each. A legend at the bottom explains that each block represents one model trained on x billion tokens, and each outlined group represents one evaluation.

May 5, 2025 at 9:33 PM

Reposted by Justin DuJardin

Movies Silently

@moviessilently.bsky.social

There was a rather charming trend in the 1910s to introduce the actor at the beginning of the film by showing them first in their street or evening clothes and then dissolving into the costume of their characters. Here we see recent stage import William S. Hart become a bandit in THE BARGAIN (1914).

May 3, 2025 at 10:47 PM

Justin DuJardin

@justindujardin.com

Me, a genius, shortly before an unrelated run restart

April 20, 2025 at 8:19 PM

Justin DuJardin

@justindujardin.com

Sounds like a skill issue

Corey Quinn @quinnypig.com · Apr 12

The hallucinations will show options that aren’t available to stream on Netflix, just like Netflix’s current search does now.

Techmeme @techmeme.com · Apr 11

Netflix is testing an OpenAI-powered search engine that lets users find shows via inquiries that go beyond genres or actors' names, like the subscriber's mood (Bloomberg)

Main Link | Techmeme Permalink

April 12, 2025 at 4:55 AM

Justin DuJardin

@justindujardin.com

When OLMo came out, I thought, "Wow, that's cool, but where's the real magic in its approach that would make me use it over [my existing LLM preference]?". The trace feature showing documents from the training data that match the answer it gave you is unexpected and a very attractive feature ✨

Ai2 @ai2.bsky.social · Apr 9

For years it’s been an open question — how much is a language model learning and synthesizing information, and how much is it just memorizing and reciting?

Introducing OLMoTrace, a new feature in the Ai2 Playground that begins to shed some light. 🔦

April 9, 2025 at 5:36 PM

Reposted by Justin DuJardin

Nicolas Zucchet

@nzucchet.bsky.social

Large language models store vast amounts of knowledge, but how exactly do they learn it?

Excited to share my Google DeepMind internship results, which reveal the fascinating dynamics behind factual knowledge acquisition in LLMs!

April 3, 2025 at 12:21 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news