The CS view of RL is wrong in how it thinks about rewards, already at the setup level. Briefly, the reward computation should be part of the agent, not part of the environment.
More at length here:
gist.github.com/yoavg/3eb3e7...
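A minimal sketch (mine, not from the linked gist) of what that distinction could look like in code, assuming a toy environment and agent whose names and details are purely illustrative: the environment returns only observations, and the reward is computed by the agent itself rather than handed back by the environment.

```python
import random


class LineWorld:
    """Toy environment that emits observations only -- no reward signal."""

    def __init__(self, size=5):
        self.size = size
        self.pos = 0

    def step(self, action):
        # action: -1 (move left) or +1 (move right)
        self.pos = max(0, min(self.size - 1, self.pos + action))
        return {"position": self.pos, "at_goal": self.pos == self.size - 1}


class Agent:
    """Agent that owns its reward computation (illustrative, hypothetical)."""

    def __init__(self):
        self.value = {}  # crude per-state value estimates

    def policy(self, obs):
        return random.choice([-1, 1])

    def compute_reward(self, obs, action, next_obs):
        # The agent's internal evaluation of the outcome, not the environment's.
        return 1.0 if next_obs["at_goal"] else 0.0

    def update(self, obs, action, reward, next_obs):
        s = obs["position"]
        self.value[s] = self.value.get(s, 0.0) + 0.1 * (reward - self.value.get(s, 0.0))


env, agent = LineWorld(), Agent()
obs = {"position": env.pos, "at_goal": False}
for _ in range(20):
    action = agent.policy(obs)
    next_obs = env.step(action)                           # observations only
    reward = agent.compute_reward(obs, action, next_obs)  # reward lives in the agent
    agent.update(obs, action, reward, next_obs)
    obs = next_obs
```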
The jump from:
A) people can be thought of as agents who observe an environment, act, observe the outcome, and update their beliefs
to:
B) let's model all things as a POMDP with a numeric reward function!
is just way too big for me
but assuming we do delete facts, is deleting considered learning in your definition?