Here is a blog post summarizing the talk:
davidbau.com/archives/202...
About the question I see as central in AI ethics, interpretability, and safety. Can an AI take responsibility? I do not think so, but *not* because it's not smart enough.
davidbau.com/archives/20...
I have been writing up some thoughts on what the research says about effective action, and what universities specifically can do.
davidbau.github.io/poetsandnurs...
It's on GitHub. Suggestions and pull requests welcome.
github.com/davidbau/poe...
Is copying all there is?
@ericwtodd.bsky.social trained on groups where tokens have no fixed meaning and found a basket of mechanisms beyond copying.
Watch them emerge, a grokking cascade! ↓
bsky.app/profile/eri...
I can finally read my great-grandfather's epitaph. Try it:
davidbau.com/archives/202...
What superhuman AGIs say when the boss is not around:
davidbau.com/archives/202...
Watch Claude Code grow my 780 lines to 13,600 - mandelbrot.page/coverage/ca...
Two fundamental rules for staying in control:
davidbau.com/archives/20...
In new work yesterday, @arnabsensharma.bsky.social et al. identify a data type for *predicates*.
bsky.app/profile/arn...
That's easy! (you might think) Because surely it knows: amore, amor, amour are all based on the same Latin word. It can just drop the "e", or add a "u".
I want to draw your attention to a COLM paper by my student @sfeucht.bsky.social that has totally changed the way I think and teach about LLM representations. The work is worth knowing.
And you can meet Sheridan at COLM, Oct 7!
bsky.app/profile/sfe...
Watch here: youtu.be/43NnaqGjArA
I also chat about our responsibility as machine learning scientists, and what we need to fix to get AI right.
Take a listen and reshare -
www.persuasion.community/p/david-bau
We’ve added more recent work and more immediately actionable directions for future work. Now published in Computational Linguistics!
This could be relevant to your research...
www.youtube.com/channel/UCaQ...
The truth is our superpower.
davidbau.com/archives/202...
Thursday: CDC chief dismissed, four top scientists resign.
Discredit, dismiss, blame.
History shows exactly where this three-step pattern leads.
goodfire.ai/ for sponsoring! nemiconf.github.io/summer25/
If you can't make it in person, the livestream will be here:
www.youtube.com/live/4BJBis...
Every week you will find new talks on recent research in the science of neural networks. The first few are posted: jackmerullo.bsky.social, Roy Rinberg, and me.
At the @ndif-team.bsky.social YouTube channel: www.youtube.com/@NDIFTeam
Talks, posters, meals, discussion... Most of all, an excellent chance to chat about new ideas with other great researchers in the field!
Help spread the word - register and repost -
bsky.app/profile/koy...
70b/405b LLMs use double pointers, akin to C programmers' double (**) pointers. They show up when the LLM is "knowing what Sally knows Ann knows", i.e., Theory of Mind.
bsky.app/profile/nik...
Your help is needed to fix this. The current DC plan PERMANENTLY slashes NSF, NIH, all science training. Money isn't redirected—it's gone.
Please read+share what's happening
thevisible.net/posts/004-s...