Lightnews — Scholar-powered news

Brenden Lake @brendenlake.bsky.social · Sep 8

I am also trying something new: posting our current and future directions directly on the lab website. Interested in joining us or collaborating? Get in touch! (2/2) lake-lab.github.io/apply/

1

Brenden Lake @brendenlake.bsky.social · Sep 8

Our new lab for Human & Machine Intelligence is officially open at Princeton University!

Consider applying for a PhD or Postdoc position, either through Computer Science or Psychology. You can register interest on our new website lake-lab.github.io (1/2)

2 15 51

Reposted by Brenden Lake

Tom McCoy @rtommccoy.bsky.social · Aug 15

For much more, see the paper! arxiv.org/abs/2508.05776

By Tom Griffiths, Brenden Lake, Tom McCoy, Ellie Pavlick, and Taylor Webb (@cocoscilab.bsky.social‬, @brendenlake.bsky.social‬, ‪@rtommccoy.bsky.social‬, Ellie Pavlick, @taylorwwebb.bsky.social‬)

9/9

Whither symbols in the era of advanced neural networks?

Some of the strongest evidence that human minds should be thought about in terms of symbolic systems has been the way they combine ideas, produce novelty, and learn quickly. We argue that modern neura...

arxiv.org

1 10

Brenden Lake @brendenlake.bsky.social · Aug 1

Getting the lab + alums together at CogSci!

10

Brenden Lake @brendenlake.bsky.social · Jun 12

Some exciting Princeton Initiatives:

Natural and Artificial Minds
nam.ai.princeton.edu

Princeton AI Lab
ai.princeton.edu/ai-lab

Princeton Language and Intelligence
pli.princeton.edu

3

Brenden Lake @brendenlake.bsky.social · Jun 12

It's hard to leave NYU. I'll miss my incredible colleagues and the community that's meant so much over the past 8 years. NYU has become the largest hub for computational cognitive science that I know — it's been a joy and a privilege to be part of that. Thankfully, Princeton isn't too far.

1 4

Brenden Lake @brendenlake.bsky.social · Jun 12

I'm joining Princeton University as an Associate Professor of Computer Science and Psychology this fall! Princeton is ambitiously investing in AI and Natural & Artificial Minds, and I'm excited for my lab to contribute. Recruiting postdocs and Ph.D. students in CS and Psychology — join us!

Nassau Hall. Photo credit to Debbie and John O'Boyle

4 2 47

Reposted by Brenden Lake

Guy Davidson @guydav.bsky.social · May 30

Fantastic new work by @johnchen6.bsky.social (with @brendenlake.bsky.social and me trying not to cause too much trouble).

We study systematic generalization in a safety setting and find LLMs struggle to consistently respond safely when we vary how we ask naive questions. More analyses in the paper!

John (Yueh-Han) Chen @johnchen6.bsky.social · May 29

Do LLMs show systematic generalization of safety facts to novel scenarios?

Introducing our work SAGE-Eval, a benchmark consisting of 100+ safety facts and 10k+ scenarios to test this!

- Claude-3.7-Sonnet passes only 57% of facts evaluated
- o1 and o3-mini passed <45%! 🧵

3 10

Brenden Lake @brendenlake.bsky.social · May 29

Failures of systematic generalization in LLMs can lead to real-world safety issues.

New paper by @johnchen6.bsky.social and @guydav.bsky.social, arxiv.org/abs/2505.21828

John (Yueh-Han) Chen @johnchen6.bsky.social · May 29

Do LLMs show systematic generalization of safety facts to novel scenarios?

Introducing our work SAGE-Eval, a benchmark consisting of 100+ safety facts and 10k+ scenarios to test this!

- Claude-3.7-Sonnet passes only 57% of facts evaluated
- o1 and o3-mini passed <45%! 🧵

2 5

Brenden Lake @brendenlake.bsky.social · May 23

Before LLMs, neural nets were task-specific (while humans were task-general). Shockingly, LLMs changed that. How do LLMs represent a task, and do different prompts lead to the same task rep.? Love this by @guydav.bsky.social, and the function vectors of @ericwtodd.bsky.social @davidbau.bsky.social

Guy Davidson @guydav.bsky.social · May 23

New preprint alert! We often prompt ICL tasks using either demonstrations or instructions. How much does the form of the prompt matter to the task representation formed by a language model? Stick around to find out 1/N

1 4

Reposted by Brenden Lake

Mark Ho @markkho.bsky.social · Apr 4

🤔 Interested in models of social interaction and computational psychiatry?

🤗 If so, @shawnrhoadsphd.bsky.social and I are seeking a highly motivated and talented postdoc to work on these topics!

Please share widely!

apply.interfolio.com/165809

We are hiring! Interested in computational models of social interaction and computational psychiatry?

23 36

Reposted by Brenden Lake

Fred Callaway @fredcallaway.bsky.social · May 7

Despite the world being on fire, I can't help but be thrilled to announce that I'll be starting as an Assistant Professor in the Cognitive Science Program at Dartmouth in Fall '26. I'll be recruiting grad students this upcoming cycle—get in touch if you're interested!

17 24 140

Brenden Lake @brendenlake.bsky.social · May 13

Amazing, congratulations Fred!!

2

Reposted by Brenden Lake

NYU Center for Data Science @nyudatascience.bsky.social · Apr 10

New work in Nature Machine Intelligence by @guydav.bsky.social, @brendenlake.bsky.social, Todd Gureckis, Graham Todd, and Julian Togelius models how humans develop goals—research that could help bridge the gap between human intentions and AI systems.

nyudatascience.medium.com/what-is-a-go...

What is a Goal? Advancing Machine Agency While Understanding Human Goal Creation

New CDS research models how people understand and formulate goals — knowledge that could improve AI alignment

nyudatascience.medium.com

4 12

Brenden Lake @brendenlake.bsky.social · Feb 21

I snuck a moment with my son Logan (2.5), ever the creative goal generator, into Fig. 1: "Papa, I made a Truck Carrier Truck!"
How do people compose existing concepts to create new goals? Can models generate and understand goals too?
nature.com/articles/s4225

1 17

Brenden Lake @brendenlake.bsky.social · Dec 20

@solimlegris.bsky.social and Wai Keen Vong estimated that average human performance on ARC is about 64%(public eval set). Thus, o3 is clearly better than the average crowd worker tested. Note that almost all tasks were solvable by at least one person who tried it on MTurk. arxiv.org/abs/2409.01374

1 17

Brenden Lake @brendenlake.bsky.social · Dec 19

I'm new here. I heard bluesky is like science Twitter back in the day, and there are fewer posts from Elon Musk. Did I come to the right place?

2 14