Stephanie Chan
@scychan.bsky.social
940 followers 280 following 15 posts
Staff Research Scientist at Google DeepMind. Artificial and biological brains 🤖 🧠
Reposted by Stephanie Chan
lampinen.bsky.social
In neuroscience, we often try to understand systems by analyzing their representations — using tools like regression or RSA. But are these analyses biased towards discovering a subset of what a system represents? If you're interested in this question, check out our new commentary! Thread:
[Image: "What do representations tell us about a system?" A mouse with a scope, shown with a vector of activity patterns, alongside a neural network with a vector of unit activity patterns.]
[Image: Common analyses of neural representations. Encoding models: relating activity to task features (an arrow from a stimulus trace to a neuron and its spike train). Comparing models via neural predictivity: comparing two neural networks by their R^2 to mouse brain activity. RSA: assessing brain-brain or model-brain correspondence using representational dissimilarity matrices.]
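[Editor's note: as a rough illustration of the RSA analysis named above (not taken from the commentary itself), here is a minimal sketch in Python/NumPy; the random "brain" and "model" activity matrices are placeholders, and correlation distance plus Spearman comparison is just one common choice.]

# Minimal RSA sketch: compare two systems' representations of the same
# stimuli via representational dissimilarity matrices (RDMs).
import numpy as np
from scipy.spatial.distance import pdist, squareform
from scipy.stats import spearmanr

def rdm(activity):
    """Pairwise correlation-distance RDM for a (stimuli x units) activity matrix."""
    return squareform(pdist(activity, metric="correlation"))

rng = np.random.default_rng(0)
brain_like = rng.normal(size=(50, 200))                 # e.g. 50 stimuli x 200 neurons
model_like = brain_like @ rng.normal(size=(200, 300))   # a linear re-encoding of the same signal

# Compare only the upper triangles (RDMs are symmetric with zero diagonal).
iu = np.triu_indices(50, k=1)
rho, _ = spearmanr(rdm(brain_like)[iu], rdm(model_like)[iu])
print(f"RDM similarity (Spearman rho): {rho:.2f}")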
scychan.bsky.social
Great new paper by @jessegeerts.bsky.social, looking at a certain type of generalization in transformers -- transitive inference -- and what conditions induce this type of generalization
jessegeerts.bsky.social
🧠 How do transformers learn relational reasoning? We trained small transformers on transitive inference (if A>B and B>C, then A>C) and discovered striking differences between learning paradigms. Our latest work reveals when and why AI systems generalize beyond training data 🤖
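[Editor's note: for readers unfamiliar with the task, here is a toy sketch of how transitive inference can be posed for a sequence model. The item set, prompt format, and train/test split below are illustrative assumptions, not necessarily the paper's exact setup.]

# Toy transitive-inference task: items A..G have a hidden rank; training
# shows only adjacent-pair comparisons, and generalization is tested on
# non-adjacent pairs (e.g. if A>B and B>C, infer A>C).
import random

items = list("ABCDEFG")                      # hidden order: A > B > ... > G
rank = {x: i for i, x in enumerate(items)}   # lower index = "greater" item

def label(a, b):
    return ">" if rank[a] < rank[b] else "<"

# Training pairs: adjacent in the hidden order (premise pairs), both directions.
train_pairs = [(items[i], items[i + 1]) for i in range(len(items) - 1)]
train_pairs += [(b, a) for a, b in train_pairs]

# Test pairs: non-adjacent, never seen during training.
test_pairs = [(a, b) for a in items for b in items
              if a != b and abs(rank[a] - rank[b]) > 1]

def to_example(a, b):
    # e.g. input tokens ["B", "?", "E"], target token ">" or "<"
    return ([a, "?", b], label(a, b))

random.seed(0)
print("train:", to_example(*random.choice(train_pairs)))
print("test :", to_example(*random.choice(test_pairs)))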
scychan.bsky.social
New paper: Generalization from context often outperforms generalization from finetuning.

And you might get the best of both worlds by spending extra compute and training time to augment finetuning.
lampinen.bsky.social
How do language models generalize from information they learn in-context vs. via finetuning? In arxiv.org/abs/2505.00661 we show that in-context learning can generalize more flexibly, illustrating key differences in the inductive biases of these modes of learning — and ways to improve finetuning. 1/
arxiv.org
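[Editor's note: a rough, hypothetical sketch of the two learning modes contrasted above, and of one way "extra compute" could augment the finetuning set with in-context inferences. The function names and the augmentation recipe are illustrative assumptions, not the paper's exact method; model_generate is a stub standing in for any LM call.]

def model_generate(prompt: str) -> str:
    return "stub completion"  # placeholder for an actual LM sampling call

new_facts = ["B is the mother of A."]  # information the model should learn

# (1) In-context learning: put the new information directly in the prompt.
icl_answer = model_generate("\n".join(new_facts) + "\nQuestion: Who is A's mother?\n")

# (2) Finetuning: train on the new facts, then query without them in context.
#     (finetune() is hypothetical; in practice this is a standard SFT loop.)
# finetuned = finetune(model, new_facts)
# ft_answer = finetuned.generate("Question: Who is A's mother?")

# (3) Augmented finetuning: spend extra compute asking the model, in context,
#     to spell out implications of each fact, and add those to the training set.
augmented = list(new_facts)
for fact in new_facts:
    implication = model_generate(f"{fact}\nRestate an implication of this fact:\n")
    augmented.append(implication)
print(augmented)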
scychan.bsky.social
It was such a pleasure to co-supervise this research, but
@aaditya6284.bsky.social should really take the bulk of the credit :)

And thank you so much to all our wonderful collaborators, who made fundamental contributions as well!
Ted Moskovitz, Sara Dragutinovic, Felix Hill, @saxelab.bsky.social
scychan.bsky.social
This paper is dedicated to our collaborator
Felix Hill, who passed away recently. This is our last ever paper with him.

It was bittersweet to finish this research, which contains so much of the scientific spark that he shared with us. Rest in peace Felix, and thank you so much for everything.
scychan.bsky.social
Some general takeaways for interp:
scychan.bsky.social
4. We provide intuition for these dynamics through a simple mathematical model.
scychan.bsky.social
3. A lot of previous work (including our own) has emphasized *competition* between in-context and in-weights learning.

But we find that cIWL and ICL actually compete AND cooperate, via shared subcircuits. In fact, ICL cannot emerge if cIWL is blocked from emerging, even though ICL emerges first!
scychan.bsky.social
2. At the end of training, ICL doesn't give way to in-weights learning (IWL), as we previously thought. Instead, the model prefers a surprising strategy that is a *combination* of the two!

We call this combo "cIWL" (context-constrained in-weights learning).
scychan.bsky.social
1. We aimed to better understand the transience of in-context learning (ICL) -- where ICL can emerge but then disappear after long training times.
scychan.bsky.social
Dropping a few high-level takeaways in this thread.

For more details please see Aaditya's thread,
or the paper itself.
bsky.app/profile/aadi...
arxiv.org/abs/2503.05631
scychan.bsky.social
New work led by
@aaditya6284.bsky.social

"Strategy coopetition explains the emergence and transience of in-context learning in transformers."

We find some surprising things!! E.g. that circuits can simultaneously compete AND cooperate ("coopetition") 😯 🧵👇
Reposted by Stephanie Chan
lampinen.bsky.social
What counts as in-context learning (ICL)? Typically, you might think of it as learning a task from a few examples. However, we've just written a perspective (arxiv.org/abs/2412.03782) suggesting that a much broader spectrum of behaviors can be interpreted as ICL! Quick summary thread: 1/7
The broader spectrum of in-context learning
The ability of language models to learn a task from a few examples in context has generated substantial interest. Here, we provide a perspective that situates this type of supervised few-shot learning...
arxiv.org
Reposted by Stephanie Chan
noemielteto.bsky.social
Introducing the :milkfoamo: emoji
scychan.bsky.social
Hahaha. We need a cappuccino emoji?!
scychan.bsky.social
I won't be at NeurIPS this week. Let's grab coffee if you want to fomo-commiserate with me
scychan.bsky.social
Hello hello. Testing testing 123