Lightnews — Scholar-powered news

Maxime Peyrard

@peyrardmax.bsky.social

570 followers 170 following 8 posts

Junior Professor CNRS (previously EPFL, TU Darmstadt) -- AI Interpretability, causal machine learning, and NLP. Currently visiting @NYU https://peyrardm.github.io

peyrardm.github.io

Posts Media Videos Starter Packs

Reposted by Maxime Peyrard

Maxime Méloux @maximemeloux.bsky.social · Apr 26

I'm very happy to present our work "Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?" this afternoon at #ICLR2025! Come have a chat at stand #439 :)

1 10

Maxime Peyrard @peyrardmax.bsky.social · Apr 21

What can be done?

👉 Stricter validity criteria?
👉 Maybe interpretability is inherently underdetermined? and we can only get control and predictability but not "understanding"

This is a fascinating topic, and we keep investigating. If you're interested, come and chat at ICLR!

Maxime Peyrard @peyrardmax.bsky.social · Apr 21

We find a lot of identifiability issues:
- Multiple explanatory algorithms exists
- Even for one algorithm, there are many localizations in the network

Identifiability problems remain across scenarios: changing levels of over-parametrization, progress in training, multi-tasks, model size.

Maxime Peyrard @peyrardmax.bsky.social · Apr 21

In our work, we stress-test the identifiability of research programs of MI with small MLPs and simple boolean logic tasks.
Why? It allows us to enumerate all possible explanations and see how many pass various MI testing criteria.

Maxime Peyrard @peyrardmax.bsky.social · Apr 21

This brings us to identifiability. In statistics a property is identifiable if a unique value is compatible with the data. Identifiability matters because it is a prerequisite for doing statistical and causal inference.

Interpretability is also an exercise in causal inference!

Maxime Peyrard @peyrardmax.bsky.social · Apr 21

Mechanistic Interpretability aims to produce statements like: "Model M solves task T by doing X."
To do so, many causal manipulations are performed to validate an explanation. But what if (many) other, incompatible explanations also pass the causal tests?

Illustration of different strategies for mechanistic interpretability

Maxime Peyrard @peyrardmax.bsky.social · Apr 21

Our paper "Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?" will be presented at #ICLR2025!
It's also the first paper of my first PhD student, congrats @maximemeloux.bsky.social ! 🎉

blog: melouxm.github.io/MI-identifia...

An explanatory thread 🧵:

1 9 17

Reposted by Maxime Peyrard

Tom Aarsen @tomaarsen.com · Mar 10

An assembly of 18 European companies, labs, and universities have banded together to launch 🇪🇺 EuroBERT!

It's a state-of-the-art multilingual encoder for 15 European languages, designed to be finetuned for retrieval, classification, etc.

Details in 🧵

5 20 81

Reposted by Maxime Peyrard

kyunghyuncho.bsky.social @kyunghyuncho.bsky.social · Feb 18

bc i haven't done so yet, i decided to burn any remaining bridge to the land of statistics. it wasn't statisticians nor statistics but it was me. i am simply not good enough to do statistics myself.

so, @peyrardmax.bsky.social and i decided to turn statistical estimation into supervised learning.

3 9 30

Reposted by Maxime Peyrard

Kristina Gligoric @gligoric.bsky.social · Dec 4

Check out our new paper on social determinants of on-campus food choice, now out in @pnasnexus.org!

academic.oup.com/pnasnexus/ar...

3 12

Maxime Peyrard @peyrardmax.bsky.social · Nov 24

Hey, thanks for making it, can you also add me

Reposted by Maxime Peyrard

Julian Skirzynski @jskirzynski.bsky.social · Nov 23

I tried to find everyone who works in the area but I certainly missed some folks so please lmk...
go.bsky.app/BYkRryU

32 18 53

Maxime Peyrard @peyrardmax.bsky.social · Nov 23

Thanks for creating the pack, I am also working on this topic :)