Andreas Madsen
@andreasmadsen.bsky.social
320 followers
170 following
10 posts
Ph.D. in NLP Interpretability from Mila. Previously: independent researcher, freelancer in ML, and Node.js core developer.
Posts
Media
Videos
Starter Packs
Andreas Madsen
@andreasmadsen.bsky.social
· Nov 28
Andreas Madsen
@andreasmadsen.bsky.social
· Nov 28
Andreas Madsen
@andreasmadsen.bsky.social
· Nov 28
New Faithfulness-Centric Interpretability Paradigms for Natural Language Processing
As machine learning becomes more widespread and is used in more critical applications, it's important to provide explanations for these models, to prevent unintended behavior. Unfortunately, many curr...
arxiv.org
Andreas Madsen
@andreasmadsen.bsky.social
· Nov 28
Interpretability Needs a New Paradigm
Interpretability is the study of explaining models in understandable terms to humans. At present, interpretability is divided into two paradigms: the intrinsic paradigm, which believes that only model...
arxiv.org
Andreas Madsen
@andreasmadsen.bsky.social
· Nov 27