Lightnews — Scholar-powered news

Sumedh Hindupur

@sumedh-hindupur.bsky.social

18 followers 12 following 9 posts

Grad Student at Harvard SEAS
Interested in ML Interpretability, Computational Neuroscience, Signal Processing

New preprint alert!
Do Sparse Autoencoders (SAEs) reveal all concepts a model relies on? Or do they impose hidden biases that shape what we can even detect?
We uncover a fundamental duality between SAE architectures and the concepts they can recover.
Link: arxiv.org/abs/2503.01822

March 7, 2025 at 2:48 AM
