Nora Belrose
@norabelrose.bsky.social
960 followers
15 following
36 posts
AI, philosophy, spirituality
Head of interpretability research at EleutherAI, but posts are my own views, not Eleuther’s.
Posts
Media
Videos
Starter Packs
Nora Belrose
@norabelrose.bsky.social
· Jun 13
Nora Belrose
@norabelrose.bsky.social
· Jun 12
Nora Belrose
@norabelrose.bsky.social
· Mar 27
Nora Belrose
@norabelrose.bsky.social
· Mar 13
Mixture-of-Depths: Dynamically allocating compute in...
Transformer-based language models spread FLOPs uniformly across input sequences. In this work we demonstrate that transformers can instead learn to dynamically allocate FLOPs (or compute) to...
arxiv.org
Nora Belrose
@norabelrose.bsky.social
· Feb 24
Nora Belrose
@norabelrose.bsky.social
· Feb 24
Nora Belrose
@norabelrose.bsky.social
· Feb 7
Nora Belrose
@norabelrose.bsky.social
· Feb 6
Nora Belrose
@norabelrose.bsky.social
· Feb 3
GitHub - EleutherAI/basin-volume: Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors - EleutherAI/basin-volume
github.com
Nora Belrose
@norabelrose.bsky.social
· Feb 3
Estimating the Probability of Sampling a Trained Neural Network at Random
We present an algorithm for estimating the probability mass, under a Gaussian or uniform prior, of a region in neural network parameter space corresponding to a particular behavior, such as achieving ...
arxiv.org
Nora Belrose
@norabelrose.bsky.social
· Feb 3
Nora Belrose
@norabelrose.bsky.social
· Feb 3
Nora Belrose
@norabelrose.bsky.social
· Feb 3
Nora Belrose
@norabelrose.bsky.social
· Jan 24
Nora Belrose
@norabelrose.bsky.social
· Dec 29
Nora Belrose
@norabelrose.bsky.social
· Dec 28
Nora Belrose
@norabelrose.bsky.social
· Dec 28
There's Plenty of Room Right Here: Biological Systems as Evolved, Overloaded, Multi-scale Machines
The applicability of computational models to the biological world is an active topic of debate. We argue that a useful path forward results from abandoning hard boundaries between categories and adopt...
arxiv.org
Nora Belrose
@norabelrose.bsky.social
· Dec 20