annalhq.bsky.social
@annalhq.bsky.social
Reposted
Do Machine Learning Models Memorize or Generalize?

In this article they’ll examine the training dynamics of a tiny model and reverse engineer the solution it finds – and in the process provide an illustration of the exciting emerging field of mechanistic interpretability.
November 9, 2024 at 6:32 AM