Annabelle Michael Carrell
@ab-carrell.bsky.social
23 followers 4 following 3 posts
PhD student in ML
Posts Media Videos Starter Packs
ab-carrell.bsky.social
So you want to skip our thinning proofs—but you’d still like our out-of-the-box attention speedups? I’ll be presenting the Thinformer at two ICML workshop posters tomorrow!

Catch me at Es-FoMo (1-2:30, East hall A) and at LCFM (10:45-11:30 & 3:30-4:30, West 202-204)
ab-carrell.bsky.social
Your data is low-rank, so stop wasting compute! In our new paper on low-rank thinning, we share one weird trick to speed up Transformer inference, SGD training, and hypothesis testing at scale. Come by ICML poster W-1012 Tuesday at 4:30!
lestermackey.bsky.social
New guarantees for approximating attention, accelerating SGD, and testing sample quality in near-linear time
ab-carrell.bsky.social
Your data is low-rank, so stop wasting compute! In our new paper on low-rank thinning, we share one weird trick to speed up Transformer inference, SGD training, and hypothesis testing at scale. Come by ICML poster W-1012 Tuesday at 4:30!
lestermackey.bsky.social
New guarantees for approximating attention, accelerating SGD, and testing sample quality in near-linear time
https://arxiv.org/abs/2502.12063
Reposted by Annabelle Michael Carrell