Sham Kakade
@shamkakade.bsky.social
880 followers
87 following
5 posts
Harvard Professor.
ML and AI.
Co-director of the Kempner Institute.
https://shamulent.github.io
Posts
Media
Videos
Starter Packs
Reposted by Sham Kakade
Reposted by Sham Kakade
Reposted by Sham Kakade
Reposted by Sham Kakade
Sham Kakade
@shamkakade.bsky.social
· Nov 22
Sham Kakade
@shamkakade.bsky.social
· Nov 22
How Does Critical Batch Size Scale in Pre-training?
Training large-scale models under given resources requires careful design of parallelism strategies. In particular, the efficiency notion of critical batch size (CBS), concerning the compromise betwee...
arxiv.org
Reposted by Sham Kakade