Dylan Foster 🐢
@djfoster.bsky.social
2.4K followers
830 following
89 posts
Principal Researcher in AI/ML/RL Theory @ Microsoft Research NE/NYC. Previously @ MIT, Cornell. http://dylanfoster.net
RL Theory Lecture Notes: https://arxiv.org/abs/2312.16730
Posts
Media
Videos
Starter Packs
Pinned
Reposted by Dylan Foster 🐢
Reposted by Dylan Foster 🐢
Tom Silver
@tomssilver.bsky.social
· Jun 29
The Power of Resets in Online Reinforcement Learning
Simulators are a pervasive tool in reinforcement learning, but most existing algorithms cannot efficiently exploit simulator access -- particularly in high-dimensional domains that require general fun...
arxiv.org
Reposted by Dylan Foster 🐢
Reposted by Dylan Foster 🐢
Clément Canonne
@ccanonne.github.io
· Jun 4
Reposted by Dylan Foster 🐢
Reposted by Dylan Foster 🐢
Dylan Foster 🐢
@djfoster.bsky.social
· May 9
Csaba Szepesvari (@skiandsolve.bsky.social)
⛷️ ML Theorist carving equations and mountain trails | 🚴♂️ Biker, Climber, Adventurer | 🧠 Reinforcement Learning: Always seeking higher peaks, steeper walls and better policies.
https://ualberta.ca/~...
skiandsolve.bsky.social
Dylan Foster 🐢
@djfoster.bsky.social
· May 3
Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Inference-time computation offers a powerful axis for scaling the performance of language models. However, naively increasing computation in techniques like Best-of-N sampling can lead to performance ...
arxiv.org