Dylan Foster 🐢
@djfoster.bsky.social
2.4K followers 830 following 89 posts
Principal Researcher in AI/ML/RL Theory @ Microsoft Research NE/NYC. Previously @ MIT, Cornell. http://dylanfoster.net RL Theory Lecture Notes: https://arxiv.org/abs/2312.16730
Posts Media Videos Starter Packs
Pinned
djfoster.bsky.social
As my first post on this platform, allow me to advertise the RL theory lecture notes I have been developing with Sasha Rakhlin: arxiv.org/abs/2312.16730

(shameless repost of my pinned tweet)
Reposted by Dylan Foster 🐢
mdudik.bsky.social
🚨Microsoft Research NYC is hiring🚨

We're hiring postdocs and senior researchers in AI/ML broadly, and in specific areas like test-time scaling and science of DL. Postdoc applications due Oct 22, 2025. Senior researcher applications considered on a rolling basis.

Links to apply: aka.ms/msrnyc-jobs
Microsoft Research Lab - New York City - Microsoft Research
Apply for a research position at Microsoft Research New York & collaborate with academia to advance economics research, prediction markets & ML.
aka.ms
djfoster.bsky.social
Microsoft Research New York City (www.microsoft.com/en-us/resear...) is seeking applicants for multiple Postdoctoral Researcher positions in ML/AI!

These are positions for up to 2 years, starting in July 2026.

Application deadline: October 22, 2025
Microsoft Research Lab - New York City - Microsoft Research
Apply for a research position at Microsoft Research New York & collaborate with academia to advance economics research, prediction markets & ML.
www.microsoft.com
djfoster.bsky.social
Quick reminder: The deadline for our workshop on Foundations of Reasoning in Language Models (FoRLM) at NeurIPS 2025 is next Wednesday, Sept 3!
djfoster.bsky.social
Announcing the first workshop on Foundations of Language Model Reasoning (FoRLM) at NeurIPS 2025!

📝Soliciting abstracts that advance foundational understanding of reasoning in language models, from theoretical analyses to rigorous empirical studies.

📆 Deadline: Sept 3, 2025
djfoster.bsky.social
Help us understand how reasoning emerges, where it fails, and how it can be systematically improved.

Website (& CFP/instructions): reasoning-workshop.github.io

Submission link: openreview.net/group?id=Neu...

See you in San Diego!
FoRLM @ NeurIPS'25
NeurIPS 2025 Workshop -- San Diego, California, USA
reasoning-workshop.github.io
djfoster.bsky.social
Led by Audrey Huang (ahahaudrey.bsky.social), with co-organizers Adam Block, Sadhika Malladi (sadhika.bsky.social), Will Merrill (lambdaviking.bsky.social), Pavel Izmailov, Akshay Krishnamurthy (akshaykr.bsky.social), Tatsunori Hashimoto, and myself.
Bluesky
ahaaudrey.bsky.social
djfoster.bsky.social
Featuring amazing speakers Alekh Agarwal, Yejin Choi (yejinchoinka.bsky.social), Michael Hahn (m-hahn.bsky.social), and Nathan Lambert (natolambert.bsky.social).
yejinchoinka.bsky.social
yejinchoinka.bsky.social
djfoster.bsky.social
Announcing the first workshop on Foundations of Language Model Reasoning (FoRLM) at NeurIPS 2025!

📝Soliciting abstracts that advance foundational understanding of reasoning in language models, from theoretical analyses to rigorous empirical studies.

📆 Deadline: Sept 3, 2025
djfoster.bsky.social
For those at ICML, Audrey will be presenting this paper at the 4:30pm poster session this afternoon! West Exhibition Hall B2-B3 W-1009
Reposted by Dylan Foster 🐢
gautamkamath.com
ICML's election for their board of directors has begun. I've thrown my hat in the ring. Please consider voting for Gautam Kamath.

I have experience with the governance of TMLR, COLT, and ALT, and I think I've demonstrated myself as a consciencious and engaged community member.
Reposted by Dylan Foster 🐢
tomssilver.bsky.social
This week's #PaperILike is "The Power of Resets in Online Reinforcement Learning" (Mhammedi et al., 2024).

If you're doing RL in sim, why not use the sim to its full potential? Reset to any state! (gym.Env.reset() is not all we need.)

PDF: arxiv.org/abs/2404.15417
The Power of Resets in Online Reinforcement Learning
Simulators are a pervasive tool in reinforcement learning, but most existing algorithms cannot efficiently exploit simulator access -- particularly in high-dimensional domains that require general fun...
arxiv.org
Reposted by Dylan Foster 🐢
let-all.com
📣Join us at COLT 2025 in Lyon for a community event!
📅When: Mon, June 30 | 16:00 CET
What: Fireside chat w/ Peter Bartlett & Vitaly Feldman on communicating a research agenda, followed by mentorship roundtable to practice elevator pitches & mingle w/ COLT community!
let-all.com/colt25.html
Reposted by Dylan Foster 🐢
eugenevinitsky.bsky.social
Hiring a postdoc to scale up and deploy RL-based planning onto some self-driving cars! We'll be building on arxiv.org/abs/2502.03349 and learn what the limits and challenges of RL planning are. Shoot me a message if interested and help spread the word please!

Full posting to come in a bit.
Robust Autonomy Emerges from Self-Play
Self-play has powered breakthroughs in two-player and multi-player games. Here we show that self-play is a surprisingly effective strategy in another domain. We show that robust and naturalistic drivi...
arxiv.org
Reposted by Dylan Foster 🐢
jasonhartline.bsky.social
At the IDEAL annual meeting and saw this paper presented. Basically: reducing length of chain of thought LLM computations by deleting intermediate computations, more like classical functional programming where only function call and return values are important.

arxiv.org/abs/2503.14337
PENCIL: Long Thoughts with Short Memory
While recent works (e.g. o1, DeepSeek R1) have demonstrated great promise of using long Chain-of-Thought (CoT) to improve reasoning capabilities of language models, scaling it up during test-time is c...
arxiv.org
Reposted by Dylan Foster 🐢
djfoster.bsky.social
Dhruv Rohatgi will be giving a lecture on our recent work on comp-stat tradeoffs in next-token prediction at the RL Theory virtual seminar series (rl-theory.bsky.social) tomorrow at 2pm EST! Should be a fun talk---come check it out!!
Reposted by Dylan Foster 🐢
rl-theory.bsky.social
Later today, Sikata and Marcel will talk about their recent work on oracle-efficient RL with ensembles. Join us!
djfoster.bsky.social
The abstract submission deadline for FoPt has been extended to the 21st of May (11:59pm UTC).

Submission website: openreview.net/group?id=lea...
djfoster.bsky.social
Announcing the first workshop on Foundations of Post-Training (FoPT) at COLT 2025!

📝 Soliciting abstracts/posters exploring theoretical & practical aspects of post-training and RL with language models!

🗓️ Deadline: May 19, 2025
Reposted by Dylan Foster 🐢
djfoster.bsky.social
Announcing the first workshop on Foundations of Post-Training (FoPT) at COLT 2025!

📝 Soliciting abstracts/posters exploring theoretical & practical aspects of post-training and RL with language models!

🗓️ Deadline: May 19, 2025
djfoster.bsky.social
Website (& CFP/instructions): fopt-workshop.github.io

Submission link: openreview.net/group?id=lea...

See you in Lyon!
djfoster.bsky.social
Announcing the first workshop on Foundations of Post-Training (FoPT) at COLT 2025!

📝 Soliciting abstracts/posters exploring theoretical & practical aspects of post-training and RL with language models!

🗓️ Deadline: May 19, 2025