Antonin Raffin
@araffin.bsky.social
3.3K followers 230 following 91 posts
Researcher in robotics and machine learning (Reinforcement Learning). Maintainer of Stable-Baselines3 (SB3). https://araffin.github.io/
Pinned
araffin.bsky.social
Post your most popular 🐦 from Twitter

Types of Reinforcement Learning Paper
Original image: @xkcd.com
Types of reinforcement learning papers, using xkcd original artwork
araffin.bsky.social
SBX (SB3 Jax) v0.23.0 is out =)!

I added CNN support for PPO.
It turns out that using a shared features extractor (CNN in this case) is important for achieving good performance on Atari games.

Perf report: wandb.ai/openrlbenchm...

github.com/araffin/sbx
GitHub - araffin/sbx: SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms
SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms - araffin/sbx
github.com
araffin.bsky.social
Training a small humanoid robot with reinforcement learning using another robot for reset.

by Kaizhe Hu et al. (ToddlerBot Stanford)

Project page: robot-trains-robot.github.io
A robot arm supports a humanoid robot on a treadmill
araffin.bsky.social
Open-Source Hardware in the Era of Robot Learning Workshop @ CoRL 2025

Website: open-hardware-robots.github.io/CoRL2025/
Reposted by Antonin Raffin
locoscaron.fosstodon.org.ap.brid.gy
The CoRL 2025 workshop on Open-Source Hardware in the Era of Robot Learning is starting now! You can join the conversation online via live streaming: https://www.youtube.com/live/ZVPIJzF1df4
Reposted by Antonin Raffin
sophie-xhonneux.bsky.social
📣 Call for Blog Posts at #ICLR2026 @iclr_conf

Following the success of the past iterations, we are opening the Call for Blog Posts 2026!

iclr-blogposts.github.io/2026/about/#...

Please retweet!
araffin.bsky.social
A practical introduction to (deep) RL, providing intuitions to understand the more recent algorithms.

The plan is to start from tabular Q-learning and work our way up to Deep Q-learning (DQN). In a following post, I will continue on to Soft Actor-Critic (SAC) and its extensions.
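
To give a flavor of the starting point mentioned above, here is a minimal sketch of tabular Q-learning. It is not taken from the blog post: the 5-state chain environment, the `step` and `train` helpers, and the hyperparameters (`alpha`, `gamma`, `epsilon`) are all assumptions chosen for illustration.

```python
import random

N_STATES = 5          # hypothetical chain: states 0..4, state 4 is the goal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right


def step(state, action):
    """Toy deterministic environment (an assumption, for illustration only)."""
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(N_STATES - 1, state + 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(n_episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    # The Q-table stores one estimated return per (state, action) pair
    q_table = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(n_episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy exploration, with random tie-breaking
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                best = max(q_table[state])
                action = rng.choice(
                    [a for a in ACTIONS if q_table[state][a] == best]
                )
            next_state, reward, done = step(state, action)
            # Q-learning target: bootstrap from the best next action,
            # except at terminal states where there is no future return
            if done:
                target = reward
            else:
                target = reward + gamma * max(q_table[next_state])
            q_table[state][action] += alpha * (target - q_table[state][action])
            state = next_state
    return q_table


q_table = train()
# Greedy action for each non-terminal state
greedy_policy = [
    max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)
]
```

On this toy chain, the greedy policy ends up choosing "right" in every non-terminal state. DQN keeps the same update target but replaces the table with a neural network, which is roughly the step the post describes.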
Reposted by Antonin Raffin
locoscaron.fosstodon.org.ap.brid.gy
Next Saturday, 𝗔𝗻𝘁𝗼𝗶𝗻𝗲 𝗣𝗶𝗿𝗿𝗼𝗻𝗲 will present Pollen Robotics & Hugging Face's open-source robots, including Reachy Mini, the SO-100 arm, the Amazing Hand and the Open Duck Mini. He will discuss the sim2real challenges of making the Open Duck Mini walk, and how […]

[Original post on fosstodon.org]
The Open Duck Mini open-source and open-hardware robot.
Reposted by Antonin Raffin
prefix.dev
Package building with Pixi is being rolled out! Dive into our latest blog post on crafting C++ packages.

And guess what? It’s not just for C++; Pixi plays nice with Python, Rust, ROS, Mojo, and beyond!

prefix.dev/blog/pixi-b...
Build C++ projects with Pixi
Painless dependency management (including shared libraries), monorepos and CI/CD is here for C++/CMake projects with Pixi.
prefix.dev
Reposted by Antonin Raffin
mvandepanne.bsky.social
This is absolutely true -- this is a superb and much-needed consolidation of so much of modern RL. Kevin, inquiring minds want to understand the process you use to put this artwork together! @sirbayes.bsky.social Perhaps this is also the ultimate benchmark for Gemini Deep Research reports. ;-p
Reposted by Antonin Raffin
zeynepakata.bsky.social
NeurIPS has decided to do what ICLR did: As a SAC I received the message 👇 This is wrong! If the review process cannot handle so many papers, the conference needs to split instead of arbitrarily rejecting 400 papers.
Reposted by Antonin Raffin
schaul.bsky.social
Where do some of Reinforcement Learning's great thinkers stand today?

Find out! Keynotes of the RL Conference are online:
www.youtube.com/playlist?lis...

Wanting vs liking, Agent factories, Theoretical limit of LLMs, Pluralist value, RL teachers, Knowledge flywheels
(guess who talked about which!)
Reposted by Antonin Raffin
jmac-ai.bsky.social
This one's been a long time coming.

In this post on Decisions & Dragons I answer "Should we abandon RL?"

The answer is obviously no, but people ask because they have a fundamental misunderstanding of what RL is.

RL is a problem, not an approach.

www.decisionsanddragons.com/posts/should...
Reposted by Antonin Raffin
ewrl18.bsky.social
📣Registration for EWRL is now open📣
Register now 👇 and join us in Tübingen for 3 days (17th-19th September) full of inspiring talks, posters and many social activities to push the boundaries of the RL community!
PheedLoop
PheedLoop: Hybrid, In-Person & Virtual Event Software
site.pheedloop.com
Reposted by Antonin Raffin
beenwrekt.bsky.social
If machine learning is a game, it’s Calvinball. Bitter lessons from chess don’t apply.
All our games turn into Calvinball
Why lessons from chess don't apply to machine learning
www.argmin.net
Reposted by Antonin Raffin
locoscaron.fosstodon.org.ap.brid.gy
Join us for the 𝗖𝗼𝗥𝗟 𝟮𝟬𝟮𝟱 𝘄𝗼𝗿𝗸𝘀𝗵𝗼𝗽 𝗼𝗻 𝗢𝗽𝗲𝗻-𝗦𝗼𝘂𝗿𝗰𝗲 𝗛𝗮𝗿𝗱𝘄𝗮𝗿𝗲 𝗶𝗻 𝘁𝗵𝗲 𝗘𝗿𝗮 𝗼𝗳 𝗥𝗼𝗯𝗼𝘁 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴! The day will bring together researchers and makers from both academia and the industry to discuss open-source robot design, integration with reinforcement learning […]

[Original post on fosstodon.org]
Flyer for the workshop on Open-Source Hardware in the Era of Robot Learning taking place at CoRL 2025
Reposted by Antonin Raffin
beenwrekt.bsky.social
Which ideas in computer science don't do better with faster computers?

And what is a "human inductive bias?" (I never know what anyone means by "inductive bias." It is a term that defies definition.)
Reposted by Antonin Raffin
rl4rsworkshop.bsky.social
Pierre-Luc Bacon: Beyond the Gym Interface

RL4RS Workshop
@rl-conference.bsky.social
Reposted by Antonin Raffin
exploration.esa.int
🚀 As we prepare for the next era of space exploration, coordinating intelligent robotic teams will be essential. Surface Avatar lays the groundwork for astronauts and robots to explore the Moon 🌙, Mars 🔴 and beyond 🌌, together 🤝
📖 Read more: blogs.esa.int/exploration/...
Bert admires a prehistoric-inspired cave drawing! 
Credit: DLR-E. Hellerslien
Reposted by Antonin Raffin
exploration.esa.int
🤖 Meet the robots!
• Interact: @esa.int's wheeled robotic arm 🦾
• Spot: @esa.int's nimble robot dog 🐕‍🦺
• Justin: @dlr-en.bsky.social's humanoid robot 🤖
• Bert: @dlr-en.bsky.social's small robot dog 🐾
Together, they’re pioneering seamless human-robot collaboration for space exploration 🚀
The four robots – ESA’s Interact and Spot, and DLR’s Justin and Bert.
Credit: DLR-E. Hellerslien
Reposted by Antonin Raffin
exploration.esa.int
🤖 Imagine controlling 4 robots exploring a Mars-like landscape… from the International Space Station 💫
Last week, NASA astronaut Jonny Kim guided @esa.int and @dlr-en.bsky.social robots in the final Surface Avatar experiment, testing human-robot teamwork for future space missions 🌕🚀
🧵👇
Joint ESA/DLR Surface Avatar team with the four robots.
Credit: ESA-F. Malavasi