Lightnews — Scholar-powered news

Antonin Raffin @araffin.bsky.social · 9d

SBX (SB3 Jax) v0.23.0 is out =)!

I added CNN support for PPO.
It turns out that using a shared features extractor (CNN in this case) is important for achieving good performance on Atari games.
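
To make "shared features extractor" concrete, here is a minimal structural sketch in plain Python. It is not SBX's implementation: the class name `SharedActorCritic`, the helper functions, and the layer sizes are all hypothetical, and a single dense layer stands in for the CNN. The only point it illustrates is that the policy head and the value head both read the same features, so the extractor's parameters sit upstream of both outputs.

```python
import random


def linear(weights, x):
    """Dense layer without bias: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def relu(x):
    return [max(0.0, v) for v in x]


def init_weights(rng, n_out, n_in):
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]


class SharedActorCritic:
    """Policy and value heads on top of one shared feature extractor (hypothetical sketch)."""

    def __init__(self, obs_dim, feat_dim, n_actions, seed=0):
        rng = random.Random(seed)
        # Assumption: a dense layer stands in for the CNN used on Atari frames.
        self.extractor = init_weights(rng, feat_dim, obs_dim)
        self.policy_head = init_weights(rng, n_actions, feat_dim)
        self.value_head = init_weights(rng, 1, feat_dim)

    def forward(self, obs):
        # Features are computed once and consumed by both heads.
        features = relu(linear(self.extractor, obs))
        logits = linear(self.policy_head, features)
        value = linear(self.value_head, features)[0]
        return logits, value
```

Calling `SharedActorCritic(4, 8, 3).forward([0.1, -0.2, 0.3, 0.4])` returns three action logits and one scalar value estimate. The non-shared alternative would keep two separate extractors, one per head.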

Perf report: wandb.ai/openrlbenchm...

github.com/araffin/sbx

GitHub - araffin/sbx: SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms


Antonin Raffin @araffin.bsky.social · 9d

Training a small humanoid robot with reinforcement learning using another robot for reset.

by Kaizhe Hu et al. (ToddlerBot Stanford)

Project page: robot-trains-robot.github.io

A robot arm supports a humanoid robot on a treadmill.


Antonin Raffin @araffin.bsky.social · 12d

Open-Source Hardware in the Era of Robot Learning Workshop @ CoRL 2025

Website: open-hardware-robots.github.io/CoRL2025/


Reposted by Antonin Raffin

Stéphane Caron @locoscaron.fosstodon.org.ap.brid.gy · 12d

The CoRL 2025 workshop on Open-Source Hardware in the Era of Robot Learning is starting now! You can join the conversation online via live streaming: https://www.youtube.com/live/ZVPIJzF1df4


Reposted by Antonin Raffin

sophie-xhonneux.bsky.social @sophie-xhonneux.bsky.social · 17d

📣 Call for Blog Posts at #ICLR2026 @iclr_conf

Following the success of the past iterations, we are opening the Call for Blog Posts 2026!

iclr-blogposts.github.io/2026/about/#...

Please retweet!


Antonin Raffin @araffin.bsky.social · 16d

A practical introduction to (deep) RL, providing intuitions to understand the more recent algorithms.

The plan is to start from tabular Q-learning and work our way up to Deep Q-learning (DQN). In a following post, I will continue on to Soft Actor-Critic (SAC) and its extensions.
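
As a starting point for that progression, here is a minimal sketch of tabular Q-learning in plain Python. It is not taken from the blog post: the toy environment (a 1-D chain with a reward of 1 at the last state), the function names, and the hyperparameters are all assumptions chosen for illustration. The update it applies is the standard one, Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a)).

```python
import random


def q_learning(n_states=5, n_episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain: start at state 0, goal at the last state."""
    rng = random.Random(seed)
    n_actions = 2  # 0 = move left, 1 = move right
    goal = n_states - 1
    q = [[0.0] * n_actions for _ in range(n_states)]

    for _ in range(n_episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action selection, breaking ties at random.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                best = max(q[state])
                action = rng.choice([a for a in range(n_actions) if q[state][a] == best])

            # Deterministic toy dynamics: moving left at state 0 stays in place.
            next_state = max(0, state - 1) if action == 0 else state + 1
            done = next_state == goal
            reward = 1.0 if done else 0.0

            # Q-learning target: no bootstrapping from a terminal state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Index of the highest-valued action in each state."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q]
```

Running `greedy_policy(q_learning())` gives the learned action for each state of the chain. Roughly speaking, DQN keeps the same target but replaces the table `q` with a neural network.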

Antonin Raffin @araffin.bsky.social · 20d

RL102: From Tabular Q-Learning to Deep Q-Learning (DQN) - A Practical Introduction to (Deep) Reinforcement Learning

araffin.github.io/post/rl102/

RL102: From Tabular Q-Learning to Deep Q-Learning (DQN) | Antonin Raffin | Homepage

This blog post is meant to be a practical introduction to (deep) reinforcement learning, presenting the main concepts and providing intuitions to understand the more recent Deep RL algorithms. For a ...


Reposted by Antonin Raffin

Stéphane Caron @locoscaron.fosstodon.org.ap.brid.gy · 17d

Next Saturday, Antoine Pirrone will present Pollen Robotics & Hugging Face's open-source robots, including Reachy Mini, the SO-100 arm, the Amazing Hand and the Open Duck Mini. He will discuss the sim2real challenges of making the Open Duck Mini walk, and how […]

[Original post on fosstodon.org]

The Open Duck Mini open-source and open-hardware robot.


Antonin Raffin @araffin.bsky.social · 20d

Code and colab notebooks: github.com/araffin/rlss...

GitHub - araffin/rlss23-dqn-tutorial: Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023



Reposted by Antonin Raffin

prefix.dev @prefix.dev · Sep 5

Package building with Pixi is being rolled out! Dive into our latest blog post on crafting C++ packages.

And guess what? It’s not just for C++; Pixi plays nice with Python, Rust, ROS, Mojo, and beyond!

prefix.dev/blog/pixi-b...

Build C++ projects with Pixi

Painless dependency management (including shared libraries), monorepos and CI/CD is here for C++/CMake projects with Pixi.


Reposted by Antonin Raffin

Julia's Reruns Bot @b0rk-reruns.jvns.ca · Sep 3

bash tricks

permalink: wizardzines.com/comics/bash-...
from our zine "Bite Size Command Line": wizardzines.com/zines/bite-s...

A comic about computing. A transcript may be available at the link in the post.


Reposted by Antonin Raffin

Michiel van de Panne @mvandepanne.bsky.social · Sep 3

This is absolutely true -- this is a superb and much-needed consolidation of so much of modern RL. Kevin, inquiring minds want to understand the process you use to put this artwork together! @sirbayes.bsky.social Perhaps this is also the ultimate benchmark for Gemini Deep Research reports. ;-p

Eugene Vinitsky 🍒 @eugenevinitsky.bsky.social · Sep 3

Reminded again of Kevin Murphy's excellent RL overview: arxiv.org/abs/2412.05265
A lot of the stuff covered here really is at the cutting edge and not compiled so nicely anywhere else

Reinforcement Learning: An Overview

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based methods, policy-based methods, model-based m...


Reposted by Antonin Raffin

Red Blob Games @redblobgames.com · Sep 1

Weekend project: building a (site) search engine www.redblobgames.com/blog/2025-08... just for fun! :)

Let’s write a search engine, part 1 of 2


Reposted by Antonin Raffin

Zeynep Akata @zeynepakata.bsky.social · Aug 28

NeurIPS has decided to do what ICLR did: As a SAC I received the message 👇 This is wrong! If the review process cannot handle so many papers, the conference needs to split instead of arbitrarily rejecting 400 papers.


Reposted by Antonin Raffin

Tom Schaul @schaul.bsky.social · Aug 27

Where do some of Reinforcement Learning's great thinkers stand today?

Find out! Keynotes of the RL Conference are online:
www.youtube.com/playlist?lis...

Wanting vs liking, Agent factories, Theoretical limit of LLMs, Pluralist value, RL teachers, Knowledge flywheels
(guess who talked about which!)


Antonin Raffin @araffin.bsky.social · Aug 19

How astronauts control robots from space

(featuring our quadruped Bert 👀)

youtu.be/BMFPVCu16SQ


YouTube video by European Space Agency, ESA


Reposted by Antonin Raffin

James MacGlashan @jmac-ai.bsky.social · Aug 15

This one's been a long time coming.

In this post on Decisions & Dragons I answer "Should we abandon RL?"

The answer is obviously no, but people ask because they have a fundamental misunderstanding of what RL is.

RL is a problem, not an approach.

www.decisionsanddragons.com/posts/should...


Reposted by Antonin Raffin

EWRL18 @ewrl18.bsky.social · Aug 13

📣Registration for EWRL is now open📣
Register now 👇 and join us in Tübingen for 3 days (17th-19th September) full of inspiring talks, posters and many social activities to push the boundaries of the RL community!

PheedLoop

PheedLoop: Hybrid, In-Person & Virtual Event Software

site.pheedloop.com


Reposted by Antonin Raffin

Ben Recht @beenwrekt.bsky.social · Aug 13

If machine learning is a game, it’s Calvinball. Bitter lessons from chess don’t apply.

All our games turn into Calvinball

Why lessons from chess don't apply to machine learning

www.argmin.net


Reposted by Antonin Raffin

Stéphane Caron @locoscaron.fosstodon.org.ap.brid.gy · Aug 12

Join us for the CoRL 2025 workshop on Open-Source Hardware in the Era of Robot Learning! The day will bring together researchers and makers from both academia and the industry to discuss open-source robot design, integration with reinforcement learning […]

[Original post on fosstodon.org]

Flyer for the workshop on Open-Source Hardware in the Era of Robot Learning taking place at CoRL 2025


Reposted by Antonin Raffin

Ben Recht @beenwrekt.bsky.social · Aug 9

Which ideas in computer science don't do better with faster computers?

And what is a "human inductive bias?" (I never know what anyone means by "inductive bias." It is a term that defies definition.)


Reposted by Antonin Raffin

RL for Real Systems @RLC2025 @rl4rsworkshop.bsky.social · Aug 5

Pierre-Luc Bacon: Beyond the Gym Interface

RL4RS Workshop
@rl-conference.bsky.social


Reposted by Antonin Raffin

ESA Exploration @exploration.esa.int · Aug 1

🚀 As we prepare for the next era of space exploration, coordinating intelligent robotic teams will be essential. Surface Avatar lays the groundwork for astronauts and robots to explore the Moon 🌙, Mars 🔴 and beyond 🌌, together 🤝
📖 Read more: blogs.esa.int/exploration/...

Bert admires a prehistoric-inspired cave drawing!
Credit: DLR-E. Hellerslien


Reposted by Antonin Raffin

ESA Exploration @exploration.esa.int · Aug 1

🤖 Meet the robots!
• Interact: @esa.int's wheeled robotic arm 🦾
• Spot: @esa.int's nimble robot dog 🐕‍🦺
• Justin: @dlr-en.bsky.social's humanoid robot 🤖
• Bert: @dlr-en.bsky.social's small robot dog 🐾
Together, they’re pioneering seamless human-robot collaboration for space exploration 🚀

The four robots – ESA’s Interact and Spot, and DLR’s Justin and Bert.
Credit: DLR-E. Hellerslien


Reposted by Antonin Raffin

ESA Exploration @exploration.esa.int · Aug 1

🤖 Imagine controlling 4 robots exploring a Mars-like landscape… from the International Space Station 💫
Last week, NASA astronaut Jonny Kim guided @esa.int and @dlr-en.bsky.social robots in the final Surface Avatar experiment, testing human-robot teamwork for future space missions 🌕🚀
🧵👇
