Lightnews — Scholar-powered news

vmoens

@vmoens.bsky.social

2.1K followers 640 following 92 posts

Member of technical staff @periodiclabs
Open-source/open science advocate
Maintainer of torchrl / tensordict / leanrl

Former MD - Neuroscience PhD

https://github.com/vmoens

Posts Replies Media Videos

vmoens

@vmoens.bsky.social

March 19, 2025 at 2:57 PM

vmoens

@vmoens.bsky.social

Today we're opensourcing MLGym, an API for AI research agents.

MLGym relies on a gym environment that wraps a docker image. Each env has a task specified as a YAML file, telling in plain english what you want your LLM to achieve
👇

February 21, 2025 at 4:44 PM

vmoens

@vmoens.bsky.social

Wrong answers only:
What does this `Human-computer` sticker seen at neurips hide?

December 19, 2024 at 10:53 AM

vmoens

@vmoens.bsky.social

Check out Motivo, a behavioral foundation model for humanoid control by FAIR.

It's a one-of-its-kind unsupervised RL project, and it comes with a demo that is SO fun to play with!

metamotivo.metademolab.com

(for the record, they use compile and cudagraphs -> github.com/facebookrese...)

December 14, 2024 at 12:44 AM

vmoens

@vmoens.bsky.social

I’m 100% sure this button never does anything

December 13, 2024 at 4:18 AM

vmoens

@vmoens.bsky.social

Built on TorchRL and PyTorch, BenchMARL ensures high performance and state-of-the-art implementations, while its flexible configuration and standardized reporting make it a breeze to use.

December 11, 2024 at 12:47 AM

vmoens

@vmoens.bsky.social

BenchMARL is a cutting-edge training library designed to bring standardized benchmarking to the world of Multi-Agent Reinforcement Learning (MARL). It allows for easy comparison across different algorithms, models, and environments, making it a game-changer for researchers and developers alike.

December 11, 2024 at 12:47 AM

vmoens

@vmoens.bsky.social

Tomorrow with Matteo Bettini we'll be presenting BenchMARL at #NeurIPS (@neuripsconf.bsky.social) in #Vancouver

December 11, 2024 at 12:47 AM

vmoens

@vmoens.bsky.social

We believe @PyTorch should have a stronger place in STEM education. With BricksRL, we aim to provide a fun and interactive way to learn about control and Reinforcement Learning, inspiring the next generation of researchers and engineers

December 9, 2024 at 7:19 PM

vmoens

@vmoens.bsky.social

Why Lego? It offers low-cost hardware for learning & experimenting with control or Reinforcement Learning in the real world. Standardized, reproducible, and fun! Plus, the Lego community is active and vibrant, making it a great fit for our project.

December 9, 2024 at 7:19 PM

vmoens

@vmoens.bsky.social

I'm teaming up with Sebastian Ditter & @gdefabritiis.bsky.social to present our paper on BricksRL, a library that enables control of Lego robots using #PyTorch

December 9, 2024 at 7:19 PM

vmoens

@vmoens.bsky.social

I'm excited to be in #Vancouver for #NeurIPS 2024 where I brought a bunch of @LEGO_Group bricks in my bag!

*no, I'm not Santa!

December 9, 2024 at 7:19 PM

vmoens

@vmoens.bsky.social

They've already taken over london!

November 29, 2024 at 1:37 PM

vmoens

@vmoens.bsky.social

define "amusingly"

November 28, 2024 at 9:20 AM

vmoens

@vmoens.bsky.social

You know this annoying thing where you're thinking "I'd like to see the code of this pytorch class!" and you cick on source and it leads you to some weird looking html page?
Fear no more! Now (on the `main` doc) you'll end up right on github!

November 27, 2024 at 8:55 AM

vmoens

@vmoens.bsky.social

Not only that, but you also get a better utilization of your GPU: the total usage with compile and cudagraphs is lower than eager, compile or cudagraphs in isolation, meaning that you could for instance easily train two networks at the same time.

November 22, 2024 at 6:38 AM

vmoens

@vmoens.bsky.social

We get an 6.8x speed-up with PPO (Atari), 5.7x for SAC, 3.4x for TD3 and 2.7x for PPO with continuous actions.

November 22, 2024 at 6:38 AM

vmoens

@vmoens.bsky.social

One of my fav projects: LeanRL, a simple RL library that provides recipes for fast RL training using torch.compile and cudagraphs.
Using these, we got >6x speed-ups compared to the original CleanRL implementations.
github.com/pytorch-labs...

November 22, 2024 at 6:38 AM

vmoens

@vmoens.bsky.social

I stand by my point that any interesting math equation is better understood in code format

November 21, 2024 at 4:03 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news