Open-source/open science advocate
Maintainer of torchrl / tensordict / leanrl
Former MD - Neuroscience PhD
https://github.com/vmoens
MLGym relies on a gym environment that wraps a docker image. Each env has a task specified as a YAML file, telling in plain english what you want your LLM to achieve
👇
MLGym relies on a gym environment that wraps a docker image. Each env has a task specified as a YAML file, telling in plain english what you want your LLM to achieve
👇
What does this `Human-computer` sticker seen at neurips hide?
What does this `Human-computer` sticker seen at neurips hide?
It's a one-of-its-kind unsupervised RL project, and it comes with a demo that is SO fun to play with!
metamotivo.metademolab.com
(for the record, they use compile and cudagraphs -> github.com/facebookrese...)
It's a one-of-its-kind unsupervised RL project, and it comes with a demo that is SO fun to play with!
metamotivo.metademolab.com
(for the record, they use compile and cudagraphs -> github.com/facebookrese...)
BenchMARL is a cutting-edge training library designed to bring standardized benchmarking to the world of Multi-Agent Reinforcement Learning (MARL). It allows for easy comparison across different algorithms, models, and environments, making it a game-changer for researchers and developers alike.
BenchMARL is a cutting-edge training library designed to bring standardized benchmarking to the world of Multi-Agent Reinforcement Learning (MARL). It allows for easy comparison across different algorithms, models, and environments, making it a game-changer for researchers and developers alike.
*no, I'm not Santa!
*no, I'm not Santa!
Fear no more! Now (on the `main` doc) you'll end up right on github!
Fear no more! Now (on the `main` doc) you'll end up right on github!
Using these, we got >6x speed-ups compared to the original CleanRL implementations.
github.com/pytorch-labs...
Using these, we got >6x speed-ups compared to the original CleanRL implementations.
github.com/pytorch-labs...