Pranaya Jajoo
pranayajajoo.bsky.social
Reinforcement Learning @ University of Alberta
pranayajajoo.github.io
RL Zero combines video generation models with unsupervised RL to produce zero-shot prompt-to-policy behavior. This can help obtain desired behaviors without complex reward design.
Check out this exciting work:
- arxiv.org/abs/2412.05718
- hari-sikchi.github.io/rlzero/
December 12, 2024 at 5:43 PM