Alizée Pace
@alizeepace.bsky.social
1.3K followers
150 following
5 posts
Gemini Post-Training @ Google DeepMind
Previously: ETH Zurich, Cambridge, CERN
alizeepace.com
Posts
Alizée Pace
@alizeepace.bsky.social
· Apr 26
Preference Elicitation for Offline Reinforcement Learning
Applying reinforcement learning (RL) to real-world problems is often made challenging by the inability to interact with the environment and the difficulty of designing reward functions. Offline RL add...
arxiv.org
Alizée Pace
@alizeepace.bsky.social
· Feb 13
Preference Elicitation for Offline Reinforcement Learning
Applying reinforcement learning (RL) to real-world problems is often made challenging by the inability to interact with the environment and the difficulty of designing reward functions. Offline RL...
arxiv.org