Augustine Mavor-Parker
mavorparker.bsky.social
Augustine Mavor-Parker
@mavorparker.bsky.social
Reinforcement Learning PhD Student at UCL
I am working on something new at Vmax. We are building agents that leverage the inherent structure in large company datasets to infer trajectories of states, actions and rewards. With this data, we are building multistep RL agents to carry out long-horizon tasks. See our preview!

vmax-ai.com
December 14, 2024 at 7:26 PM