Website: alessiorusso.net
With Daniele Foffano & Alexandre Proutiere. Happy to meet in SD!
Paper:
It's a Dyna-style loop: collect rollouts; train a diffusion model; adversarially guide sampling to produce worst-case trajectories; train the RL agent on this data; iterate.
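To make the shape of that loop concrete, here is a toy sketch in Python. It is not the paper's implementation, and everything in it is an assumption made for illustration: a fitted Gaussian stands in for the diffusion model, "adversarial guidance" is approximated by keeping the lowest-reward samples under the current policy, and the "agent" is a single scalar `theta` with reward `-(theta - w)**2`. All function names are hypothetical.

```python
import random
import statistics


def collect_rollouts(theta, n, rng):
    # Toy "real environment": an unknown disturbance w ~ N(1.0, 0.5).
    # theta is accepted for symmetry with a real rollout, but the toy
    # dynamics do not depend on the policy.
    return [rng.gauss(1.0, 0.5) for _ in range(n)]


def fit_model(data):
    # Stand-in for training a diffusion model: fit a Gaussian to the data.
    return statistics.fmean(data), statistics.stdev(data)


def sample_worst_case(model, theta, n, keep_frac, rng):
    # Stand-in for adversarially guided sampling: draw candidates from the
    # model and keep the fraction with the lowest reward for the current theta.
    mu, sigma = model
    candidates = [rng.gauss(mu, sigma) for _ in range(n)]
    candidates.sort(key=lambda w: -(theta - w) ** 2)  # lowest reward first
    return candidates[: max(1, int(keep_frac * n))]


def train_agent(theta, batch, lr):
    # One gradient step on the mean squared loss over the synthetic batch.
    grad = statistics.fmean(2 * (theta - w) for w in batch)
    return theta - lr * grad


def dyna_loop(iterations=20, seed=0):
    rng = random.Random(seed)
    theta, data, model = 0.0, [], None
    for _ in range(iterations):
        data += collect_rollouts(theta, 50, rng)                 # collect rollouts
        model = fit_model(data)                                  # train the model
        batch = sample_worst_case(model, theta, 200, 0.2, rng)   # worst-case samples
        theta = train_agent(theta, batch, lr=0.1)                # train the agent
    return theta, model


if __name__ == "__main__":
    theta, (mu, sigma) = dyna_loop()
    print(f"theta={theta:.3f}, model mean={mu:.3f}, model std={sigma:.3f}")
```

The point is only the order of the four steps inside the `for` loop; each stand-in would be swapped for the real component.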
What we lack is a form of accountability. It is irresponsible not to hold reviewers accountable for poor-quality reviews that contain wrong or false statements.