you've fancied some e-values and processes, @sharky6000.bsky.social, have you not?
world model part with online off-policy algorithms looks clearer
hope it brings back the importance of performing offline RL, and not behavioural cloning, on anything apart from MuJoCo benchmarks
offline RL should be better but it's not
thank you
maybe these models are so mind-boggling, no idea
hope people at Sakana will go further with their CTMs, and we will be able to talk about something more than “foundational models”