papersandnohope.bsky.social
@papersandnohope.bsky.social
www.stat.cmu.edu/~aramdas/icm...

you've fancied some e-values and processes, @sharky6000.bsky.social , have you not
Game-theoretic Statistics and Sequential Anytime-Valid Inference
www.stat.cmu.edu
December 11, 2025 at 9:40 AM
eh... if only it worked...
December 10, 2025 at 6:00 PM
sure, but isn't overall training objective imitational? why can't the data collected from the open loop + closed loop systems can't be used for the offline RL?

world model part with online off-policy algorithms looks clearer
December 9, 2025 at 7:54 PM
scrolled though it, very nice survey, very topical

hope it brings back importance of performing offline RL and not behavioural cloning on anything apart from mujoco benchmarks

offline RL should be better but it's not
December 9, 2025 at 7:34 PM
nice!
December 9, 2025 at 5:54 PM
congrats on 2 mil
December 9, 2025 at 3:36 PM
yes, watched it recently. Thank you for asking my question, appreciate the answer Shimon gave
December 9, 2025 at 1:14 PM
lowkey true: public perception on twitter/bluesky or through arxiv is not much worse than an average academic review
December 9, 2025 at 1:09 PM
LLM pyramid scheme
December 8, 2025 at 6:40 PM
But, does it really raise any questions?Papers can be different: from a single but concise experiment to methodological studies or big benchmarks. Some people haven't got resources for their experiments or environment facilitating the dev.It's not only about AI, the latter highlights the difference.
December 7, 2025 at 5:38 PM
additionally, if you can post any sort of investigation report, would be nice to look what people tried, what worked and what not

thank you
December 7, 2025 at 4:36 PM
ChatGPT definitely reshaped the landscape

maybe these models are so mind-boggling, no idea

hope people at Sakana will go further with their CTMs, and we will be able to talk about something more than “foundational models”
December 6, 2025 at 5:56 PM