In our recent preprint we demonstrate that the use of well-structured representations (SEMs) can dramatically improve sample efficiency in RL agents.
1/X
Read the paper at arxiv.org/abs/2510.13704
11/X
The registration period has been extended 🎉
Don't miss this opportunity to be part of a unique event that could transform your future in artificial intelligence.
👉 Register now and secure your spot.
lasala.ai#top
We teach RL agents when to quit wasting effort, boosting efficiency with our proposed method LEAST.
Here's the story 🧵👇🏾
@tmlrorg.bsky.social !
this was the bulk of ayoub's masters thesis and he put a lot of work and care into it!
a few details in thread below...
1/
thrilled to share our #ICML2025 paper led by Walter Mayor & @johanobandoc.bsky.social , with Aaron Courville, where we explore how data collection affects agents in parallelized setups.
1/
roger & johan put a lot of work and care into this, check out more details in 🧵👇🏾 !
We propose gradient interventions that enable stable, scalable learning, unlocking significant performance gains across agents and environments!
Details below 👇
storage.googleapis.com/deepmind-med...
(also soon to be up on arXiv, once it's been processed there)
CFP is out, send your work here!
Please help us spread the word.
rl-conference.cc/callforpaper...
1-hparam transfer openreview.net/forum?id=szU...
2-CALE openreview.net/forum?id=vlU...
New blog post in which I argue why the ALE is still a valuable resource for RL research!
psc-g.github.io/posts/resear...
We provide a robust way to assess models with a new benchmark that captures in-language nuances and cultural contexts.
Contains *newly-collected* data, prioritizing *regional knowledge*.
Setting the stage for truly global AI evaluation.
Ready to see how your model measures up?
#AI #Multilingual #LLM #NLProc
Finally in blog form, have a read!
psc-g.github.io/posts/resear...
A reminder that the call for workshops is out: rldm.org/call-for-wor...
The workshops are one of my favourite parts of the conference :) please get in touch if you have any questions!