Johan S Obando 👍🏽
johanobandoc.bsky.social
Johan S Obando 👍🏽
@johanobandoc.bsky.social
Graduate student @Mila_Quebec @UMontrealDIRO | RL/Deep Learning/AI | De Cali/Colombia pal’ Mundo 🇨🇴 | #JuntosProsperamos⚡#TogetherWeThrive| 🌱🌎
Reposted by Johan S Obando 👍🏽
🔊Simplicial Embeddings (SEMs) Improve Sample Efficiency in Actor-Critic Agents🔊

In our recent preprint we demonstrate that the use of well-structured representations (SEMs) can dramatically improve sample efficiency in RL agents.

1/X
October 20, 2025 at 2:07 PM
Reposted by Johan S Obando 👍🏽
📢 ¡Buenas noticias!
Se extiende el período de inscripciones 🎉
No pierdas esta oportunidad de ser parte de un evento único que puede transformar tu futuro en la inteligencia artificial.
👉 Regístrate ahora y asegura tu lugar.
lasala.ai#top
October 4, 2025 at 3:25 AM
Reposted by Johan S Obando 👍🏽
Thrilled to share our #ICML2025 paper “The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep RL”, led by Jiashun Liu and with other great collaborators!

We teach RL agents when to quit wasting effort, boosting efficiency with our proposed method LEAST.

Here's the story 🧵👇🏾
July 13, 2025 at 12:25 PM
Reposted by Johan S Obando 👍🏽
proud to share a survey of state representation learning in RL that my student ayoub echchahed and i prepared, that was just published on
@tmlrorg.bsky.social !
this was the bulk of ayoub's masters thesis and he put a lot of work and care into it!
a few details in thread below...
1/
June 24, 2025 at 1:30 PM
Reposted by Johan S Obando 👍🏽
The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks

thrilled to share our #ICML2025 paper led by Walter Mayor & @johanobandoc.bsky.social , with Aaron Courville, where we explore how data collection affects agents in parallelized setups.
1/
June 5, 2025 at 2:31 PM
Reposted by Johan S Obando 👍🏽
really excited about this new work we just put out, led by my students @roger-creus.bsky.social & @johanobandoc.bsky.social , where we examine the challenges of gradient propagation when scaling deep RL nets.

roger & johan put in a lot of work and care in this work, check out more details in 🧵👇🏾 !
🚨 Excited to share our new work: "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning"! 📈

We propose gradient interventions that enable stable, scalable learning, unlocking significant performance gains across agents and environments!

Details below 👇
June 23, 2025 at 2:15 PM
Reposted by Johan S Obando 👍🏽
I'm excited to share a new paper: "Mastering Board Games by External and Internal Planning with Language Models"

storage.googleapis.com/deepmind-med...

(also soon to be up on Arxiv, once it's been processed there)
storage.googleapis.com
December 5, 2024 at 7:49 AM
Reposted by Johan S Obando 👍🏽
Everyone I spoke to at @rl-conference.bsky.social last summer agreed on it being one of the best conferences ever for an RL researcher... So many great RL-focused papers!
CFP is out, send your work here!
The call for papers for RLC is now up! Abstract deadline of 2/14, submission deadline of 2/21!
Please help us spread the word.
rl-conference.cc/callforpaper...
RLJ | RLC Call for Papers
rl-conference.cc
December 2, 2024 at 4:02 PM
Reposted by Johan S Obando 👍🏽
Post based on a talk I gave earlier this year at the AutoRL workshop in ICML, and leveraging two recent papers (1st with @joaogui1.bsky.social & @johanobandoc.bsky.social , 2nd with @jessefarebro.bsky.social ):

1-hparam transfer openreview.net/forum?id=szU...

2-CALE openreview.net/forum?id=vlU...
December 4, 2024 at 12:07 AM
Reposted by Johan S Obando 👍🏽
📢 In Defense Of Atari 📢

New blog post in which I argue why the ALE is still a valuable resource for RL research!

psc-g.github.io/posts/resear...
December 4, 2024 at 12:07 AM
Reposted by Johan S Obando 👍🏽
Good performance shouldn’t mean 'just in English' anymore 🪩

We provide a robust way to assess models with a new benchmark that captures in-language nuances and cultural contexts.
🚀 Introducing INCLUDE 🌍: A multilingual LLM evaluation benchmark spanning 44 languages!

Contains *newly-collected* data, prioritizing *regional knowledge*.
Setting the stage for truly global AI evaluation.
Ready to see how your model measures up?
#AI #Multilingual #LLM #NLProc
December 3, 2024 at 12:27 PM
Reposted by Johan S Obando 👍🏽
Last year I gave a talk titled "From 'Bigger, Better, Faster' to 'Smaller, Sparser, Stranger'", which looked at the components that make up our BBF agent (arxiv.org/abs/2305.19452), highlighting some promising areas of research.

Finally in blog form, have a read!
psc-g.github.io/posts/resear...
November 28, 2024 at 1:56 AM
Reposted by Johan S Obando 👍🏽
While I will be talking about some of our research work, there are also several fun and engaging features for fans to explore blog.google/technology/a... as well as a new Kaggle challenge on developing efficient Chess AI www.kaggle.com/competitions... under resource constraints.
LinkedIn
This link will take you to a page that’s not on LinkedIn
lnkd.in
November 25, 2024 at 4:31 PM
Reposted by Johan S Obando 👍🏽
Reposted by Johan S Obando 👍🏽
RLDM will be held next year in Dublin!

A reminder that the call for workshops is out: rldm.org/call-for-wor...

The workshops are one of my favourite parts of the conference :) please get in touch if you have any questions!
November 22, 2024 at 9:57 AM