Lightnews — Scholar-powered news

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

🔊Simplicial Embeddings (SEMs) Improve Sample Efficiency in Actor-Critic Agents🔊

In our recent preprint we demonstrate that the use of well-structured representations (SEMs) can dramatically improve sample efficiency in RL agents.

1/X

October 20, 2025 at 2:07 PM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

This work was led by @johanobandoc.bsky.social and @waltermayor.bsky.social , with @lavoiems.bsky.social , Scott Fujimoto and Aaron Courville.

Read the paper at arxiv.org/abs/2510.13704

11/X

Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents

Recent works have proposed accelerating the wall-clock training time of actor-critic methods via the use of large-scale environment parallelization; unfortunately, these can sometimes still require la...

arxiv.org

October 20, 2025 at 2:07 PM

Reposted by Johan S Obando 👍🏽

lasalaai.bsky.social

@lasalaai.bsky.social

📢 ¡Buenas noticias!
Se extiende el período de inscripciones 🎉
No pierdas esta oportunidad de ser parte de un evento único que puede transformar tu futuro en la inteligencia artificial.
👉 Regístrate ahora y asegura tu lugar.
lasala.ai#top

October 4, 2025 at 3:25 AM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

Thrilled to share our #ICML2025 paper “The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep RL”, led by Jiashun Liu and with other great collaborators!

We teach RL agents when to quit wasting effort, boosting efficiency with our proposed method LEAST.

Here's the story 🧵👇🏾

July 13, 2025 at 12:25 PM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

proud to share a survey of state representation learning in RL that my student ayoub echchahed and i prepared, that was just published on
@tmlrorg.bsky.social !
this was the bulk of ayoub's masters thesis and he put a lot of work and care into it!
a few details in thread below...
1/

June 24, 2025 at 1:30 PM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks

thrilled to share our #ICML2025 paper led by Walter Mayor & @johanobandoc.bsky.social , with Aaron Courville, where we explore how data collection affects agents in parallelized setups.
1/

June 5, 2025 at 2:31 PM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

really excited about this new work we just put out, led by my students @roger-creus.bsky.social & @johanobandoc.bsky.social , where we examine the challenges of gradient propagation when scaling deep RL nets.

roger & johan put in a lot of work and care in this work, check out more details in 🧵👇🏾 !

Roger Creus Castanyer @roger-creus.bsky.social · Jun 23

🚨 Excited to share our new work: "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning"! 📈

We propose gradient interventions that enable stable, scalable learning, unlocking significant performance gains across agents and environments!

Details below 👇

June 23, 2025 at 2:15 PM

Reposted by Johan S Obando 👍🏽

Nenad Tomasev

@nenadtomasev.bsky.social

I'm excited to share a new paper: "Mastering Board Games by External and Internal Planning with Language Models"

storage.googleapis.com/deepmind-med...

(also soon to be up on Arxiv, once it's been processed there)

storage.googleapis.com

December 5, 2024 at 7:49 AM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

Everyone I spoke to at @rl-conference.bsky.social last summer agreed on it being one of the best conferences ever for an RL researcher... So many great RL-focused papers!
CFP is out, send your work here!

Reinforcement Learning Conference @rl-conference.bsky.social · Dec 2

The call for papers for RLC is now up! Abstract deadline of 2/14, submission deadline of 2/21!
Please help us spread the word.
rl-conference.cc/callforpaper...

RLJ | RLC Call for Papers

rl-conference.cc

December 2, 2024 at 4:02 PM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

Post based on a talk I gave earlier this year at the AutoRL workshop in ICML, and leveraging two recent papers (1st with @joaogui1.bsky.social & @johanobandoc.bsky.social , 2nd with @jessefarebro.bsky.social ):

1-hparam transfer openreview.net/forum?id=szU...

2-CALE openreview.net/forum?id=vlU...

December 4, 2024 at 12:07 AM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

📢 In Defense Of Atari 📢

New blog post in which I argue why the ALE is still a valuable resource for RL research!

psc-g.github.io/posts/resear...

The Atari 2600 console and an image of some of the games used in the ALE

December 4, 2024 at 12:07 AM

Reposted by Johan S Obando 👍🏽

Marzieh Fadaee

@mziizm.bsky.social

Good performance shouldn’t mean 'just in English' anymore 🪩

We provide a robust way to assess models with a new benchmark that captures in-language nuances and cultural contexts.

Angelika Romanou @agromanou.bsky.social · Dec 2

🚀 Introducing INCLUDE 🌍: A multilingual LLM evaluation benchmark spanning 44 languages!

Contains *newly-collected* data, prioritizing *regional knowledge*.
Setting the stage for truly global AI evaluation.
Ready to see how your model measures up?
#AI #Multilingual #LLM #NLProc

December 3, 2024 at 12:27 PM

Reposted by Johan S Obando 👍🏽

Pablo Samuel Castro

@pcastr.bsky.social

Last year I gave a talk titled "From 'Bigger, Better, Faster' to 'Smaller, Sparser, Stranger'", which looked at the components that make up our BBF agent (arxiv.org/abs/2305.19452), highlighting some promising areas of research.

Finally in blog form, have a read!
psc-g.github.io/posts/resear...

November 28, 2024 at 1:56 AM

Reposted by Johan S Obando 👍🏽

Nenad Tomasev

@nenadtomasev.bsky.social

While I will be talking about some of our research work, there are also several fun and engaging features for fans to explore blog.google/technology/a... as well as a new Kaggle challenge on developing efficient Chess AI www.kaggle.com/competitions... under resource constraints.

This link will take you to a page that’s not on LinkedIn

lnkd.in

November 25, 2024 at 4:31 PM

Reposted by Johan S Obando 👍🏽

Nenad Tomasev

@nenadtomasev.bsky.social

A great new essay on AI for Science from our colleagues here:

deepmind.google/public-polic...

A new golden age of discovery

In this essay, we take a tour of how AI is transforming scientific disciplines from genomics to computer science to weather forecasting. Some scientists are training their own AI models, while...

deepmind.google

November 26, 2024 at 1:35 PM

Reposted by Johan S Obando 👍🏽

David Abel

@dabelcs.bsky.social

RLDM will be held next year in Dublin!

A reminder that the call for workshops is out: rldm.org/call-for-wor...

The workshops are one of my favourite parts of the conference :) please get in touch if you have any questions!

November 22, 2024 at 9:57 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news