Lightnews — Scholar-powered news

Reposted by Reggie McLean

onion person @junlper.beer · 6d

it’s really good when “technology of the future” treats you leaving as if you were leaving a cult

170 1.5K 7.7K

Reposted by Reggie McLean

It's The Weekend 😌 @craigweekend.bsky.social · 12d

Ladies and gentlemen... the weekend. (also: you are important and are not alone 🧡)

9 270 740

Reposted by Reggie McLean

Claas Voelcker @cvoelcker.bsky.social · 12d

I have been told I need to get more modern in my paper promotion! github.com/cvoelcker/reppo / arxiv.org/abs/2507.11019 @marcelhussing.bsky.social

Happy guy sad guy meme with sad text: USE PPO AND TUNE HYPERPARAMETER FOR WEEKS and happy text: USE REPPO AND GET A POLICY

1 2 10

Reggie McLean @reggiemclean.bsky.social · 13d

I can also vouch for this plan (minus the work visa part) 😅

Reposted by Reggie McLean

Science Magazine @science.org · 14d

“This is an immensely exciting development for the Huntington’s field.” https://scim.ag/4nnW62z

In a first, a gene therapy seems to slow Huntington disease

Small study suggests uniQure drug could be first successful treatment for devastating brain disorder

scim.ag

1 24 93

Reggie McLean @reggiemclean.bsky.social · 14d

a man in a suit and tie is standing in front of a wall with framed pictures on it

ALT: a man in a suit and tie is standing in front of a wall with framed pictures on it

media.tenor.com

Reposted by Reggie McLean

Simon Wheeler @simonwheeler.bsky.social · 14d

This is absolutely awesome. I am so pleased for everyone involved and particularly for people with Huntington's disease who will benefit.
Also, this is the first example I know of where gene therapy might well prevent dementia. Onwards to FTD!
www.independent.co.uk/news/health/...

Huntington’s treatment slows disease for first time giving hope to families

The condition, which currently has no cure, has been slowed in a groundbreaking trial

www.independent.co.uk

3 10

Reposted by Reggie McLean

Marius Schneider @mariusschneider.bsky.social · 16d

🚨Our NeurIPS 2025 competition Mouse vs. AI is LIVE!

We combine a visual navigation task + large-scale mouse neural data to test what makes visual RL agents robust and brain-like.

Top teams: featured at NeurIPS + co-author our summary paper. Join the challenge!

Whitepaper: arxiv.org/abs/2509.14446

Mouse vs. AI: A Neuroethological Benchmark for Visual Robustness and Neural Alignment

Visual robustness under real-world conditions remains a critical bottleneck for modern reinforcement learning agents. In contrast, biological systems such as mice show remarkable resilience to environ...

arxiv.org

3 20 37

Reposted by Reggie McLean

spookillumi @illumi.meme · 16d

me telling my grandkids what it was like to have vaccines

The Simpsons old man sitting on a stump telling a story but all the kids have been replaced with headstones

42 2K 8K

Reposted by Reggie McLean

Robin Z @firepile.bsky.social · 17d

Yes but many of us are women and people of color so it didn't count

𝕍∃, Cyber closed shell syndrome relief fund coordinator @vortexegg.com · 17d

Was this not… what critical AI researchers have been saying this entire time?

Prof Mike Yearworth @mikeyearworth.bsky.social · 17d

"In a landmark study, OpenAI researchers reveal that large language models will always produce plausible but false outputs, even with perfect data, due to fundamental statistical and computational limits."

www.computerworld.com/article/4059...

5 30

Reposted by Reggie McLean

Eugene Vinitsky 🍒 @eugenevinitsky.bsky.social · 18d

Meta-world is still very much the go to for multi task and meta-RL research but it has some real failings that Reggie fixes up here!

Reggie McLean @reggiemclean.bsky.social · 19d

Proud to announce that Meta-World+ was accepted to NeurIPs, Datasets and Benchmarks! Meta-World is a common benchmark for multi-task and meta-RL research! However, it was very difficult to do effective science with Meta-World as different versions produce different results.

1 5

Reposted by Reggie McLean

Pablo Samuel Castro @pcastr.bsky.social · 19d

Very happy this work got into #NeurIPS2025, it was a big effort that Reggie led, and I'm glad it was worth the work!
We've already been using Meta world+ in a lot of our research, and so should you!
Let's chat at #NeurIPS2025 !

Reggie McLean @reggiemclean.bsky.social · 19d

Proud to announce that Meta-World+ was accepted to NeurIPs, Datasets and Benchmarks! Meta-World is a common benchmark for multi-task and meta-RL research! However, it was very difficult to do effective science with Meta-World as different versions produce different results.

1 5

Reggie McLean @reggiemclean.bsky.social · 19d

👀👀👀
bsky.app/profile/regg...

Reggie McLean @reggiemclean.bsky.social · 19d

Proud to announce that Meta-World+ was accepted to NeurIPs, Datasets and Benchmarks! Meta-World is a common benchmark for multi-task and meta-RL research! However, it was very difficult to do effective science with Meta-World as different versions produce different results.

1

Reggie McLean @reggiemclean.bsky.social · 19d

A big shout-out to all of my collaborators on this work!

Paper: arxiv.org/abs/2505.11289
Code: github.com/Farama-Found...

Meta-World+: An Improved, Standardized, RL Benchmark

Meta-World is widely used for evaluating multi-task and meta-reinforcement learning agents, which are challenged to master diverse skills simultaneously. Since its introduction however, there have been numerous undocumented changes which inhibit a fair comparison of algorithms. This work strives to disambiguate these results from the literature, while also leveraging the past versions of Meta-World to provide insights into multi-task and meta-reinforcement learning benchmark design. Through this process we release a new open-source version of Meta-World (https://github.com/Farama-Foundation/Metaworld/) that has full reproducibility of past results, is more technically ergonomic, and gives users more control over the tasks that are included in a task set.

arxiv.org

Reggie McLean @reggiemclean.bsky.social · 19d

To further improve the user experience when using Meta-World+, we streamline and improve the experience of creating environments such that users have both full control of the settings used to create their environments, while also ensuring that environments are created in the correct manner.

1

Reggie McLean @reggiemclean.bsky.social · 19d

To disambiguate the results from the different versions of the benchmark, we implement and re-run experiments across several key multi-task and meta-RL algorithms, on both the V1 and V2 reward functions.

1

Reggie McLean @reggiemclean.bsky.social · 19d

What has changed between V1 and V2…? The reward functions! Here we find that the per-timestep rewards in all tasks have wildly different scales across the V1 and V2 reward functions. This affects the ability of the value function to approximate the state-action value functions (right panel)

1

Reggie McLean @reggiemclean.bsky.social · 19d

We can highlight the performance difference between different versions of Meta-World by running a simple algorithm with both the V1 and V2 versions of Meta-World.

1

Reggie McLean @reggiemclean.bsky.social · 19d

Proud to announce that Meta-World+ was accepted to NeurIPs, Datasets and Benchmarks! Meta-World is a common benchmark for multi-task and meta-RL research! However, it was very difficult to do effective science with Meta-World as different versions produce different results.

1 1 15

Reposted by Reggie McLean

RL & Agents Reading Group @rl-agents-rg.bsky.social · 19d

📢 RL reading group TODAY @ 15:00 BST (in 2 hours!) 📢

Speakers: Olya Mastikhina and Dhruv Sreenivas (University of Montreal & Mila - Quebec AI Institute)

Title: Optimistic critics can empower small actors🦸

Details: edinburgh-rl.github.io/reading-group

2 2

Reposted by Reggie McLean

Marlos C. Machado @marloscmachado.bsky.social · 20d

This paper has now been accepted @neuripsconf.bsky.social !

Huge congratulations, Hon Tik (Rick) Tse and Siddarth Chandrasekar.

Marlos C. Machado @marloscmachado.bsky.social · May 24

📢 I'm happy to share the preprint: _Reward-Aware Proto-Representations in Reinforcement Learning_ ‼️

My PhD student, Hon Tik Tse, led this work, and my MSc student, Siddarth Chandrasekar, assisted us.

arxiv.org/abs/2505.16217

Basically, it's the SR with rewards. See below 👇

3 7

Reposted by Reggie McLean

Alisa Bokulich @alisabokulich.bsky.social · 26d

I haven't used "Academia .edu" for 15 yrs (only tried it when it 1st came out) but they just tried charge me ~$400 automatic on a card w/o any notification or me agreeing to renew/reactivate anything. This is such a predatory scam & they really shouldn't be allowed an ".edu" too Watch out for them

7 14 60

Reposted by Reggie McLean

Amy J. Ko @amyko.phd · 21d

This is the experiment we're trying next: reciprocal.reviews. Earn tokens for reviews by writing adequate quality reviews, spend them on submissions. Earn the privilege to publish. We're at 70% on the beta, and should launch an ACM TOCE pilot next year some time.

reciprocal.reviews

1 2 9

Reposted by Reggie McLean

Chris Hayes @chrislhayes.bsky.social · Jul 18

Not really an overstatement to say that the test of a free society is whether or not comedians can make fun of the country's leader on TV without repurcussions.

1.5K 19K 74K