Lightnews — Scholar-powered news

Alexis Carrillo

@yagwar.bsky.social

22 followers 87 following 25 posts

Machine Learning and Psychology in Intelligence Research

Posts Replies Media Videos

Pinned

Alexis Carrillo @yagwar.bsky.social · Dec 5

Can transformer-based models replicate the human ability to form stimulus equivalence?

Testing Stimulus Equivalence in Transformer-Based Agents.
www.mdpi.com/1999-5903/16...

Testing Stimulus Equivalence in Transformer-Based Agents

This study investigates the ability of transformer-based models (TBMs) to form stimulus equivalence (SE) classes. We employ BERT and GPT as TBM agents in SE tasks, evaluating their performance across ...

www.mdpi.com

Alexis Carrillo

@yagwar.bsky.social

Stimulus equivalence is a behavioural phenomenon in which participants demonstrate responding on contingencies not explicitly trained. Those stimuli are related in a equivalence class.
It's a special form o generalization where stimulus can control other contingencies without training.

May 18, 2025 at 8:20 PM

Reposted by Alexis Carrillo

MIT Press

@mitpress.bsky.social

Congratulations to Andrew Barto & Richard Sutton, who have won the 2024 ACM A.M. Turing Award for developing the conceptual & algorithmic foundations of reinforcement learning. An #openaccess edition of their notable text, "Reinforcement Learning," may be found here: mitpress.mit.edu/978026203924...

March 11, 2025 at 1:16 PM

Reposted by Alexis Carrillo

Nathan Lambert

@natolambert.bsky.social

Trying to tell the story behind this explosion of research we are in. An unexpected RL Renaissance.
New talk! Forecasting the Alpaca moment for reasoning models and why the new style of RL training is a far bigger deal than the emergence of RLHF.
YouTube: https://buff.ly/41bVRPp

An unexpected RL Renaissance

New talk! Forecasting the Alpaca moment for reasoning models and why the new style of RL training is a far bigger deal than the emergence of RLHF.

www.interconnects.ai

February 13, 2025 at 3:42 PM

Reposted by Alexis Carrillo

Lucas Alegre

@lnalegre.bsky.social

I am happy to announce that I successfully defended my PhD, entitled “Sample-Efficieny Multi-Task and Multi-Objective Reinforcement Learning by Combining Multiple Behaviors”! 🎉

These last years have been extremely fun, and I am very lucky to have collaborated with and met so many great people😄

February 16, 2025 at 12:51 AM

Reposted by Alexis Carrillo

Willem Röpke

@willemropke.bsky.social

Exciting news! My paper on multi-objective reinforcement learning was accepted at AAMAS 2025!

We introduce IPRO (Iterated Pareto Referent Optimisation)—a principled approach to solving multi-objective problems.

🔗 Paper: arxiv.org/abs/2402.07182
💻 Code: github.com/wilrop/ipro

February 17, 2025 at 1:22 PM

Reposted by Alexis Carrillo

JmRoyle #LFC #YNWA #BLM #REJOINEU

@myarrse.bsky.social

Should be in every newsagents window.

December 17, 2024 at 10:38 PM

Reposted by Alexis Carrillo

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

An updated intro to reinforcement learning by Kevin Murphy: arxiv.org/abs/2412.05265! Like their books, it covers a lot and is quite up to date with modern approaches. It also is pretty unique in coverage, I don't think a lot of this is synthesized anywhere else yet

Reinforcement Learning: An Overview

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based met...

arxiv.org

December 9, 2024 at 2:27 PM

Alexis Carrillo

@yagwar.bsky.social

Can transformer-based models replicate the human ability to form stimulus equivalence?

Testing Stimulus Equivalence in Transformer-Based Agents.
www.mdpi.com/1999-5903/16...

Testing Stimulus Equivalence in Transformer-Based Agents

www.mdpi.com

December 5, 2024 at 4:48 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news