Stefano Palminteri
@stepalminteri.bsky.social
1.3K followers 440 following 130 posts
Computational cognitive scientist interested in learning and decision-making in human and machiches Research director of the Human Reinforcement Learning team Ecole Normale Supérieure (ENS) Institut National de la Santé et Recherche Médicale (INSERM)
Posts Media Videos Starter Packs
Pinned
stepalminteri.bsky.social
New paper our in @pnas.org, lead by @isabellehoxha.bsky.social with Léo Sperber. We use evolutionary simulation to assess and compare the adaptive value of positivity bias and gradual perseveration in reinforcement learning. Follow the thread below (and Isabelle!) for more details!
isabellehoxha.bsky.social
Ever wondered why you keep going to that restaurant with stale fries? Is it because you went often in the past (perseveration) or because you remember past good experiences better (positivity bias)? Our study out in PNAS investigates the normative basis for these biases www.pnas.org/doi/10.1073/...
Evolving choice hysteresis in reinforcement learning: Comparing the adaptive value of positivity bias and gradual perseveration | PNAS
The tendency to repeat past choices more often than expected from the history of outcomes has been repeatedly empirically observed in reinforcement...
www.pnas.org
Reposted by Stefano Palminteri
stepalminteri.bsky.social
New (revised) preprint with @thecharleywu.bsky.social
We rethink how to assess machine consciousness: not by code or circuitry, but by behavioral inference—as in cognitive science.
Extraordinary claims still need extraordinary evidence.
👉 osf.io/preprints/ps...
#AI #Consciousness #LLM
stepalminteri.bsky.social
New (revised) preprint with @thecharleywu.bsky.social
We rethink how to assess machine consciousness: not by code or circuitry, but by behavioral inference—as in cognitive science.
Extraordinary claims still need extraordinary evidence.
👉 osf.io/preprints/ps...
#AI #Consciousness #LLM
Reposted by Stefano Palminteri
stepalminteri.bsky.social
This book by @anilananth.bsky.social is great — perfect for those, like me, who have an intuitive and geometric grasp of math but unfortunately no formal training. Highly recommended!
stepalminteri.bsky.social
This book by @anilananth.bsky.social is great — perfect for those, like me, who have an intuitive and geometric grasp of math but unfortunately no formal training. Highly recommended!
Reposted by Stefano Palminteri
stepalminteri.bsky.social
Preprint alert! Navigating Inflationary and Deflationary Claims Concerning Large Language Models Avoiding Cognitive Biases.
Very fun and efficient collaboration with @giadapistilli.com
To help cognitively bounded humans balancing hype and dismall of LLMs capabilities
osf.io/preprints/ps...
OSF
osf.io
stepalminteri.bsky.social
Check out @bcdavidson.bsky.social's preprint (w/ @georgiaturner.bsky.social @orbenamy.bsky.social @livia-tomova.bsky.social and co.) about the (computational) consequences of social isolation in social media use during covid!
bcdavidson.bsky.social
🚨 New Preprint 🚨

Prolonged Isolation is associated with an increased behavioural sensitivity to ‘Likes’ on social media.

🧵

Social media rewards are inherently social—but does posting change during social isolation, when in-person social rewards are limited?

It turns out, yes!
stepalminteri.bsky.social
Braitenberg's Vehicles arrived yesterday and I'm already halfway through it. An amazingly funny, clear, and lucid treatment of the question of attributing higher cognitive functions to artificial systems. Obviously very timely for current debates in AI
stepalminteri.bsky.social
This is the link to the previous study that served to the bases for our recent @pnas.org study on the optimality of choice-confirmation bias and perseveration.

"Choice-Confirmation Bias and Gradual Perseveration in Human Reinforcement Learning"

Open here:
www.researchgate.net/publication/...
Reposted by Stefano Palminteri
sjblakemore.bsky.social
New paper! By @livia-tomova.bsky.social,
@emilyanntowner.bsky.social, Kirsten Thomas,
@stepalminteri.bsky.social, @l32zhang.bsky.social

Acute isolation is associated with increased reward seeking and reward learning in human adolescents.

www.nature.com/articles/s44...
Reposted by Stefano Palminteri
stepalminteri.bsky.social
New paper our in @pnas.org, lead by @isabellehoxha.bsky.social with Léo Sperber. We use evolutionary simulation to assess and compare the adaptive value of positivity bias and gradual perseveration in reinforcement learning. Follow the thread below (and Isabelle!) for more details!
isabellehoxha.bsky.social
Ever wondered why you keep going to that restaurant with stale fries? Is it because you went often in the past (perseveration) or because you remember past good experiences better (positivity bias)? Our study out in PNAS investigates the normative basis for these biases www.pnas.org/doi/10.1073/...
Evolving choice hysteresis in reinforcement learning: Comparing the adaptive value of positivity bias and gradual perseveration | PNAS
The tendency to repeat past choices more often than expected from the history of outcomes has been repeatedly empirically observed in reinforcement...
www.pnas.org
stepalminteri.bsky.social
New paper our in @pnas.org, lead by @isabellehoxha.bsky.social with Léo Sperber. We use evolutionary simulation to assess and compare the adaptive value of positivity bias and gradual perseveration in reinforcement learning. Follow the thread below (and Isabelle!) for more details!
isabellehoxha.bsky.social
Ever wondered why you keep going to that restaurant with stale fries? Is it because you went often in the past (perseveration) or because you remember past good experiences better (positivity bias)? Our study out in PNAS investigates the normative basis for these biases www.pnas.org/doi/10.1073/...
Evolving choice hysteresis in reinforcement learning: Comparing the adaptive value of positivity bias and gradual perseveration | PNAS
The tendency to repeat past choices more often than expected from the history of outcomes has been repeatedly empirically observed in reinforcement...
www.pnas.org
Reposted by Stefano Palminteri
isabellehoxha.bsky.social
Ever wondered why you keep going to that restaurant with stale fries? Is it because you went often in the past (perseveration) or because you remember past good experiences better (positivity bias)? Our study out in PNAS investigates the normative basis for these biases www.pnas.org/doi/10.1073/...
Evolving choice hysteresis in reinforcement learning: Comparing the adaptive value of positivity bias and gradual perseveration | PNAS
The tendency to repeat past choices more often than expected from the history of outcomes has been repeatedly empirically observed in reinforcement...
www.pnas.org
stepalminteri.bsky.social
After 6 wonderful years together, it’s time to say goodbye. Farewell to Magdalena Soukupova – brilliant scientist, pillar of the HRL team, and amazing human being (sic.). Whatever lab you join next will be very lucky to have you
Reposted by Stefano Palminteri
stepalminteri.bsky.social
🚨 New paper out in Mind & Society!
Human reinforcement learning processes and biases: computational characterization and possible applications to behavioral public policy
🔗 link.springer.com/article/10.1...
stepalminteri.bsky.social
The takeaway:

RL offers a powerful, generalizable way to teach, shape, and sustain behavior.

Its biases are not just flaws — they may be adaptations we can harness.

We should give RL a bigger seat at the policy table.
stepalminteri.bsky.social
5/
🏛 Part 3 – RL in public policy
Despite being central in education, therapy & even marketing, RL is oddly underused in behavioral public policy compared to “nudges” or “boosts.”
We argue history & misconceptions are partly to blame.
stepalminteri.bsky.social
🎯 Part 2 – Reinforcement learning in depth
From the basics of action–outcome learning to the fine details of biases like:

Relative valuation (context-dependent outcome encoding)

Positivity bias (learning more from good than bad news)
stepalminteri.bsky.social
🧠 Part 1 – What’s a cognitive bias?
We propose a computational, value-free definition of bias — not as “errors” but as systematic deviations between reality and internal representation, which can sometimes help decision-making.
We also propose a Taxonomy for biases with the RL framework
stepalminteri.bsky.social
This paper is a bit of a chimera 🧬 — combining three strands that rarely meet in one place:

A general, operational presentation of bias

A detailed look at reinforcement learning (RL) and its biases at the computational level

A historical perspective on RL in behavioral public policy
stepalminteri.bsky.social
🚨 New paper out in Mind & Society!
Human reinforcement learning processes and biases: computational characterization and possible applications to behavioral public policy
🔗 link.springer.com/article/10.1...
stepalminteri.bsky.social
if you are already at #CCN2025 go check @nicolasyax.bsky.social work where we try to teach meta-cognition to LLMs in a very cool satellite symposium.
luciecharlesneuro.bsky.social
Super excited to kickstart our Metacognitive science satellite meeting, just before the CCN conference in Amsterdam! Organised with @dobyrahnev.bsky.social @meganakpeters.bsky.social and @stvemillertime.bsky.social