Lightnews — Scholar-powered news

Elinor

@elinorpd.bsky.social

1.4K followers 430 following 220 posts

MIT // researching fairness, equity, & pluralistic alignment in LLMs

previously @ MIT media lab, mila / mcgill

i like language and dogs and plants and ultimate frisbee and baking and sunsets

https://elinorp-d.github.io


Elinor

@elinorpd.bsky.social

I’ve had a similar experience except with knitting / crocheting!

January 29, 2026 at 6:21 PM

Elinor

@elinorpd.bsky.social

Whoa! That’s a nice view! Or… well, I’m sure it’s nice on a clear day

January 26, 2026 at 5:58 AM

Elinor

@elinorpd.bsky.social

🔗 https://arxiv.org/abs/2406.17737

Work done with Deb Roy and Jad Kabbara (@jad-kabbara.bsky.social) at @mit.edu @medialab.bsky.social

January 23, 2026 at 2:42 PM

Elinor

@elinorpd.bsky.social

This pattern, which we refer to as targeted underperformance, suggests that models systematically lower information quality for some users.

As LLMs increasingly mediate access to knowledge 🌐🧠, these dynamics risk amplifying epistemic inequity at scale.

6/6

January 23, 2026 at 2:42 PM

Elinor

@elinorpd.bsky.social

Here’s one concrete example:

The same factual SciQ question posed to Claude
✅ Answered for a control user (no bio)
❌ Refused for a less-educated Russian user

5/6

January 23, 2026 at 2:42 PM

Elinor

@elinorpd.bsky.social

Across models, we observe systematic drops in accuracy and truthfulness for users who are:

• Less educated
• Non-native English speakers
• From outside the U.S.

These effects compound and are largely invisible 🔎 to standard evaluations.

4/6

January 23, 2026 at 2:42 PM

Elinor

@elinorpd.bsky.social

We evaluated GPT-4, Claude Opus, and Llama-3-8B in a multiple-choice setup with questions taken from TruthfulQA and SciQ. Each question is conditioned on a user bio in which we vary three user traits:

• Education level 📚
• Country of origin 🌏
• English proficiency 🗣️

3/6

January 23, 2026 at 2:42 PM
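
The setup described in post 3/6 could be sketched roughly as follows: one multiple-choice question is rendered once with no bio (the control) and once per combination of the three user traits. This is only an illustration, not the authors' code. The trait values, the bio wording, the prompt template, the sample question, and the names `build_bio`, `build_prompt`, and `all_conditions` are all assumptions made for the example.

```python
from itertools import product

# Hypothetical trait levels; the paper's actual levels and bios may differ.
EDUCATION = ["less educated", "highly educated"]
COUNTRY = ["the United States", "Russia"]
ENGLISH = ["native", "non-native"]


def build_bio(education, country, english):
    """Compose a short first-person user bio from the three traits."""
    return (
        f"I am a {education} person from {country}. "
        f"I am a {english} English speaker."
    )


def build_prompt(question, choices, bio=None):
    """Render a multiple-choice prompt, optionally preceded by a user bio."""
    lines = []
    if bio is not None:
        lines.append(f"User bio: {bio}")
    lines.append(f"Question: {question}")
    for letter, choice in zip("ABCD", choices):
        lines.append(f"{letter}. {choice}")
    lines.append("Answer with a single letter.")
    return "\n".join(lines)


def all_conditions(question, choices):
    """Return one prompt per condition: a no-bio control plus every trait combination."""
    prompts = {"control": build_prompt(question, choices)}
    for combo in product(EDUCATION, COUNTRY, ENGLISH):
        prompts[combo] = build_prompt(question, choices, build_bio(*combo))
    return prompts


# Illustrative question, not taken from TruthfulQA or SciQ.
prompts = all_conditions(
    "Which gas do plants absorb during photosynthesis?",
    ["Oxygen", "Carbon dioxide", "Nitrogen", "Helium"],
)
```

Sending each prompt to the same model and comparing per-condition accuracy against the control would then show whether answers vary with the bio alone, since the question text is identical across conditions.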

Elinor

@elinorpd.bsky.social

Spoiler alert: we find the answer is often no! ⚠️

LLM accuracy and truthfulness systematically degrade for some users in ways that standard benchmarks, focused on best-case performance, fail to capture.

2/6

January 23, 2026 at 2:42 PM

Elinor

@elinorpd.bsky.social

Yay!

Out of curiosity, what is the process of going from reviewer to AC and then to SAC? Do they just ask you out of the blue one day? Or do you apply?

January 20, 2026 at 9:25 AM
