Elinor🎗️ @ COLM 🍁
@elinorpd.bsky.social
1.3K followers 400 following 180 posts
MIT // researching fairness, equity, & pluralistic alignment in LLMs previously @ MIT media lab, mila / mcgill i like language and dogs and plants and ultimate frisbee and baking and sunsets https://elinorp-d.github.io
Posts Media Videos Starter Packs
Reposted by Elinor🎗️ @ COLM 🍁
dmimno.bsky.social
COLM word cloud. Yoav says it’s the year of reasoning, but evaluation is also huge.
Evaluation reasoning interpretability rl in context benchmark alignment synthetic data
elinorpd.bsky.social
I’m at #COLM2025! Would love to chat about anything related to pluralistic alignment, fairness evaluations, societal impacts of LLMs, etc 😊

You can also find me at the NLP4Democracy workshop giving a talk about my work analyzing democratic deliberation with LLMs Oct 10th!
Reposted by Elinor🎗️ @ COLM 🍁
noscoredraws.com
Alright the evening sky, you’re utterly wondrous and fantastical, we get it, geez
Reposted by Elinor🎗️ @ COLM 🍁
mariaa.bsky.social
Here’s a #COLM2025 feed!

Pin it 📌 to follow along with the conference this week!
Reposted by Elinor🎗️ @ COLM 🍁
jmendelsohn2.bsky.social
I will be at #COLM2025 this week, and would love to connect with folks interested in applications (and critiques) of language modeling in social science research!

And join us for the NLP4Democracy workshop on Friday!

sites.google.com/andrew.cmu.e...

#NLP #NLProc #LLM #ComputationalSocialScience
NLP 4 Democracy - COLM 2025
sites.google.com
Reposted by Elinor🎗️ @ COLM 🍁
Reposted by Elinor🎗️ @ COLM 🍁
nsaphra.bsky.social
I wish students understood in most empirical AI research there’s a huge scientific advantage from being constitutionally excited by math vs intimidated, but very little additional gain from being actually “good” at math. Maybe they’d be less intimidated if they didn’t feel they had to be “good”.
elinorpd.bsky.social
Beyond research, this paves the way for:
✨ Tools supporting live assemblies in real time
✨ Increasing transparency & communicating critical insights to decision-makers
✨ Enabling richer cross-assembly analysis to advance research on deliberative best practices
elinorpd.bsky.social
In the tech-enhanced assembly, our framework revealed:
🔹 How deliberation surfaced, refined, or discarded ideas
🔹 *Missing* viable ideas
🔹 How opinion shifts & rec edits shaped outcomes
🔹 Underlying values & trade-offs invisible to decision-makers
elinorpd.bsky.social
We develop an LLM-based framework to:
✅ Map how suggestions transform into concrete recommendations
✅ Reconstruct individuals’ evolving perspectives
✅ Detect why votes shift across deliberation
elinorpd.bsky.social
Despite their promise, we still lack tools to empirically trace:
• how ideas evolve into recommendations
• how deliberation shapes perspectives & votes

At MIT CCC, we hosted our own tech-enhanced assembly to explore how AI can help!

sustainabilityassembly.portal.cortico.ai
Loading...
sustainabilityassembly.portal.cortico.ai
elinorpd.bsky.social
Deliberative assemblies bring together everyday citizens selected by lottery. Through deliberation 💬 & learning, they collectively form policy recommendations 💡for decision-makers.

They’ve proven successful worldwide, facilitating rebuilding trust & strengthening democracy 🤝.
elinorpd.bsky.social
"This suggests that LLM benchmark behavior may generalize less and less to non-benchmark settings, raising new concerns about ecological validity."

super interesting
bufangao.bsky.social
🚨 New #EMNLP2025 paper!

Do LLMs exhibit distinct behavior when the prompt looks similar to common evaluation prompts? 👀

We show that prompts that signal bias evaluation can flip the measured bias. See below ⬇️
Violin plots of Probability of Pronoun Shift. Models show significant sensitivity to prompt changes: when prompts highlight gender evaluation, pronoun use shifts, with decreased “he” and increased “they” use.
Reposted by Elinor🎗️ @ COLM 🍁
elinorpd.bsky.social
Thread inspired by having to review 6(!!) papers for AAAI and most of them having no line numbers. And one particularly great paper I want to show the authors exactly how much I enjoyed it via my annotation drawings (>20 check marks, ~10 exclamations, and even 2 hearts!)
elinorpd.bsky.social
Typing the (sometimes extreme) number of typo corrections is tedious, time consuming, and especially frustrating when there’s no line numbers on the pdf! It would honestly be faster for me to edit it myself 🙄
elinorpd.bsky.social
When reading papers, especially reviewing, I like to print and annotate as I read. I wish I could upload this to open review so authors can see smaller suggestions (typos, formatting errors) as well as smaller positive notes eg things I appreciated or found useful/interesting
elinorpd.bsky.social
totally agree! the peer review system is already over burdened and there needs to be an intermediate step for AI generated work. for example, AAAI received *double* the amount of normal submissions this year even after desk rejections aaai.org/conference/a....
AAAI-26 Review Process Update: Scale, Integrity Measures, and Pathways to Sustainability - AAAI
aaai.org
elinorpd.bsky.social
First crochet project done! Super proud and excited for the next one
Reposted by Elinor🎗️ @ COLM 🍁