Juan Diego Rodriguez (@ COLM 2025)
@juand-r.bsky.social
CS PhD student at UT Austin in #NLP Interested in language, reasoning, semantics and cognitive science. One day we'll have more efficient, interpretable and robust models! Other interests: math, philosophy, cinema https://www.juandiego-rodriguez.com/
Pinned
juand-r.bsky.social
One of the ways that LLMs can be inconsistent is the "generator-validator gap," where LLMs deem their own answers incorrect.

🎯 We demonstrate that ranking-based discriminator training can significantly reduce this gap, and improvements on one task often generalize to others!

🧵👇
A visualization of the generator-validator gap, where the LM likelihoods for the generator and discriminator forms of questions are poorly correlated. Aligning the validator and generator rankings can fix it!
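To make "poorly correlated" concrete, here is a minimal sketch of one way to quantify agreement between generator and validator rankings: a Spearman rank correlation over per-answer scores. This is an illustration only. The choice of Spearman correlation, the function names, and the score values are all assumptions made for this example; they are not taken from the paper and are not necessarily the metric it uses.

```python
def ranks(values):
    """Return 1-based ranks of values, averaging ranks across ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the window over any run of tied values.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        avg_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            result[order[k]] = avg_rank
        pos = end + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical log-likelihoods for five candidate answers to one question.
# These numbers are invented for illustration, not results from the paper.
generator_logprobs = [-0.5, -1.2, -2.0, -3.1, -4.0]  # score when generating the answer
validator_logprobs = [-1.5, -0.3, -2.5, -0.9, -3.0]  # score when judging the answer correct

gap_rho = spearman(generator_logprobs, validator_logprobs)
print(f"rank correlation between generator and validator: {gap_rho:.2f}")
```

A value near 1 would mean the model ranks answers the same way in both roles; lower values indicate a larger gap under this (assumed) measure.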
Reposted by Juan Diego Rodriguez (@ COLM 2025)
vgel.me
thebes @vgel.me · 14h
if you're interested in gaining a better intuition for how llms behave at inference time, you should try logitloom🌱, the open-source tool i made for exploring token trajectory trees (aka looming) on base and instruct models! more info in thread

🌱 vgel.me/logitloom
💻 github.com/vgel/logitloom
Reposted by Juan Diego Rodriguez (@ COLM 2025)
nsubramani23.bsky.social
At @colmweb.org all week 🥯🍁! Presenting 3 mechinterp + actionable interp papers at @interplay-workshop.bsky.social

1. BERTology in the Modern World w/ @bearseascape.bsky.social
2. MICE for CATs
3. LLM Microscope w/ Jiarui Liu, Jivitesh Jain, @monadiab77.bsky.social

Reach out to chat! #COLM2025
juand-r.bsky.social
Excited to present this at #COLM2025 tomorrow! (Tuesday, 11:00 AM poster session)
juand-r.bsky.social
One of the ways that LLMs can be inconsistent is the "generator-validator gap," where LLMs deem their own answers incorrect.

🎯 We demonstrate that ranking-based discriminator training can significantly reduce this gap, and improvements on one task often generalize to others!

🧵👇
A visualization of the generator-validator gap, where the LM likelihoods for the generator and discriminator forms of questions are poorly correlated. Aligning the validator and generator rankings can fix it!
Reposted by Juan Diego Rodriguez (@ COLM 2025)
mariaa.bsky.social
Here’s a #COLM2025 feed!

Pin it 📌 to follow along with the conference this week!
Reposted by Juan Diego Rodriguez (@ COLM 2025)
jessyjli.bsky.social
On my way to #COLM2025 🍁

Check out jessyli.com/colm2025

QUDsim: Discourse templates in LLM stories arxiv.org/abs/2504.09373

EvalAgent: retrieval-based eval targeting implicit criteria arxiv.org/abs/2504.15219

RoboInstruct: code generation for robotics with simulators arxiv.org/abs/2405.20179
juand-r.bsky.social
Excited to present this at COLM tomorrow! (Tuesday, 11:00 AM poster session)
juand-r.bsky.social
One of the ways that LLMs can be inconsistent is the "generator-validator gap," where LLMs deem their own answers incorrect.

🎯 We demonstrate that ranking-based discriminator training can significantly reduce this gap, and improvements on one task often generalize to others!

🧵👇
A visualization of the generator-validator gap, where the LM likelihoods for the generator and discriminator forms of questions are poorly correlated. Aligning the validator and generator rankings can fix it!
juand-r.bsky.social
Yes, smartphones are a great example.
As for computer technology more generally, it is often invisible to many people... They do not realize that our modern world would just stop working without it.
juand-r.bsky.social
(honest question, genuinely curious about your opinion) Do you think text/image/video generation has directly improved people's well-being in ways they are ignoring? (I mean people who are not programmers or researchers.)
Reposted by Juan Diego Rodriguez (@ COLM 2025)
socialmedialab.ca
🤖 Yeah, this place, like most of social media, is crawling with Russians, Chinese, Israelis, and others running their games. (See: readsludge.com/2025/09/15/d... and www.voanews.com/a/bluesky-co...)

Wish we had more time to hunt the bots. Stay sharp out there.
juand-r.bsky.social
👀
shravanvasishth.bsky.social
Computational Psycholinguistics Meeting 2025

cpl2025.sites.uu.nl

When: December 18–19, 2025

Where: Utrecht, the Netherlands

Abstract submission deadline: June 15, 2025

Organizers: Jakub Dotlačil, Lena Jäger, Bruno Nicenboim, Ece Takmaz
Reposted by Juan Diego Rodriguez (@ COLM 2025)
brunojnavarro.bsky.social
The Nobel prize winner Maria Ressa has said Americans are like “deer in the headlights” amid the collapse of US institutions and free speech under the Trump administration, particularly after Jimmy Kimmel’s suspension.
Americans are ‘deer in the headlights’ in face of Trump assault on free speech, Maria Ressa tells Jon Stewart
Nobel prize winner says US institutions have collapsed much quicker than expected under the Trump administration
www.theguardian.com
Reposted by Juan Diego Rodriguez (@ COLM 2025)
kristinacooke.bsky.social
I spoke to a Venezuelan woman who was arrested in this raid and later released with her 4yo son. She said agents broke down their door, pointed guns at them and made sexualized remarks about Venezuelan women. When she returned to her apartment it was boarded up and all her possessions were gone.
Reposted by Juan Diego Rodriguez (@ COLM 2025)
aaroth.bsky.social
One more thought: AI tools are a very useful research accelerator for an expert, and I plan to use them whenever I can. But at the moment it is very easy to be led down false paths if you let them get ahead of you and lure you too far from your expertise.
Reposted by Juan Diego Rodriguez (@ COLM 2025)
sfeucht.bsky.social
Nikhil's recent paper is a tour de force in causal analysis! They show that LLMs keep track of what characters know in a story using "pointer" mechanisms. Definitely worth checking out.
nikhil07prakash.bsky.social
How do language models track mental states of each character in a story, often referred to as Theory of Mind?

We reverse-engineered how LLaMA-3-70B-Instruct handles a belief-tracking task and found something surprising: it uses mechanisms strikingly similar to pointer variables in C programming!
juand-r.bsky.social
I’m excited for COLM this week!

Looking forward to chatting with people about interpretability, data efficient training, cog sci and LLM consistency.
Reposted by Juan Diego Rodriguez (@ COLM 2025)
crookedfootball.bsky.social
Stefan Zweig, The World of Yesterday, p. 436
Reposted by Juan Diego Rodriguez (@ COLM 2025)
emollick.bsky.social
Some important findings in this paper:
1) Working with AI boosts the performance of people solving math, science & ethics questions
2) The biggest boost is for the hardest problems
3) High performers remain highest performing, but low performers gain more
4) People who are good with AI gain most
Reposted by Juan Diego Rodriguez (@ COLM 2025)
acyn.bsky.social
Abughazaleh: I think Kristi Noem should be tried at The Hague. And if the response from ICE to people exercising their first amendment right is to drive vehicles through them, they should not be an agency in the US.
Reposted by Juan Diego Rodriguez (@ COLM 2025)
chrisshank.com
The best writing I’ve seen on this topic is the essay “Technically Radical: On the Unrecognized Potential of Tech Workers and Hackers” by @mutual-a.bsky.social

wedontagree.net/technically-...
“Given all this, I posit that the crux of the conflict today is, contra Karl Marx, not over wage relations. Rather it’s a conflict over what technology is developed and how it is deployed (conflicts over wage relations are merely a subset of this broader struggle). And while anyone who wants can play a part, those with technical skills and scientific knowledge have a key role to play.”
Reposted by Juan Diego Rodriguez (@ COLM 2025)
jasonkoebler.bsky.social
i made this meme which is better than the article:
trade meme. open ai receives: total sum of creative output from all humanity, $500 billion valuation
you receive: polluted internet, polluted world, collapse of society and nature of truth, no jobs, can put your face in my slop app