(@techimpactpolicy.bsky.social). Formerly IU / Observatory on Social Media.
Computational social science, human-AI interaction, social media, trust and safety, etc.
🧨 matthewdeverna.com
🚨 If you're a fit for this job, I highly recommend applying!
Find more details via the link below.
www.linkedin.com/posts/yyahn_...
Cited sources were mostly fact-checking outlets, mainstream news, and government sites.
They have high reliability scores (NewsGuard) and tend to align with the political left.
Web search improved GPT models, but Gemini saw no benefit—likely because it failed to return sources for most queries.
GPT models often return citations, and many of them point to the PolitiFact article containing the fact check.
Again, curated info helps a lot.
However, when we give models curated fact-checking evidence, performance improves dramatically.
Can LLMs with reasoning + web search reliably fact-check political claims?
We evaluated 15 models from OpenAI, Google, Meta, and DeepSeek on 6,000+ PolitiFact claims (2007–2024).
Short answer: Not reliably—unless you give them curated evidence.
arxiv.org/abs/2511.18749
🚨AI and the Future of the Public Square🚨
Deserves a bit more attention (imho).
arxiv.org/abs/2412.09988
osf.io/preprints/ps...
He'll be discussing 🚨 Computational Research in the Post-API Age 🚨 — which seems more and more relevant as time goes by...
📅 Feb 6, 12 PM ET | 💻 Online
🔗 Register: iu.zoom.us/meeting/regi...
"Understanding the Prominence of Alternative Social Media Platforms"
Register here: iu.zoom.us/meeting/regi...
Learn about the series here: osome.iu.edu/events/speak...
Plurals: A System for Guiding LLMs Via Simulated Social Ensembles (arxiv.org/abs/2409.17213)
Comes with a Python package to guide LLM agents. Even allows incorporating nationally representative ensembles.
Package: github.com/josh-ashkina...
🧪
Register: iu.zoom.us/meeting/regi...
Abstract/Series info: osome.iu.edu/events/speak...
doi.org/10.48550/arX...