manueltonneau.com
Big thanks to my wonderful co-authors: @deeliu97.bsky.social, Niyati, @computermacgyver.bsky.social, Sam, Victor, and @paul-rottger.bsky.social!
Thread 👇and data avail at huggingface.co/datasets/man...
Have you ever wondered what the political content in LLM's training data is? What are the political opinions expressed? What is the proportion of left- vs right-leaning documents in the pre- and post-training data? Do they correlate with the political biases reflected in models?
Have you ever wondered what the political content in LLM's training data is? What are the political opinions expressed? What is the proportion of left- vs right-leaning documents in the pre- and post-training data? Do they correlate with the political biases reflected in models?
In a new paper, we introduce Bonsai, a tool to create feeds based on stated preferences, rather than predicted engagement.
arxiv.org/abs/2509.10776
In a new paper, we introduce Bonsai, a tool to create feeds based on stated preferences, rather than predicted engagement.
arxiv.org/abs/2509.10776
Paper: arxiv.org/pdf/2509.08825
Paper: arxiv.org/pdf/2509.08825
🔗https://sicss.io/stories/2025-08-18
🔗https://sicss.io/stories/2025-08-18
uk.themedialeader.com/social-platf...
uk.themedialeader.com/social-platf...
That's the topline finding from an impressive new working paper leveraging newly mandated transparency data under the DSA led by @manueltonneau.bsky.social osf.io/preprints/so...
That's the topline finding from an impressive new working paper leveraging newly mandated transparency data under the DSA led by @manueltonneau.bsky.social osf.io/preprints/so...
Our new WP shows the answer is no:
-Millions of users post in languages with zero moderators
-Where mods exist, mod count relative to content volume varies widely across langs
osf.io/amfws
Our new WP shows the answer is no:
-Millions of users post in languages with zero moderators
-Where mods exist, mod count relative to content volume varies widely across langs
osf.io/amfws
Big thanks to my wonderful co-authors: @deeliu97.bsky.social, Niyati, @computermacgyver.bsky.social, Sam, Victor, and @paul-rottger.bsky.social!
Thread 👇and data avail at huggingface.co/datasets/man...
Big thanks to my wonderful co-authors: @deeliu97.bsky.social, Niyati, @computermacgyver.bsky.social, Sam, Victor, and @paul-rottger.bsky.social!
Thread 👇and data avail at huggingface.co/datasets/man...
In our new pre-print, we find this practice of repurposing accounts to be prevalent and consequential on YouTube!
arxiv.org/abs/2507.16045
In our new pre-print, we find this practice of repurposing accounts to be prevalent and consequential on YouTube!
arxiv.org/abs/2507.16045
To answer this, we introduce 🤬HateDay🗓️, a global hate speech dataset representative of a day on Twitter.
The answer: not really! Detection perf is low and overestimated by traditional eval methods
arxiv.org/abs/2411.15462
🧵
We tested whether AI-generated messages – single static messages vs. conversations – can boost intent to screen for colorectal cancer.
Turns out: short, tailored AI messages outperform expert-written materials & match conversations, at a fraction of the time! 🧵👇
We tested whether AI-generated messages – single static messages vs. conversations – can boost intent to screen for colorectal cancer.
Turns out: short, tailored AI messages outperform expert-written materials & match conversations, at a fraction of the time! 🧵👇
Because someone at Musk’s xAi deliberately did this, and we only found out because they were clumsy.
My piece on the real dangers of AI.
Gift link:
www.nytimes.com/2025/05/17/o...
Because someone at Musk’s xAi deliberately did this, and we only found out because they were clumsy.
My piece on the real dangers of AI.
Gift link:
www.nytimes.com/2025/05/17/o...
arxiv.org/abs/2505.09254
arxiv.org/abs/2505.09254
We have a new consensus statement with 120 experts on this topic. Check it out to see where experts agree and where they think more evidence is needed!
This is a hotly debated and often polarizing debate. So we surveyed over 120 experts on the topic to see where there was genuine consensus (or not), like experts have previous done for climate change.
See our paper: osf.io/preprints/ps...
We have a new consensus statement with 120 experts on this topic. Check it out to see where experts agree and where they think more evidence is needed!
📌 Main: HateDay: A Global Hate Speech Dataset Representative of Twitter (arxiv.org/abs/2411.15462)
📌 Findings: When Claims Evolve – Robustness to Misinformation Edits (arxiv.org/abs/2503.03417)
See you all in Vienna! 🇦🇹 #ACL2025 #NLProc
📌 Main: HateDay: A Global Hate Speech Dataset Representative of Twitter (arxiv.org/abs/2411.15462)
📌 Findings: When Claims Evolve – Robustness to Misinformation Edits (arxiv.org/abs/2503.03417)
See you all in Vienna! 🇦🇹 #ACL2025 #NLProc
@manueltonneau.bsky.social presenting "HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter" ✨
#NLProc #hatespeech
@manueltonneau.bsky.social presenting "HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter" ✨
#NLProc #hatespeech
A 3-min conversation with GPT-4o nudged HPV-vax-hesitant parents (who obv knew it was AI & consented!)—BUT reading standard public-health material still outperformed chatbots in impact and longevity. Details below 👇
A 3-min conversation with GPT-4o nudged HPV-vax-hesitant parents (who obv knew it was AI & consented!)—BUT reading standard public-health material still outperformed chatbots in impact and longevity. Details below 👇