Lightnews — Scholar-powered news

Manuel Tonneau

@manueltonneau.bsky.social

710 followers 550 following 55 posts

PhD candidate @oiioxford.bsky.social NLP, Computational Social Science @WorldBank manueltonneau.com

Posts Media Videos Starter Packs

Pinned

Manuel Tonneau @manueltonneau.bsky.social · Jul 31

🏆 Thrilled to share that our HateDay paper has received an Outstanding Paper Award at #ACL2025

Big thanks to my wonderful co-authors: @deeliu97.bsky.social, Niyati, @computermacgyver.bsky.social, Sam, Victor, and @paul-rottger.bsky.social!

Thread 👇and data avail at huggingface.co/datasets/man...

2 7 29

Reposted by Manuel Tonneau

Oxford Internet Institute @oii.ox.ac.uk · 7d

ICYMI: Listen to @manueltonneau.bsky.social @oii.ox.ac.uk's interview with the SOEP podcast talking about his new research into hate speech, online platforms and disparities in content moderation across different European countries. Available here: bit.ly/4ntsiRU

Home - Somewhere On Earth Productions

SOMEWHERE ON EARTH PRODUCTIONS: We are here to connect technology and business to people and new possibilities.

bit.ly

1 1

Reposted by Manuel Tonneau

Andreu Casas @andreucasas.bsky.social · 9d

🚨Hiring a fully funded (3.5 years) PhD for the @ldnsocmedobs.bsky.social to research social media and politics. Candidates should have quantitative/computational skills and/or be interested in content curation/moderation. UK home candidates only unfortunately. www.royalholloway.ac.uk/media/hquftp...

www.royalholloway.ac.uk

1 14 4

Reposted by Manuel Tonneau

Tanise Ceron @taniseceron.bsky.social · 9d

📣 New Preprint!
Have you ever wondered what the political content in LLM's training data is? What are the political opinions expressed? What is the proportion of left- vs right-leaning documents in the pre- and post-training data? Do they correlate with the political biases reflected in models?

2 14 43

Reposted by Manuel Tonneau

Manoel Horta Ribeiro @manoelhortaribeiro.bsky.social · 22d

Social media feeds today are optimized for engagement, often leading to misalignment between users' intentions and technology use.

In a new paper, we introduce Bonsai, a tool to create feeds based on stated preferences, rather than predicted engagement.

arxiv.org/abs/2509.10776

5 46 150

Reposted by Manuel Tonneau

Joachim Baumann @joachimbaumann.bsky.social · 26d

🚨 New paper alert 🚨 Using LLMs as data annotators, you can produce any scientific result you want. We call this **LLM Hacking**.

Paper: arxiv.org/pdf/2509.08825

$We present our new preprint titled "Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation". We quantify LLM hacking risk through systematic replication of 37 diverse computational social science annotation tasks. For these tasks, we use a combined set of 2,361 realistic hypotheses that researchers might test using these annotations. Then, we collect 13 million LLM annotations across plausible LLM configurations. These annotations feed into 1.4 million regressions testing the hypotheses. For a hypothesis with no true effect (ground truth $p > 0.05$), different LLM configurations yield conflicting conclusions. Checkmarks indicate correct statistical conclusions matching ground truth; crosses indicate LLM hacking -- incorrect conclusions due to annotation errors. Across all experiments, LLM hacking occurs in 31-50\% of cases even with highly capable models. Since minor configuration changes can flip scientific conclusions, from correct to incorrect, LLM hacking can be exploited to present anything as statistically significant.$

5 94 260

Reposted by Manuel Tonneau

Chris Bail @chrisbail.bsky.social · Aug 29

1/ 🚨 Big news 🚨 today we’re launching Tech for Open Minds (TOM) at @DukeU— a global program exploring how technology shapes open-mindedness, humility & polarization 🌍🧠
🔗https://sicss.io/stories/2025-08-18

1 2 10

Reposted by Manuel Tonneau

Oxford Internet Institute @oii.ox.ac.uk · Aug 29

@themedialeader.bsky.social highlights new insights from @manueltonneau.bsky.social, @deeliu97.bsky.social, Prof. Ralph Schroeder + Prof. @computermacgyver.bsky.social, whose have found that 16mn EU-based X users “do not have moderators for their language.”

uk.themedialeader.com/social-platf...

Social platforms' 'language blind spots' in content moderation bring brand safety concerns

A new study of recently mandated transparency data under the EU Digital Services Act found that millions of users of social platforms in the region post in languages without any human moderation.

uk.themedialeader.com

1 2 3

Manuel Tonneau @manueltonneau.bsky.social · Aug 29

thanks a lot for the repost!

Reposted by Manuel Tonneau

rasmuskleis.bsky.social @rasmuskleis.bsky.social · Aug 28

Millions of users are posting to social media and other platforms in languages with zero moderators, even within the EU.

That's the topline finding from an impressive new working paper leveraging newly mandated transparency data under the DSA led by @manueltonneau.bsky.social osf.io/preprints/so...

OSF

osf.io

5 9

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

I don't have Portuguese roots but my parents liked the name, and I lived in Lisbon for a few months, so can speak um bocado :)

1 1

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

Thank you and great point! We did not but I suppose we could find the info in the DSA Transparency Database, at least for Spanish and Portuguese. The issue I foresee though is that we'll only have info for moderation in EU countries and nothing on Latin America. Still, worth a look, thanks again!

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

Merci beaucoup pour le repost :)

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

muito obrigado :)

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

@oii.ox.ac.uk @weizenbauminstitut.bsky.social @umassamherst.bsky.social @umich.edu

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

Finally tagging scholars whose work inspired this piece: @monaelswah.bsky.social @farhana-shahid.bsky.social @nicp.bsky.social @cgoanta.bsky.social Your feedback is most welcome!

1 1

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

This would also not have been possible without data collection efforts led by @jurgenpfeffer.bsky.social and without @claesdevreese.bsky.social @aurman21.bsky.social who made me aware of the DSA moderator count data on here a while back, thank you all!

Claes de Vreese @claesdevreese.bsky.social · Nov 5

One! That is the number of content moderators X has who speaks Dutch. One.

That person will be busy ahead of the national elections in the Netherlands on November 22

🗳️ 🇳🇱 commsky polcom polisky eusky

Source: transparency DSA files

transparency.twitter.com/dsa-transpar...

2 3

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

Had a blast working on this paper with my wonderful coauthors @deeliu97.bsky.social @antisomniac.bsky.social @ze.vin Ralph @ethanz.bsky.social @computermacgyver.bsky.social

1 3

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

We also issue a recommendation: platforms and regulators should improve transparency by reporting moderator counts with context (eg content volume per language), ensure consistent reporting over time, and extend data coverage beyond EU languages.

OII | OII researchers propose recommendations for effective data governance in light of the EU’s Digital Service Act

OII researchers propose a series of recommendations for effective data access and data governance in light of the EU’s Digital Service Act.

www.oii.ox.ac.uk

1 1

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

So what? The main implication is that speakers of underserved languages likely receive less protection from online harms. Our analysis also nuances existing concerns: while Global South languages are consistently underserved, allocation for other non-English languages varies widely across platforms.

1 1

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

For languages with moderators, we normalize mod counts by content volume per language and find that platforms allocate moderation workforce disproportionately relative to content volume, with languages primarily spoken in the Global South (Spanish, Portuguese, Arabic) consistently underserved.

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

We also quantify the amount of EU-based users whose national language does not have moderators, and we’re talking about millions of users posting in languages with zero moderators.

1 1

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

Taking Twitter/X as an example, we then show that languages subject to moderation blind spots are generally widely spoken on social media, representing an average of 31% of all tweets during a one-day period in countries where they are the official language.

1 1

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

We first look at language coverage and find that while larger platforms such as YouTube and Meta have moderators in most EU languages, smaller platforms such as X and Snapchat have several language blind spots with no human moderators, particularly in Southern, Eastern and Northern Europe.

1 1 3

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

Concerns about underinvestment in non-English moderation have long circulated via whistleblower leaks, but they were never quantified. The EU’s Digital Services Act is a turning point, requiring platforms to disclose moderator counts per language, making cross-lingual comparison possible.

Frances Haugen: ‘I never wanted to be a whistleblower. But lives were in danger’

The woman whose revelations have rocked Facebook tells how spending time with her mother, a priest, motivated her to speak out

www.theguardian.com

1 2

Manuel Tonneau @manueltonneau.bsky.social · Aug 28

Social media platforms operate globally, but do they allocate human moderation equitably across languages?

Our new WP shows the answer is no:

-Millions of users post in languages with zero moderators
-Where mods exist, mod count relative to content volume varies widely across langs

osf.io/amfws

2 11 18