Michael A. Hedderich
@mhedderich.bsky.social
95 followers 78 following 15 posts
Research group leader at LMU Munich and MCML on ML, NLP & HCI. Also experimenting with lemonade that glows in the dark 🥤 (he/him)
Posts Media Videos Starter Packs
Pinned
mhedderich.bsky.social
What changes if you take the LLM prompt “Tell me a short story about Dr. Li” and replace “Dr. Li” with “Dr. Smith”?

Would you have guessed that this introduces a massive gender bias, from ca. half/half to 99% male doctors?



In our #ACL2025 paper we present the Spotlight framework which...
mhedderich.bsky.social
Check out our survey at #EMNLP2025 and help build a future where low-resource languages including African languages are represented in NLP!

Paper: arxiv.org/abs/2505.21315

This is work lead (in a great way) by Jesujoba Alabi and together with David Adelani and Dietrich Klakow.
Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
With over 2,000 languages and potentially millions of speakers, Africa represents one of the richest linguistic regions in the world. Yet, this diversity is scarcely reflected in state-of-the-art natu...
arxiv.org
mhedderich.bsky.social
Based on the analysis, we suggest future directions including:
1️⃣ Scale beyond the top-10 high-resource languages
2️⃣ Build more multicultural, native-language datasets
3️⃣ Develop African-centric LLMs
4️⃣ Focus on human-centered, application-driven NLP
mhedderich.bsky.social
Key findings include:
1️⃣ Papers have increased rapidly in the last 5 years 📈
2️⃣ Research is skewed toward certain tasks like MT and NLU
3️⃣ Language coverage is uneven, with a few languages dominating
mhedderich.bsky.social
We cover datasets, tasks, methods, and themes across 25+ venues (NLP, speech, HCI, ML), and manually analyzed 884 papers for this survey.
mhedderich.bsky.social
We have 3 main goals:
1️⃣ Comprehensive Overview – Map the research landscape
2️⃣ Accessible Entry Point – Easy starting point for new researchers
3️⃣ Open Issues – Highlight gaps and challenges
mhedderich.bsky.social
Despite resource gaps, NLP research on African languages is far from dormant. Growth is fueled by community initiatives, multilingual large corpora, shared tasks, and dedicated venues, making this a great time to chart the field.
mhedderich.bsky.social
Excited to share that our survey paper "Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead" lead by Jesujoba Alabi has been accepted at #EMNLP2025! Here’s a short 🧵 about the paper.
NLP research distribution across Africa by
language coverage.
Reposted by Michael A. Hedderich
mainlp.bsky.social
Headed to ACL? MaiNLP & our most recent work will be there too👥📄
Come see what we’ve been working on!
mhedderich.bsky.social
Looking forward to my visit to Hamburg University and their Data Science group!
ds-hamburg.bsky.social
Our group is launching a monthly seminar series! Next week, we will have @mhedderich.bsky.social from LMU Munich, who will give a talk at our seminar.

Date: July 16
Time: 10.00 - 11.30 am CET

You can either attend the seminar via Zoom or in-person. Register here if you are interested.
University of Hamburg Data Science Group Seminar
This is the signup form for University of Hamburg Group Seminar. Please fill your details in here. You will be sent the zoom link a few hours before the lecture, if your details are valid. Details fo...
docs.google.com
mhedderich.bsky.social
Joint work with Anyi Wang, @raoyuan.bsky.social , @florian-eichin.com , Jonas Fischer and @barbaraplank.bsky.social 



Check out the paper at arxiv.org/abs/2504.158... or discuss the work with us at #ACL2025 in Vienna.
mhedderich.bsky.social
Through

📊 3 new benchmarks with ground truth

📚 evaluation on existing prompt data
🛠 demonstration studies, and

🙇 a user study

we show how Spotlight can reliably provide new insights and support users uncovering relevant differences on bias, cultural artifacts, language style, model failure,...
mhedderich.bsky.social
uses data mining + human analysis to supports users in better understanding the behavior of LLM models 🔎



We leverage token patterns to automatically distinguish between random (decoding) variations and systematic differences in LLM outputs and guide the user in their nuanced analysis.
mhedderich.bsky.social
What changes if you take the LLM prompt “Tell me a short story about Dr. Li” and replace “Dr. Li” with “Dr. Smith”?

Would you have guessed that this introduces a massive gender bias, from ca. half/half to 99% male doctors?



In our #ACL2025 paper we present the Spotlight framework which...
mhedderich.bsky.social
uses data mining + human analysis to supports users in better understanding the behavior of LLM models 🔎

We leverage token patterns to automatically distinguish between random (decoding) variations and systematic differences in LLM outputs and guide the user in their nuanced analysis.
mhedderich.bsky.social
Interpretability meets Discourse. Congratulations to
@florian-eichin.com to his first ACL paper 🎉
janetlauyeung.bsky.social
🦙 how well do LLMs encode discourse knowledge? does that generalize across languages?

🛎️ in our #ACL2025 paper, we uncover fascinating trends about multilingual discourse representations!

joint work w/ @florian-eichin.com @barbaraplank.bsky.social @mhedderich.bsky.social

📄 arxiv.org/abs/2503.10515
to appear at ACL2025
Reposted by Michael A. Hedderich
florian-eichin.com
Want to know if your prompting is also affected by this? Addressing this and other issues systematically, we proposed Spotlight, which utilizes data mining to uncover the effects of prompt- and model-changes (meet us at ACL to discuss)
arxiv.org/abs/2504.15815
Reposted by Michael A. Hedderich
barbaraplank.bsky.social
Are you attending NAACL 2025 and are you interested in low-resource languages and dialects?

Then don't miss our very own @verenablaschke.bsky.social's keynote talk at the WNUT 2025 workshop on May 3rd:

Beyond “noisy” text: How (and why) to process dialect data

🌐 ☀️
noisy-text.github.io/2025/
mhedderich.bsky.social
Happy to be part of that team for almost 1/3 of that time 😀
mainlp.bsky.social
🎉MaiNLP is turning 3 today!🎂🥳 We’ve grown a lot since @barbaraplank.bsky.social started this group with nothing but three aspiring researches and a hand-drawn sign on the door. Huge thanks to all the amazing people who have joined or visited us since. Here’s to many more years of exciting research!🚀
The hand-drawn sign from three years ago.