MilaNLP Lab
milanlp.bsky.social
MilaNLP Lab
@milanlp.bsky.social
The Milan Natural Language Processing Group #NLProc #AI

milanlproc.github.io
We're happy to have @veraneplenbroek.bsky.social at our lab this week! She presented her #EMNLP2025 work "Reading Between the Prompts: How Stereotypes Shape LLM's Implicit Personalization" and shared more of her exciting ongoing work.

#NLProc
November 26, 2025 at 1:17 PM
#MemoryModay #NLProc 'Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers,' by Nguyen & @dirkhovy.bsky.social decodes speaker reviews for user preferences using topic models. Domain knowledge needed for market analysis.
Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers
Hanh Nguyen, Dirk Hovy. Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019). 2019.
aclanthology.org
November 24, 2025 at 4:01 PM
What an inspiring week at #EMNLP2025 in Suzhou🇨🇳!
Huge thanks to the organizers and everyone who stopped by our poster/talk!
November 24, 2025 at 10:20 AM
For our weekly lab seminar, it was a pleasure to have @andersgiovanni.com presenting his research "How AI Affects Us: Controlled Experiments in Human-AI Interaction".

#NLProc
November 21, 2025 at 3:58 PM
#TBT #NLProc ' Attanasio et al. study asks 'Is It Worth the (Environmental) Cost?' analyzing continuous training for language models. Balances benefits, environmental impacts, for responsible use. #Sustainability'
arxiv.org
November 20, 2025 at 4:02 PM
For our weekly reading group, @joachimbaumann.bsky.social presented the upcoming PNAS article "The potential existential threat of large language models to online survey research" by @
@seanjwestwood.bsky.social.
November 20, 2025 at 11:54 AM
#MemoryModay #NLProc ' 'State of Profanity Obfuscation in NLP Scientific Publications' probes bias in non-English papers. @deboranozza.bsky.social & @dirkhovy.bsky.social (2023) propose 'PrOf' to aid authors & improve access.
The State of Profanity Obfuscation in Natural Language Processing Scientific Publications
Debora Nozza, Dirk Hovy. Findings of the Association for Computational Linguistics: ACL 2023. 2023.
aclanthology.org
November 17, 2025 at 4:04 PM
#TBT #NLProc Hessenthaler et al.'s 2022 work delves into AI's link with fairness & energy reduction in English NLP models, challenging bias reduction theories. #AI #sustainability
Bridging Fairness and Environmental Sustainability in Natural Language Processing
Marius Hessenthaler, Emma Strubell, Dirk Hovy, Anne Lauscher. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022.
aclanthology.org
November 13, 2025 at 4:05 PM
#MemoryModay #NLProc 'Measuring Harmful Representations in Scandinavian Language Models' uncovers gender bias, challenging Scandinavia's equity image.
Measuring Harmful Representations in Scandinavian Language Models
Samia Touileb, Debora Nozza. Proceedings of the Fifth Workshop on Natural Language Processing and Computational Social Science (NLP+CSS). 2022.
aclanthology.org
November 10, 2025 at 4:03 PM
#TBT #NLProc "Explaining Speech Classification Models" by Pastor et al. (2024) makes speech classification more transparent! 🔍 Their research reveals which words matter most and how tone and background noise impact decisions.
Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features
Eliana Pastor, Alkis Koudounas, Giuseppe Attanasio, Dirk Hovy, Elena Baralis. Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long...
aclanthology.org
November 6, 2025 at 4:04 PM
Reposted by MilaNLP Lab
LLMs require social knowledge to understand implicit misogyny, yet they mostly fail. If you want to know more, come check my poster from 12.30 to 13.30!

Paper: aclanthology.org/2025.finding...

#EMNLP2025
Proud to present our #EMNLP2025 papers!
Catch our team across Main, Findings, Workshops & Demos 👇
November 5, 2025 at 5:24 PM
Reposted by MilaNLP Lab
Feeling a little sad not to be in Suzhou for #EMNLP2025, but so proud of all the amazing work from our MilaNLP Lab! 💫

Honored to have received the Outstanding Senior Area Chair Award!

Check out our papers 👇
Proud to present our #EMNLP2025 papers!
Catch our team across Main, Findings, Workshops & Demos 👇
November 5, 2025 at 6:07 PM
#MemoryModay #NLProc 'Universal Joy: A Data Set and Results for Classifying Emotions Across Languages' by Lamprinidis et al. (2021) explores how AI research affects our planet.
Universal Joy A Data Set and Results for Classifying Emotions Across Languages
Sotiris Lamprinidis, Federico Bianchi, Daniel Hardt, Dirk Hovy. Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. 2021.
aclanthology.org
November 3, 2025 at 4:02 PM
For our weekly lab seminar it was a pleasure to have Valerio Capraro talking about The Economics of Language.

#NLProc
October 31, 2025 at 4:20 PM
Proud to present our #EMNLP2025 papers!
Catch our team across Main, Findings, Workshops & Demos 👇
October 31, 2025 at 2:04 PM
#TBT #NLProc Explore 'Wisdom of Instruction-Tuned LLM Crowds' by Plaza et al. LLM labels outperform single models in tasks & languages. But few-shot can't top zero-shot. Supervised models rule.
Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation
Flor Miriam Plaza-del-Arco, Debora Nozza, Dirk Hovy. Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives) @ LREC-COLING 2024. 2024.
aclanthology.org
October 30, 2025 at 4:05 PM
Great session today in our lab reading group. Thanks to Emanuele Moscato for presenting the article “Universities are embracing AI: will students get smarter or stop thinking?” from @naturemagazine.bsky.social.

Article: www.nature.com/articles/d41...

#NLProc
October 30, 2025 at 1:35 PM
Reposted by MilaNLP Lab
LLMs are good at simulating human behaviours, but they are not going to be great unless we train them to.

We hope SimBench can be the foundation for more specialised development of LLM simulators.

I really enjoyed working on this with @tiancheng.bsky.social et al. Many fun results 👇
Can AI simulate human behavior? 🧠
The promise is revolutionary for science & policy. But there’s a huge "IF": Do these simulations actually reflect reality?
To find out, we introduce SimBench: The first large-scale benchmark for group-level social simulation. (1/9)
October 28, 2025 at 5:58 PM
Reposted by MilaNLP Lab
There’s plenty of evidence for political bias in LLMs, but very few evals reflect realistic LLM use cases — which is where bias actually matters.

IssueBench, our attempt to fix this, is accepted at TACL, and I will be at #EMNLP2025 next week to talk about it!

New results 🧵
Are LLMs biased when they write about political issues?

We just released IssueBench – the largest, most realistic benchmark of its kind – to answer this question more robustly than ever before.

Long 🧵with spicy results 👇
October 29, 2025 at 4:12 PM
For our last Thursday Reading Group, @taniseceron.bsky.social presented "Helpful, harmless, honest? Sociotechnical limits of AI alignment and safety through Reinforcement Learning from Human Feedback" by A. D. Lindström et al. (2025)

Paper: link.springer.com/article/10.1...

#NLProc
October 28, 2025 at 10:43 AM
#MemoryModay #NLProc 'Dense Node Representation for Geolocation' by Fornaciari & @dirkhovy.bsky.social reveals efficient geolocation methods using node2vec & doc2vec models. Greater network size, less parameters.
Dense Node Representation for Geolocation
Tommaso Fornaciari, Dirk Hovy. Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019). 2019.
aclanthology.org
October 27, 2025 at 4:06 PM
Reposted by MilaNLP Lab
Over the past two days, I participated in the @erc.europa.eu Workshop on Data Access under DSA Article 40.

An enriching experience that deepened my understanding of the DSA's implications for research and enabled me to connect with exceptional media researchers.

erc.europa.eu/news-events/...
ERC Workshop on data access under the Digital Services Act (DSA) Article 40 (opening session)
The Digital Services Act (DSA) is an European legislation that specifies a set of rules to make the digital space safer and more trustworthy for users.
erc.europa.eu
October 23, 2025 at 5:02 PM
#TBT #NLProc 'Classist Tools: Social Class Correlates with Performance in NLP' by Curry et al. (2024) explores AI's hidden energy problem, and how machine learning impacts environmental sustainability.
October 23, 2025 at 3:06 PM
Reposted by MilaNLP Lab
🚀 We are pleased to announce the First Call for Papers for #WASSA2026

This year, we introduce a Special Track on Multilinguality and Social Bridges between High- & Lesser-Resourced Languages/Communities. 🌍

🗓️ Deadlines: Dec 17 (direct) and Jan 2 (ARR).
🔗 workshop-wassa.github.io/2026/call-fo...
Call for Papers
Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis
workshop-wassa.github.io
October 21, 2025 at 2:12 PM
#MemoryModay #NLProc 'Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection' - Attanasio et al. Explores reliability of interpretability in hate speech detection.
Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection
Giuseppe Attanasio, Debora Nozza, Eliana Pastor, Dirk Hovy. Proceedings of NLP Power! The First Workshop on Efficient Benchmarking in NLP. 2022.
aclanthology.org
October 20, 2025 at 3:23 PM