Andrea Piergentili
@apierg.bsky.social
310 followers 280 following 24 posts
PhD student at the University of Trento and @fbk-mt.bsky.social, working on gender-inclusive machine translation (he/him) Applied Scientist Intern at Amazon apierg.github.io #NLP #NLProc #MT
Posts Media Videos Starter Packs
Reposted by Andrea Piergentili
gitt-workshop.bsky.social
Last but definitely not least: @bsavoldi.bsky.social presenting joint work with @apierg.bsky.social @matteo-negri.bsky.social @luisabentivogli.bsky.social on scalable gender neutral translation evaluation using LLM-as-a-judge at #GITT2025
apierg.bsky.social
Super interesting paper by Subramonian et al: "Agree to Disagree? A Meta-Evaluation of LLM Misgendering" arxiv.org/abs/2504.17075
Turns out, misgendering is messier than just pronouns. I'd love to see this analysis extended to grammatical gender languages! #LLM #AI #ethics @fbk-mt.bsky.social
Agree to Disagree? A Meta-Evaluation of LLM Misgendering
Numerous methods have been proposed to measure LLM misgendering, including probability-based evaluations (e.g., automatically with templatic sentences) and generation-based evaluations (e.g., with automatic heuristics or human validation). However, it has gone unexamined whether these evaluation methods have convergent validity, that is, whether their results align. Therefore, we conduct a systematic meta-evaluation of these methods across three existing datasets for LLM misgendering. We propose a method to transform each dataset to enable parallel probability- and generation-based evaluation. Then, by automatically evaluating a suite of 6 models from 3 families, we find that these methods can disagree with each other at the instance, dataset, and model levels, conflicting on 20.2% of evaluation instances. Finally, with a human evaluation of 2400 LLM generations, we show that misgendering behaviour is complex and goes far beyond pronouns, which automatic evaluations are not currently designed to capture, suggesting essential disagreement with human evaluations. Based on our findings, we provide recommendations for future evaluations of LLM misgendering. Our results are also more widely relevant, as they call into question broader methodological conventions in LLM evaluation, which often assume that different evaluation methods agree.
arxiv.org
Reposted by Andrea Piergentili
bsavoldi.bsky.social
🔍 Stiamo studiando come l'AI viene usata in Italia e per farlo abbiamo costruito un sondaggio!

👉 bit.ly/sondaggio_ai...

(è anonimo, richiede ~10 minuti, e se partecipi o lo fai girare ci aiuti un sacco🙏)

Ci interessa anche raggiungere persone che non si occupano e non sono esperte di AI!
Qualtrics Survey | Qualtrics Experience Management
The most powerful, simple and trusted way to gather experience data. Start your journey to experience management and try a free account today.
bit.ly
Reposted by Andrea Piergentili
sarapapi.bsky.social
🚀 New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in 🇬🇧 English and 🇮🇹 Italian.

The models are live and ready to try on @hf.co:
🔗 huggingface.co/collections/...

📄 Preprint: arxiv.org/abs/2505.22759

#ASR #ST #OpenScience #MultilingualAI
FAMA - a FBK-MT Collection
The First Large-Scale Open-Science Speech Foundation Model for English and Italian
huggingface.co
Reposted by Andrea Piergentili
jdaems.bsky.social
👀 Wanted: #Italian or #Dutch native speakers to take a survey on audiovisual translation for a master thesis student: watch a short video, answer some questions, help academic research 😎
⏩ Sharing = nice! ❤️
NL link: ugent.qualtrics.com/jfe/form/SV_...
IT link: ugent.qualtrics.com/jfe/form/SV_...
a woman is standing in front of a bookshelf in a bookstore and talking about research .
ALT: a woman is standing in front of a bookshelf in a bookstore and talking about research .
media.tenor.com
Reposted by Andrea Piergentili
gitt-workshop.bsky.social
💭Dreaming of attending #GITT2025 but need a little extra 💸 boost?
📣 Bursary applications to support participation are now open at tinyurl.com/gitt25
📆 Deadline May 9th
🙏Thanks to our incredible sponsors DCA at Tilburg University tinyurl.com/tudca25 and FLW at Ghent University www.ugent.be/lw/en
a man in a suit is making a funny face with the words dreams are expensive behind him
ALT: a man in a suit is making a funny face with the words dreams are expensive behind him
media.tenor.com
Reposted by Andrea Piergentili
fbk-mt.bsky.social
📢 Come and join our group!
We offer a fully funded 3-year PhD position:

📔 Automatic translation with large multimodal models: iecs.unitn.it/education/ad...

📍Full details for application: iecs.unitn.it/education/ad...

📅 Deadline May 12, 2025

#NLProc #FBK
Reserved topic scholarships | Doctoral Program - Information Engineering and Computer Science
iecs.unitn.it
Reposted by Andrea Piergentili
gitt-workshop.bsky.social
While we look forward to a sunny Geneva, why wait to join the conversation?

We’ve created a starter pack for our #GITT2025 friends!
🕵️ Follow researchers working on gender bias in MT
💬 Stay up to date and dive into the discussion!

All info at sites.google.com/tilburgunive...
Reposted by Andrea Piergentili
jeremyfaust.bsky.social
BREAKING NEWS: CDC orders mass retraction and revision of submitted research across all science and medicine journals. Banned terms must be scrubbed.

Goes beyond MMWR +other CDC pubs. Applies to research already submitted to top medical journals.

Take a look.
open.substack.com/pub/insideme...
BREAKING NEWS: CDC orders mass retraction and revision of submitted research across all science and medicine journals. Banned terms must be scrubbed.
Any unpublished manuscript mentioning certain topics, including gender and "LGBT," must be pulled or revised.
open.substack.com
Reposted by Andrea Piergentili
fbk-mt.bsky.social
🙌 All members of our group are now on Bluesky! 🙌

You can find all of us in this starter pack 👇
apierg.bsky.social
Looking ahead to 2025, my goal is to keep the momentum and build on this year’s lessons: being more intentional about time management, becoming a better collaborator, and and carving out time for deep, focused work.
apierg.bsky.social
The rest of the year was spent on testing new things, new collaborations, reading and reviewing papers, and traveling around for conferences. No doubt this has been the year where I learned the most so far, and 99% of the learning happened because I had access to some amazing (and patient) people.
apierg.bsky.social
I also developed a demo showcasing gender-neutral translation with LLMs, which I had the chance to present at FBK’s Digital Industry Center Demo Day. Unfortunately the demo is not open to the public for now, but here is a photo of @bsavoldi.bsky.social and me presenting it ✌️
apierg.bsky.social
Two key resources enabled the research progress we made this year: GeNTE (2023) and Neo-GATE (2024). They are benchmarks for the conservative and the innovative approach respectively and are both freely available on Hugging Face:

huggingface.co/datasets/FBK...
huggingface.co/datasets/FBK...
apierg.bsky.social
My research topic is gender-inclusive MT, and this year we explored two directions: the "conservative" one with gender-neutral translation and the "innovative" one, using neomorphemes (like ə and *, in Italian). I worked on papers published at venues ranging from top conferences to local workshops.
apierg.bsky.social
With 2024 wrapping up, and given how little I’ve posted here (or anywhere, really), I thought I’d share a quick recap of my year and finally make some ✨content✨
but look i made you some is written in white on a black background
ALT: but look i made you some is written in white on a black background
media.tenor.com
Reposted by Andrea Piergentili
fbk-mt.bsky.social
Our @apierg.bsky.social presenting our #calamita challenges at #CLiCit2024: machine translation and gender-fair generation.

Poster session upcoming, see you there!

For more details:
👉 MagneT: clic2024.ilc.cnr.it/wp-content/u...
👉 GFG: clic2024.ilc.cnr.it/wp-content/u...
Reposted by Andrea Piergentili
fbk-mt.bsky.social
Our very own @dennisfucci.bsky.social presenting the challenges of Explainability for Speech Models at #CLiCit2024. If you’re interested, check out the paper 👉 clic2024.ilc.cnr.it/wp-content/u...
#NLProc
Reposted by Andrea Piergentili
fbk-mt.bsky.social
Today @luisabentivogli.bsky.social, Dennis Fucci, and @apierg.bsky.social presented a research communication about gender-neutral translation in the morning poster session #CLiCit2024 #NLProc
Reposted by Andrea Piergentili
daniel-russo.bsky.social
If you are in Pisa at #CLiCit2024 don't miss the presentation of our last work today at 12 🔥
land-fbk.bsky.social
To click it, or not to click it: that is the question.

Our #CLiC-it2024 contribution is finally out: "To Click It or Not to Click It: An Italian Dataset for Neutralising Clickbait Headlines" authored by @daniel-russo.bsky.social, Oscar Araque, and Marco Guerini.

clic2024.ilc.cnr.it/wp-content/u...
Reposted by Andrea Piergentili
deboranozza.bsky.social
I’ve created an Italian #NLProc Researcher Starter Pack 🇮🇹

DM me to join if you're not in yet!

go.bsky.app/LHbWLHp