MT Group at FBK
@fbk-mt.bsky.social
190 followers 170 following 45 posts
#MachineTranslation Research Unit @ Fondazione Bruno Kessler #nlproc #deeplearning #ai mt.fbk.eu
Posts Media Videos Starter Packs
Reposted by MT Group at FBK
linaconti.bsky.social
🎉 Excited to share that my paper "The Unheard Alternative" was accepted to @blackboxnlp.bsky.social 2025!
We introduce contrastive explanations for speech-to-text, identifying which audio features ST models use to assign a grammatical gender to the speaker.
📄 Preprint: arxiv.org/abs/2509.265...
The Unheard Alternative: Contrastive Explanations for Speech-to-Text Models
Contrastive explanations, which indicate why an AI system produced one output (the target) instead of another (the foil), are widely regarded in explainable AI as more informative and interpretable th...
arxiv.org
Reposted by MT Group at FBK
sarapapi.bsky.social
🚀 Excited to present FAMA, the first large-scale #OpenScience #Speech foundation model for 🇮🇹 Italian & 🇬🇧 English, at #clicit2025 (17:30–18:45 oral session)!

🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...
📄 Preprint: arxiv.org/pdf/2505.22759
Reposted by MT Group at FBK
dh-fbk.bsky.social
We are on our way to Casteddu for #clicit2025 with a guest from @fbk-mt.bsky.social @ailc-nlp.bsky.social
fbk-mt.bsky.social
Our pick of the week by @sarapapi.bsky.social: "Retrieval-Augmented Generation for AI-Generated Content: A Survey" by Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, Jie Jiang, Bin Cui.

arxiv.org/pdf/2402.19473

#RAG #survey
fbk-mt.bsky.social
Our pick of the week by @zhihangxie.bsky.social: "SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation" by Chenyang Le, Bing Han, Jinshun Li, Songyong Chen, and Yanmin Qian (2025)

#Speech #Simultaneous #Translation #MOE #SpeechTech
zhihangxie.bsky.social
🚀 SimulMEGA: MoE Routers as advanced policy makers for Simultaneous Speech Translation 🎧🌍
Mixture-of-Experts routing → smarter decisions on when & how to translate, balancing latency vs quality in real-time speech. Paper link at arxiv.org/pdf/2509.012...
arxiv.org
fbk-mt.bsky.social
Our pick of the week by @beomseok-lee.bsky.social: "Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs" by Dingdong Wang, Junan Li, Mingyu Cui, Dongchao Yang, Xueyuan Chen, and Helen Meng (EMNLP 2025)
fbk-mt.bsky.social
Our pick of the week by @linaconti.bsky.social: "I Have No Mouth, and I Must Rhyme: Uncovering Internal Phonetic Representations in LLaMA 3.2" @jackmerullo.bsky.social, Arjun Khurana, Oliver McLaughlin (ICML 2025 Workshop on Assessing World Models)

arxiv.org/abs/2508.02527

#XAI #LLM
fbk-mt.bsky.social
Finally, we contributed to "NUTSHELL: A Dataset for Abstract Generation from Scientific Talks" presented by @maikezufle.bsky.social from @ai4lt.bsky.social

👉 aclanthology.org/2025.iwslt-1...
(6/6)
https://aclanthology.org/2025.iwslt-1.2…
fbk-mt.bsky.social
Marco Gaido took a deep dive into "The Warmup Dilemma" @iwslt.bsky.social

👉 aclanthology.org/2025.iwslt-1...
(5/6)
fbk-mt.bsky.social
We organized 5 tasks at #IWSLT:
📌 Offline
📌 Simultaneous
📌 Subtitling
📌Model compression
📌 Instruction following

👉 iwslt.org/2025/#shared...
(4/6)
fbk-mt.bsky.social
@dennisfucci.bsky.social shared insightful findings on gender bias through the lens of interpretability: "Different speech translation models encode and translate speaker gender differently"

👉 aclanthology.org/2025.acl-sho...
(3/6)
fbk-mt.bsky.social
@sarapapi.bsky.social presented her TACL paper: “How real is your real-time simultaneous speech-to-text translation system?”

👉 aclanthology.org/2025.tacl-1.14/
(2/6)
fbk-mt.bsky.social
Heading home after an exciting and intense @aclmeeting.bsky.social in Vienna! We had a great time presenting our work and connecting with the community.

Thanks to everyone who came by!

#acl2025 #nlproc
(1/6)
fbk-mt.bsky.social
Before presenting our speech model compression task at IWSLT, our pick of the week by Marco Gaido: WhisperKit arxiv.org/abs/2507.10860 by Atila Orhon, Arda Okan, Berkin Durmus, Zach Nagengast and @eduardo-pacheco.bsky.social (ICML 2025)—an early attempt to bring large-scale models to edge devices
WhisperKit: On-device Real-time ASR with Billion-Scale Transformers
Real-time Automatic Speech Recognition (ASR) is a fundamental building block for many commercial applications of ML, including live captioning, dictation, meeting transcriptions, and medical scribes. ...
arxiv.org
fbk-mt.bsky.social
Our pick of the week by @zhihangxie.bsky.social: "Adversarial Speech-Text Pre-Training for Speech Translation" by Chenxuan Liu, Liping Chen, Weitai Zhang, Xiaoxi Li, Peiwang Tang, Mingjia Yu, Sreyan Ghosh, and Zhongyi Ye (ICASSP 2025)
fbk-mt.bsky.social
Our pick of the week by @zhihangxie.bsky.social 🔎: "PHRASED: Phrase Dictionary Biasing for Speech Translation" by Peidong Wang, Jian Xue, Rui Zhao, Junkun Chen, Aswin Shanmugam Subramanian, Jinyu Li
arxiv.org/abs/2506.09175

#speech #AI #ST #NLP
Reposted by MT Group at FBK
gitt-workshop.bsky.social
@bsavoldi.bsky.social taking us back in time at #GITT2025 ⌚⏳ focusing on the first discussions of gender bias in language technology as a socio-technical issue. No, the problem hasn't been fixed yet. But what has happened?
Reposted by MT Group at FBK
gitt-workshop.bsky.social
Last but definitely not least: @bsavoldi.bsky.social presenting joint work with @apierg.bsky.social @matteo-negri.bsky.social @luisabentivogli.bsky.social on scalable gender neutral translation evaluation using LLM-as-a-judge at #GITT2025
fbk-mt.bsky.social
Our pick of the week by @dennisfucci.bsky.social: "Speech Representation Analysis Based on Inter- and Intra-Model Similarities" by @yelkheir.bsky.social, @ratedali.bsky.social, and Shammur Absar Chowdhury (ICASSP Workshops 2024)
fbk-mt.bsky.social
Our pick of the week by @beomseok-lee.bsky.social: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, Yingzhi Wang, Mirco Ravanelli, and Cem Subakan (2025)

arxiv.org/abs/2505.19937

#SLU #speech #multimodal #LLM
fbk-mt.bsky.social
Our pick of the week by @apierg.bsky.social: "Agree to Disagree? A Meta-Evaluation of LLM Misgendering" by Arjun Subramonian, @dippedrusk.com, Preethi Seshadri, Dietrich Klakow, Kai-Wei Chang, and Yizhou Sun

#LLM #NLProc #fairness
apierg.bsky.social
Super interesting paper by Subramonian et al: "Agree to Disagree? A Meta-Evaluation of LLM Misgendering" arxiv.org/abs/2504.17075
Turns out, misgendering is messier than just pronouns. I'd love to see this analysis extended to grammatical gender languages! #LLM #AI #ethics @fbk-mt.bsky.social
Agree to Disagree? A Meta-Evaluation of LLM Misgendering
Numerous methods have been proposed to measure LLM misgendering, including probability-based evaluations (e.g., automatically with templatic sentences) and generation-based evaluations (e.g., with automatic heuristics or human validation). However, it has gone unexamined whether these evaluation methods have convergent validity, that is, whether their results align. Therefore, we conduct a systematic meta-evaluation of these methods across three existing datasets for LLM misgendering. We propose a method to transform each dataset to enable parallel probability- and generation-based evaluation. Then, by automatically evaluating a suite of 6 models from 3 families, we find that these methods can disagree with each other at the instance, dataset, and model levels, conflicting on 20.2% of evaluation instances. Finally, with a human evaluation of 2400 LLM generations, we show that misgendering behaviour is complex and goes far beyond pronouns, which automatic evaluations are not currently designed to capture, suggesting essential disagreement with human evaluations. Based on our findings, we provide recommendations for future evaluations of LLM misgendering. Our results are also more widely relevant, as they call into question broader methodological conventions in LLM evaluation, which often assume that different evaluation methods agree.
arxiv.org
Reposted by MT Group at FBK
bsavoldi.bsky.social
🔍 Stiamo studiando come l'AI viene usata in Italia e per farlo abbiamo costruito un sondaggio!

👉 bit.ly/sondaggio_ai...

(è anonimo, richiede ~10 minuti, e se partecipi o lo fai girare ci aiuti un sacco🙏)

Ci interessa anche raggiungere persone che non si occupano e non sono esperte di AI!
Qualtrics Survey | Qualtrics Experience Management
The most powerful, simple and trusted way to gather experience data. Start your journey to experience management and try a free account today.
bit.ly
Reposted by MT Group at FBK
sarapapi.bsky.social
🚀 New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in 🇬🇧 English and 🇮🇹 Italian.

The models are live and ready to try on @hf.co:
🔗 huggingface.co/collections/...

📄 Preprint: arxiv.org/abs/2505.22759

#ASR #ST #OpenScience #MultilingualAI
FAMA - a FBK-MT Collection
The First Large-Scale Open-Science Speech Foundation Model for English and Italian
huggingface.co