Zhihang Xie
@zhihangxie.bsky.social
16 followers 12 following 5 posts
Posts Media Videos Starter Packs
zhihangxie.bsky.social
🚀 SimulMEGA: MoE Routers as advanced policy makers for Simultaneous Speech Translation 🎧🌍
Mixture-of-Experts routing → smarter decisions on when & how to translate, balancing latency vs quality in real-time speech. Paper link at arxiv.org/pdf/2509.012...
arxiv.org
zhihangxie.bsky.social
🚀 Boost rare-phrase translation in speech! Uses **bilingual dictionaries** (e.g., "climate change"→"Klimawandel") to dynamically bias outputs.
✅ **+21%** recall in streaming ST
✅ **+85%** in multimodal LLMs
🔗: arxiv.org/abs/2506.09175
PHRASED: Phrase Dictionary Biasing for Speech Translation
Phrases are essential to understand the core concepts in conversations. However, due to their rare occurrence in training data, correct translation of phrases is challenging in speech translation task...
arxiv.org
Reposted by Zhihang Xie
bsavoldi.bsky.social
🔍 Stiamo studiando come l'AI viene usata in Italia e per farlo abbiamo costruito un sondaggio!

👉 bit.ly/sondaggio_ai...

(è anonimo, richiede ~10 minuti, e se partecipi o lo fai girare ci aiuti un sacco🙏)

Ci interessa anche raggiungere persone che non si occupano e non sono esperte di AI!
Qualtrics Survey | Qualtrics Experience Management
The most powerful, simple and trusted way to gather experience data. Start your journey to experience management and try a free account today.
bit.ly
Reposted by Zhihang Xie
fbk-mt.bsky.social
📢 Come and join our group!
We offer a fully funded 3-year PhD position:

📔 Automatic translation with large multimodal models: iecs.unitn.it/education/ad...

📍Full details for application: iecs.unitn.it/education/ad...

📅 Deadline May 12, 2025

#NLProc #FBK
Reserved topic scholarships | Doctoral Program - Information Engineering and Computer Science
iecs.unitn.it
zhihangxie.bsky.social
ReShape Attention bridges speech & text models without extra parameters. Achieves +8.5% BLEU in translation by leveraging acoustic cues, outperforming cascade/E2E methods. Efficient & scalable. Check the paper by Kano et al. (2025) at: ieeexplore.ieee.org/stamp/stamp.....
IEEE Xplore Full-Text PDF:
ieeexplore.ieee.org
zhihangxie.bsky.social
New research fuels the debate between cascaded and E2E speech translation! The challenge of error propagation is addressed by incorporating multiple ASR candidates, along with HuBERT features to preserve acoustic information lost after ASR. Check the paper by Min et al. at: arxiv.org/pdf/2502.00377.
arxiv.org