Raoyuan Zhao
@raoyuan.bsky.social
PhD student in NLP @MaiNLPlab, @CIS, @LMU
Reposted by Raoyuan Zhao
📝 What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns
👥 @mhedderich.bsky.social Anyi Wang @raoyuan.bsky.social @florian-eichin.com Jonas Fischer @barbaraplank.bsky.social
🔗 arxiv.org/abs/2504.158...
📁Main - Long
July 23, 2025 at 12:30 PM
Reposted by Raoyuan Zhao
What changes if you take the LLM prompt “Tell me a short story about Dr. Li” and replace “Dr. Li” with “Dr. Smith”?

Would you have guessed that this introduces a massive gender bias, from ca. half/half to 99% male doctors?

In our #ACL2025 paper we present the Spotlight framework which...
July 11, 2025 at 10:43 AM
Reposted by Raoyuan Zhao
Caught some great moments at #MCML Munich AI Day 2025 last week📍
From sharp keynotes to poster debates. Our team had the chance to show some recent work, join the conversations, and bring back plenty of food for thought🧠🗣️📊
Last week, #MCML Munich AI Day 2025 kicked off with keynotes by Julia Schnabel and Tina Eliassi-Rad, brilliantly moderated by Eva Schulz.
July 9, 2025 at 8:14 AM
Reposted by Raoyuan Zhao
Does your Bavarian go beyond "Servus" and "Pfiade"? Then you are exactly who we are looking for!
We are looking for Bavarian speakers who would like to answer a short survey about AI-generated Bavarian for a master's thesis.
With every participation, we bring the Bavarian dialect a little further into the digital world!
Bavarian dialect speakers needed! Our MSc student Miriam wants to find out 1. how good/bad LLM-generated "Bavarian" is, and 2. whether dialect speakers agree with each other on this. The survey takes <5 min: survey.ifkw.lmu.de/dialquali25/ Thank you for sharing/participating!
June 4, 2025 at 2:15 PM
Reposted by Raoyuan Zhao
Want to know if your prompting is also affected by this? To address this and other issues systematically, we proposed Spotlight, which uses data mining to uncover the effects of prompt and model changes (meet us at ACL to discuss).
arxiv.org/abs/2504.15815
May 30, 2025 at 2:57 PM