Anna Seo Gyeong Choi
@annaseogyeongchoi.bsky.social
phd @ cornell information science
Dialect speakers already face real quality-of-service harms when distinctive grammatical structures systematically diverge from what LLMs were trained on. Pinpointing specific grammar rules gives us concrete targets for bias mitigation that can transfer across multiple dialects! 🎯
We can decompose performance degradation by individual grammar rules.
Three rules – existential “it”, zero copula, and y’all – account for roughly half of a dialect’s accuracy drop relative to Standard American English (SAE).
Example:
SAE: “Can you drive with a beer in Texas?” → Correct Answer: No
Dialect: “Can y’all drive with a beer in Texas?” → GPT-4o-mini Answer: Yes
Same meaning. Different grammar. Different results.
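To make the idea of a per-rule decomposition concrete, here is a minimal sketch. It is not the paper's method: the `rule_shares` function, the record format, and the equal-split attribution are all assumptions made for illustration. Each question the model gets right in SAE but wrong in the dialect counts as one unit of lost accuracy, split evenly across the grammar rules applied to that question.

```python
from collections import defaultdict

def rule_shares(records):
    """Hypothetical helper: share of the accuracy drop attributed to each rule.

    records: iterable of (rules, sae_correct, dialect_correct), where `rules`
    is the set of grammar rules applied when rewriting the question.
    Assumption: a lost answer is split equally among its rules.
    """
    loss = defaultdict(float)
    for rules, sae_ok, dialect_ok in records:
        # Only count questions that flip from correct (SAE) to wrong (dialect).
        if sae_ok and not dialect_ok and rules:
            for rule in rules:
                loss[rule] += 1.0 / len(rules)
    total = sum(loss.values())
    return {rule: value / total for rule, value in loss.items()} if total else {}

# Toy data, invented purely for illustration.
records = [
    ({"yall"}, True, False),
    ({"zero_copula", "existential_it"}, True, False),
    ({"yall"}, True, True),
    ({"other"}, False, False),
]
shares = rule_shares(records)
```

Equal splitting is the simplest choice; a regression over rule indicators would be another option when several rules co-occur in one question.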
We used the Multi-VALUE package to transform Standard American English questions from QA datasets into dialectal variants based on grammatical rules.
We studied 6 English dialects (African American, Appalachian, Chicano, Indian, Singaporean, Southern) across 3 LLMs using 3 multiple-choice QA benchmarks.
The question: Do dialects affect performance even on easy tasks?
Answer: YES, with worst performance on Singaporean English.
🧵Excited to present our work at #EMNLP2025 “Analyzing Dialectal Biases in LLMs for Knowledge and Reasoning Benchmarks”!
Paper 📄 arxiv.org/abs/2510.00962
w/ Eileen Pan, Skyler Seto, @allisonkoe.bsky.social @maartjeterhoeve.bsky.social