Wessel Poelman
wpoelman.bsky.social
Working with languages @ KU Leuven
New EACL paper (with @mdlhx.bsky.social)! We tested whether comparing perplexity of parallel data across languages is fair. Turns out: it depends. We show that the choice of test set (even with consistent meaning) can flip conclusions about which language is easier to model.

Paper: arxiv.org/abs/2601.10580
Form and Meaning in Intrinsic Multilingual Evaluations
January 28, 2026 at 1:25 PM
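The comparison the post describes can be sketched in a few lines. This is a minimal, hypothetical illustration (not the paper's actual setup): bits-per-character normalizes a model's total negative log-probability by character count, and the invented numbers below show how two parallel test sets could rank the same two languages differently.

```python
import math

def bits_per_char(token_logprobs, text):
    """Bits-per-character: total negative log-probability (converted
    from nats to bits) divided by the number of characters in the text."""
    total_bits = -sum(lp / math.log(2) for lp in token_logprobs)
    return total_bits / len(text)

# Sanity check: a log-prob of -ln(2) per token is exactly 1 bit per token;
# 4 tokens over a 4-character string gives 1.0 bits per character.
assert bits_per_char([-math.log(2)] * 4, "abcd") == 1.0

# Hypothetical per-token log-probs (in nats) for parallel sentences in
# two languages, A and B, on two different parallel test sets.
set1_a = bits_per_char([-2.0, -2.5, -2.1], "the cat sat")        # language A
set1_b = bits_per_char([-3.0, -3.5, -3.2], "die katze sass")     # language B
set2_a = bits_per_char([-6.0, -5.5], "encyclopaedia entry")      # language A
set2_b = bits_per_char([-5.0], "enzyklopaedieeintrag")           # language B

# With these made-up numbers, test set 1 ranks A as "easier",
# while test set 2 ranks B as "easier" -- the conclusion flips.
print(set1_a < set1_b, set2_a < set2_b)
```

The point of the sketch: both test sets are "parallel" in meaning, yet the per-character normalization interacts with tokenization and text length, so the ranking between languages is not stable across test sets.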