LAGoM NLP
@lagom-nlp.bsky.social
520 followers 170 following 42 posts
We are the Leuven AI Group of Multilingual NLP (LAGoM NLP), a research lab in the Department of Computer Science at KU Leuven, led by @mdlhx
lagom-nlp.bsky.social
Ok, added the ones that were missing from yours to ours
lagom-nlp.bsky.social
You're included in the NLP labs starter pack, see go.bsky.app/LKGekew
Reposted by LAGoM NLP
mdlhx.bsky.social
Reminder, a few more days to apply!
mdlhx.bsky.social
Interested in multilingual tokenization in #NLP? Lisa Beinborn and I are hiring!

PhD candidate position in Göttingen, Germany: www.uni-goettingen.de/de/644546.ht...

PostDoc position in Leuven, Belgium:
www.kuleuven.be/personeel/jo...

Deadline 6th of June
Reposted by LAGoM NLP
clin35-2025.bsky.social
📅 Don't forget! The deadline for submitting your abstract to the #CLIN conference in Leuven is coming: 13th of June! Submitting is easy: name, title of your work, 500-word abstract, done! #nlp #nlproc #compling #llm #ai #dutch clin35.ccl.kuleuven.be
CLIN35
Computational Linguistics in The Netherlands (CLIN) is a yearly conference on computational linguistics. Each year the conference is organized by a different institution in the Dutch-speaking region. ...
clin35.ccl.kuleuven.be
lagom-nlp.bsky.social
We are hiring in #nlproc!!
Reposted by LAGoM NLP
marcel.bollmann.me
I’m looking for a postdoc, to start ideally ASAP!

The work would be in the EU-funded TrustLLM project, focusing on modularisation and language adaptation of LLMs, tokenization, and evaluation benchmarks for multilingual LLMs. The position would be full-time for 2 years with no teaching obligation.
lagom-nlp.bsky.social
We look at the role of English in this evaluation: it can be used, and often is, as an interface to boost task performance, or as a natural language for evaluating language understanding. We recommend moving away from task performance as the main goal and focusing on language understanding instead.
Reposted by LAGoM NLP
milanlp.bsky.social
🚨 New Account Alert! This is the official account of the *MilaNLP group*. We had to recreate it because it was not indexed.

If you were following us before, please follow us again. If not, now’s the perfect time to start!
Reposted by LAGoM NLP
juand-r.bsky.social
There are too many starter packs.
👇 Here's a list, mostly for NLP, ML, and related areas.
NLP grad students
go.bsky.app
lagom-nlp.bsky.social
Moreover, we advocate for a shift in perspective from seeking a general definition of data quality towards a more language- and task-specific one. Ultimately, we aim for this study to serve as a guide to using Wikipedia for pretraining in a multilingual setting.
lagom-nlp.bsky.social
We evaluate the downstream impact of quality filtering on Wikipedia by training tiny monolingual models on each Wikipedia. We find that data quality pruning is an effective means for resource-efficient training without hurting performance, especially for low-resource languages (LRLs).
lagom-nlp.bsky.social
We subject non-English Wikipedias to common quality filtering techniques such as script filtering, MinHash, and heuristic filtering. These reveal widespread issues, including a high percentage of one-line articles and duplicate articles.
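
For readers unfamiliar with the three filter families named in this post, here is a minimal, standard-library-only sketch of how they could fit together. It is not the pipeline from the paper: every function name, the 0.5 script-ratio cutoff, the 0.8 duplicate threshold, the word 3-gram shingles, and the 64 hash functions are assumptions chosen for illustration.

```python
import hashlib
import unicodedata


def script_ratio(text, script_prefix):
    """Fraction of alphabetic characters whose Unicode name starts with script_prefix."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    hits = sum(1 for ch in letters
               if unicodedata.name(ch, "").startswith(script_prefix))
    return hits / len(letters)


def is_one_liner(text):
    """Heuristic filter: True if the article has at most one non-empty line."""
    return len([ln for ln in text.splitlines() if ln.strip()]) <= 1


def shingles(text, n=3):
    """Set of lowercased word n-grams (the whole text if it is shorter than n words)."""
    words = text.lower().split()
    if len(words) < n:
        return {" ".join(words)}
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}


def minhash_signature(text, num_perm=64):
    """MinHash signature: for each salted hash function, the minimum hash over all shingles."""
    items = [s.encode("utf-8") for s in shingles(text)]
    signature = []
    for seed in range(num_perm):
        salt = seed.to_bytes(8, "little")  # a distinct salt acts as a distinct hash function
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(item, digest_size=8, salt=salt).digest(), "big")
            for item in items))
    return signature


def estimated_jaccard(sig_a, sig_b):
    """Share of matching signature slots, an estimate of Jaccard similarity."""
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / len(sig_a)


def filter_articles(articles, script_prefix, min_script_ratio=0.5, dup_threshold=0.8):
    """Keep articles that pass the heuristic, script, and near-duplicate checks (assumed thresholds)."""
    kept, kept_signatures = [], []
    for text in articles:
        if is_one_liner(text):
            continue  # heuristic filtering
        if script_ratio(text, script_prefix) < min_script_ratio:
            continue  # script filtering
        sig = minhash_signature(text)
        if any(estimated_jaccard(sig, other) >= dup_threshold
               for other in kept_signatures):
            continue  # near-duplicate of an article already kept
        kept.append(text)
        kept_signatures.append(sig)
    return kept
```

Calling `filter_articles(texts, "LATIN")` would keep multi-line, mostly Latin-script articles and drop later near-duplicates. The pairwise signature comparison here is quadratic in the number of kept articles; at Wikipedia scale one would more plausibly bucket signatures with locality-sensitive hashing.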
lagom-nlp.bsky.social
In this paper we critically examine the notion of Wikipedia as a 'high quality' resource, particularly in the pretraining setting.
lagom-nlp.bsky.social
It's still not working somehow. If I search for your handle in the search bar, your profile doesn't show up. I don't know if this is a bug or some setting on your side that's not set correctly.
lagom-nlp.bsky.social
I just tried to add you to the list and somehow couldn't find you. I suspect this might just be too soon after the account creation. I will try again later, possibly tomorrow.
lagom-nlp.bsky.social
We're slowly growing, tell your friends!! #nlp
lagom-nlp.bsky.social
The NLP labs starter pack is here! go.bsky.app/LKGekew Let us know if you want to be added!