Institute of Formal and Applied Linguistics
banner
ufal.mff.cuni.cz
Institute of Formal and Applied Linguistics
@ufal.mff.cuni.cz
Computational linguistics • Natural language processing • Formal linguistics • Machine translation | at Faculty of Mathematics and Physics, Charles University
🎧 Ondřej Bojar v novém díle podcastu Alma mater!

📺 Sledujte na YouTube (www.youtube.com/watch?v=I1vS...), poslouchejte na Spotify a v dalších podcastových aplikacích.
Alma Mater: Je neštěstí brát umělou inteligenci za kamaráda, říká doc. Ondřej Bojar z MFF UK
YouTube video by Univerzita Karlova
www.youtube.com
November 26, 2025 at 9:24 AM
Reposted by Institute of Formal and Applied Linguistics
🎮✨ Nový projekt od absolventky @mff.unikarlova.cuni.cz spojuje hudbu, emoce a umělou inteligenci!
Hra „Symphony of Adventure“ totiž místo obvyklých dotazníků sbírá data o emocích v hudbě hraním – a tím pomáhá trénovat AI.
November 17, 2025 at 11:04 AM
EMNLP 2025 is over... and Milan Straka is bringing home an award! 🏆
CorPipe triumphed in the prestigious CRAC25 Shared Task, focusing on multilingual coreference resolution.

Did Milan just CRACk it? We certainly think so! 😉

🔗 Find out more at arxiv.org/abs/2509.17858

#EMNLP2025 #CorPipe #CRAC25
November 11, 2025 at 1:49 PM
If you speak/know speakers of Piedmontese or Neapolitan 🇮🇹, check out @gianlucavico.bsky.social's project, which collects crowd-sourced translations to study how LLMs handle these under-resourced dialects. Anyone can participate! 🎯
We’re collecting crowd-sourced translations in Piedmontese and Neapolitan.
🎯 Goal: see how well LLMs understand these languages.
👉 Participate here (in IT🇮🇹):
- Piedmontese: quest.ms.mff.cuni.cz/crowd-transl...
- Neapolitan: quest.ms.mff.cuni.cz/crowd-transl...
Anyone can join, no need to be fluent!
Welcome to CrowdTranslation
quest.ms.mff.cuni.cz
November 10, 2025 at 2:34 PM
🗓️ Mark the dates!
🌉 #EMNLP2026 will be October 24-29th in Budapest! 🌉

Thanks all for a great conference, and see you at the next one!
November 10, 2025 at 1:04 PM
The EU's 🇪🇺 HPLT project, coordinated by @ufal.mff.cuni.cz is at #EMNLP2025! It has supported it as a silver sponsor, disseminating HPLT results from our booth and through several papers. We'll continue to shape the future of multilingual datasets and models here and in @openeurollm.bsky.social!
November 7, 2025 at 9:03 PM
Excited to share our work at #EMNLP2025! Our team is presenting 12 papers across the main conference and workshops, covering multilingual NLG, LLM agents, coreference resolution, and machine translation.
A thread with highlights 🧵👇
November 7, 2025 at 8:54 PM
Reposted by Institute of Formal and Applied Linguistics
With @andrei-a-manea.bsky.social, we posted a survey on multilingual vision-language models 👉 arxiv.org/pdf/2509.22123
We reviewed 31 models+21 benchmarks. There's a tension between language neutrality (same results across languages) & cultural awareness (context matters differently across cultures)
arxiv.org
October 21, 2025 at 1:30 PM
Zveme na dnešní přednášku Jazykovědného sdružení, kterou od 17:30 přednese prof. PhDr. Eva Hajičová, DrSc.

🔗 Můžete přijít osobně nebo sledovat na zoomu: lnkd.in/eQeST-uG

Téma přednášky: Aktuální členění v době paralelních korpusů

📸 Foto: Vladimír Šigut, UK
October 23, 2025 at 9:02 AM
🚀 PROJECT LAUNCH: Infoveillance is Live! Our AI tool monitors digital media to detect misinformation and enhance public trust/literacy. Fighting infodemics & polarization.

[https://ufal.mff.cuni.cz/grants/infoveillance]
#Infoveillance #AI #Misinformation #PublicTrust #UFAL
October 2, 2025 at 11:46 AM
Reposted by Institute of Formal and Applied Linguistics
Huge win! 🎉 The CLARIN Steven Krauwer Prize 2025 goes to LINDAT/CLARIAH-CZ partners Pavel Ircing & Jan Švec (UWB). They won for their ASR tools for complex Oral History recordings. Their work is vital for global digital humanities!

#CLARIN #LINDAT #StevenKrauwerAward #ASR #OralHistory #DG
October 1, 2025 at 8:39 AM
Šest kolegů vedlo pro DGT (Evropské ředitelství pro překlady) třídenní letní školu v Lucemburku. Učili 40+ pracovníků DGT nejnovější metody strojové podpory překladu a zajištění kvality. Cíl? Zefektivnit překlad legislativy EU do všech členských jazyků!
#DGT #UFAL #StrojovyPreklad #AI #EUTools
September 29, 2025 at 9:31 AM
@Dan Zeman has been invited as a keynote speaker at the ICLC 11 conference! iclc11.ff.cuni.cz/keynote-spea...

#UFAL #ICLC11 #UniversalDependencies #CharlesUniversity #Prague
September 24, 2025 at 7:48 AM
Nahlédněte na kick‐off meeting projektu ✨HumanAId: AI zaměřená na člověka pro udržitelnou a adaptabilní společnost✨.

Projekt se silnou účastí: vede ho FFUK ve spolupráci s MFF UK, FSV UK, PF UP v Olomouci, FÚ AV ČR, prg.ai a Kampusem Hybernská.

#prgAI #HumanAId #OPJAK
1/2
September 23, 2025 at 2:58 PM
And another successfully defended thesis: 👉Dr.👈 Kira Droganova defended her thesis: Dependency Parsing beyond Simple Trees, which focused on enriching syntactic parsing with deeper semantic layers to better capture meaning across languages. Congratulations 🥳
September 23, 2025 at 11:07 AM
🎉 Congratulations to 👉Dr.👈 Tomáš Musil on successfully defending his PhD thesis! 🍻 His talk explored #LLMs, theories of meaning, and their role in LLM #interpretability, highlighting unsupervised discovery of binary semantic features via ICA and the word intruder test.
September 22, 2025 at 10:09 AM
Workshop "Regulace, AI a advokacie – zákulisí legislativy a advokátních inovací" představil OpenEuroLLM jako naději pro evropskou digitální suverenitu a nutnost pro konkurenceschopnost Evropy. Jan Hajič zdůraznil, že Česko se snaží o snižování byrokracie v oblasti AI.

#AI #AIregulation #FutureOfLaw
September 19, 2025 at 2:45 PM
Researchers' Night with @informatfyz.cuni.cz!
You can come to a live podcast recording and try out a real-time automatic interpreting system ELITR. The event is on September 26th.

🔗 czechia.representation.ec.europa.eu/evropsky-den...

#ELITR #AI #Interpreting #MachineTranslation #LanguageTech
September 18, 2025 at 10:19 AM
Great week at #TSD2025 in Erlangen! Our @ufal-cuni.bsky.social team presented 7 papers covering various topics from Czech speech assessment to multilingual morphology. Thanks to all attendees who engaged with our work! 🧵👇
#NLP #ComputationalLinguistics #CzechNLP #MachineLearning
September 1, 2025 at 2:29 PM
Šimon Libřický defended his bachelor thesis with great success and co-authored an article accepted to @ISMIR 2025. His topic is the intersection between music and computers - a model of difficulty of saxophone scores. Inteview for @informatfyz.cuni.cz: www.mff.cuni.cz/en/public/ne....

#UFAL #MFFUK
Musician at Matfyz, computer scientist with a saxophone
Šimon Libřický, a student from the Prague Music Computing Group, has merged music with computer science to create a model for the difficulty of saxophone scores. His bachelor's thesis, which addresses...
www.mff.cuni.cz
September 1, 2025 at 7:12 AM
Tomorrow, Thu Aug 28, 14-17, join AutoMin 2025 fully remote session to learn about automatic summarization of meetings. Keynote talks by our colleagues from Zoom and Translated.net
Link+Schedule: ufal.github.io/automin-2025
Proudly co-organized by Ondřej Bojar and folks from @ufal-cuni.bsky.social
AutoMin 2025 | Home
Shared Task on Automatic Minuting
ufal.github.io
August 27, 2025 at 2:44 PM
Our team of 10 is at #MTMarathon2025 in Helsinki 🇫🇮, a week-long meeting of machine translation researchers, developers.
✅ Posters presented
✅ Now working on cool collaborative projects with researchers from around the world.
#MachineTranslation #NLP
August 26, 2025 at 8:05 AM
Our latest contribution to multilingual NLP evaluation 🚀 by @jlibovicky.bsky.social, @jindrahelcl.bsky.social, @andrei-a-manea.bsky.social & @gianlucavico.bsky.social
🧵 We're releasing CUS-QA - a new benchmark for testing LLMs on regional knowledge!
Find out what your model knows about Czechia 🇨🇿, Slovakia 🇸🇰, and Ukraine 🇺🇦!
👉 Textual and visual questions, answers, and human judgment on model outputs!
huggingface.co/datasets/ufa...
www.arxiv.org/abs/2507.22752
ufal/cus-qa · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
August 25, 2025 at 8:10 AM
We are hosting a summer school „Data Literacy with R for Students of Humanities“ at Malá Strana, August 4-15. ufal.mff.cuni.cz/events/summe...

#DataLiteracy #Humanities #Matfyz #UFAL #CharlesUniversity
August 7, 2025 at 1:11 PM
#ACL2025NLP continues with two days of workshop, and @ufal-cuni.bsky.social folks are there with more than a dozen posters 👇
July 31, 2025 at 1:30 PM