Institute of Formal and Applied Linguistics
@ufal.mff.cuni.cz
590 followers 67 following 130 posts
Computational linguistics • Natural language processing • Formal linguistics • Machine translation | at Faculty of Mathematics and Physics, Charles University
Posts Media Videos Starter Packs
ufal.mff.cuni.cz
🚀 PROJECT LAUNCH: Infoveillance is Live! Our AI tool monitors digital media to detect misinformation and enhance public trust/literacy. Fighting infodemics & polarization.

[https://ufal.mff.cuni.cz/grants/infoveillance]
#Infoveillance #AI #Misinformation #PublicTrust #UFAL
ufal.mff.cuni.cz
The tools developed by Pavel and Jan are used by several oral history collections around the world, including the USC Holocaust testimonies available and the Center for Visual History Malach at UFAL. Their general ASR tools are also available at lindat.mff.cuni.cz/services/uwe....
UWebASR - University of West Bohemia : Automatic Speech Recognition Service
lindat.mff.cuni.cz
Reposted by Institute of Formal and Applied Linguistics
lindatclariahcz.bsky.social
Huge win! 🎉 The CLARIN Steven Krauwer Prize 2025 goes to LINDAT/CLARIAH-CZ partners Pavel Ircing & Jan Švec (UWB). They won for their ASR tools for complex Oral History recordings. Their work is vital for global digital humanities!

#CLARIN #LINDAT #StevenKrauwerAward #ASR #OralHistory #DG
ufal.mff.cuni.cz
Letní školu vedli: Ondřej Bojar, Zdeněk Kasner, Tomáš Polák, Dominik Macháček, Miroslav Hrabal a Josef Jon.
ufal.mff.cuni.cz
Šest kolegů vedlo pro DGT (Evropské ředitelství pro překlady) třídenní letní školu v Lucemburku. Učili 40+ pracovníků DGT nejnovější metody strojové podpory překladu a zajištění kvality. Cíl? Zefektivnit překlad legislativy EU do všech členských jazyků!
#DGT #UFAL #StrojovyPreklad #AI #EUTools
ufal.mff.cuni.cz
In his speech titled "Indirect Objects across Languages: A Trap in Universal Dependencies?" he discussed the challenges of the UD framework in relation to traditional language descriptions. He highlighted the ambiguities in UD guidelines and their impact on annotation practices.
ufal.mff.cuni.cz
@Dan Zeman has been invited as a keynote speaker at the ICLC 11 conference! iclc11.ff.cuni.cz/keynote-spea...

#UFAL #ICLC11 #UniversalDependencies #CharlesUniversity #Prague
Dan Zeman´s talk ICLC conference lecture room
ufal.mff.cuni.cz
ufal.mff.cuni.cz/grants/human...
Projekt bude zkoumat, jak mohou velké jazykové modely a další technologie umělé inteligence přispět k demokratickému dialogu, vzdělávání a komunikaci mezi lidmi.

Období realizace: 1. 3.2025 – 31. 12.2028. Financování: OP JAK: Společenské a humanitní vědy 2/2
ufal.mff.cuni.cz
Nahlédněte na kick‐off meeting projektu ✨HumanAId: AI zaměřená na člověka pro udržitelnou a adaptabilní společnost✨.

Projekt se silnou účastí: vede ho FFUK ve spolupráci s MFF UK, FSV UK, PF UP v Olomouci, FÚ AV ČR, prg.ai a Kampusem Hybernská.

#prgAI #HumanAId #OPJAK
1/2
HumanAID meeting
ufal.mff.cuni.cz
And another successfully defended thesis: 👉Dr.👈 Kira Droganova defended her thesis: Dependency Parsing beyond Simple Trees, which focused on enriching syntactic parsing with deeper semantic layers to better capture meaning across languages. Congratulations 🥳
ufal.mff.cuni.cz
🎉 Congratulations to 👉Dr.👈 Tomáš Musil on successfully defending his PhD thesis! 🍻 His talk explored #LLMs, theories of meaning, and their role in LLM #interpretability, highlighting unsupervised discovery of binary semantic features via ICA and the word intruder test.
ufal.mff.cuni.cz
The discussions from the LINDAT/CLARIAH-CZ Advisory Board meeting, particularly on long-term strategy and the upcoming Evaluation 2025, are crucial for our shared mission. We were happy to host the meeting!
ufal.mff.cuni.cz
Workshop "Regulace, AI a advokacie – zákulisí legislativy a advokátních inovací" představil OpenEuroLLM jako naději pro evropskou digitální suverenitu a nutnost pro konkurenceschopnost Evropy. Jan Hajič zdůraznil, že Česko se snaží o snižování byrokracie v oblasti AI.

#AI #AIregulation #FutureOfLaw
ufal.mff.cuni.cz
Researchers' Night with @informatfyz.cuni.cz!
You can come to a live podcast recording and try out a real-time automatic interpreting system ELITR. The event is on September 26th.

🔗 czechia.representation.ec.europa.eu/evropsky-den...

#ELITR #AI #Interpreting #MachineTranslation #LanguageTech
ufal.mff.cuni.cz
Gold Data and Multiple Understanding of Discourse Relations
by Š. Zikánová, A. Nedoluzhko, J. Mírovský & E. Hajičová
TL;DR: Investigate how annotators interpret discourse relations differently, revealing important insights about subjectivity in linguistic annotation and its impact on NLP systems.
ufal.mff.cuni.cz
Morphological Segmentation with Neural Networks: Performance Effects of Architecture, Data Size, and Cross-Lingual Transfer in Seven Languages
by M. Olbrich & Z. Zabokrtsky
TL;DR: Analyzed neural architectures, data size, and cross-lingual transfer for morphological segmentation for 7 languages.
ufal.mff.cuni.cz
Flexing in 73 Languages: A Single Small Model for Multilingual Inflection
by Tomáš Sourada & Jana Straková
TL;DR: Compact neural model successfully handles morphological inflection across 73 diverse languages, proving that small can be mighty in multilingual NLP.
ufal.mff.cuni.cz
Refining Czech GEC: Insights from a Multi-Experiment Approach
by P. Pechman, @straka-milan.bsky.social , @janastrakova.bsky.social , J. Náplava
TL;DR: Better Czech grammatical error correction systems + insights for better automated writing assistance in Czech arxiv.org/abs/2506.22402
ufal.mff.cuni.cz
Investigating the Effect of Parallel Data in the Cross-Lingual Transfer for Vision-Language Encoders
by @andrei-a-manea.bsky.social & @jlibovicky.bsky.social
TL;DR: Explore how parallel datasets improve cross-lingual transfer in vision-language models. arxiv.org/abs/2504.21681
ufal.mff.cuni.cz
ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata
by M. Kopp, V. Stankov, J. O. Krůza, . Straňák & . Bojar
TL;DR: Czech parliamentary speeches from 2013-2021 with rich metadata incl. speaker identities, political affiliations, and automatic linguistic annotations in TEI format.
ufal.mff.cuni.cz
Automated Speaking Assessment for L2 Learners of Czech by Peter Polák, Michal Novák, Kateřina Rysová, Magdaléna Rysová & Ondřej Bojar
TL;DR: An automated system to evaluate the Czech speaking skills of second language learners, making language assessment more accessible and consistent.
ufal.mff.cuni.cz
Great week at #TSD2025 in Erlangen! Our @ufal-cuni.bsky.social team presented 7 papers covering various topics from Czech speech assessment to multilingual morphology. Thanks to all attendees who engaged with our work! 🧵👇
#NLP #ComputationalLinguistics #CzechNLP #MachineLearning
ufal.mff.cuni.cz
Tomorrow, Thu Aug 28, 14-17, join AutoMin 2025 fully remote session to learn about automatic summarization of meetings. Keynote talks by our colleagues from Zoom and Translated.net
Link+Schedule: ufal.github.io/automin-2025
Proudly co-organized by Ondřej Bojar and folks from @ufal-cuni.bsky.social
AutoMin 2025 | Home
Shared Task on Automatic Minuting
ufal.github.io