Bastian Bunzeck
banner
bbunzeck.bsky.social
Bastian Bunzeck
@bbunzeck.bsky.social
Computational linguist trying to understand how humans and computers learn and use language 👶🧠🗣️🖥️💬

The work is mysterious and important. See https://bbunzeck.github.io

PhDing at @clausebielefeld.bsky.social
Reposted by Bastian Bunzeck
🚨NEW PUBLICATION ALERT!🚨
The 'Design Features' of Language Revisited (w/ @mperlman.bsky.social @glupyan.bsky.social Koen de Reus & @limorraviv.bsky.social)
Feature Review out now in #OpenAccess in @cp-trendscognsci.bsky.social! #language #linguistics
Paper: doi.org/10.1016/j.ti...
November 25, 2025 at 7:49 PM
Reposted by Bastian Bunzeck
We are advertising **11 new PhD positions** in the second cohort of our RTG on Curiosity (details on all 11 positions here: www.uni-goettingen.de/de/open+posi...). One of these positions is in my group looking at the role of curiosity in early word learning (www.uni-goettingen.de/en/644546.ht...)
Open Positions - Georg-August-Universität Göttingen
Webseiten der Georg-August-Universität Göttingen
www.uni-goettingen.de
November 25, 2025 at 1:32 PM
Reposted by Bastian Bunzeck
🚀 Introducing TMLR Beyond PDF!

🎬 This is a new, HTML-based submission format for TMLR, that supports interactive figures and videos, along with the usual LaTeX and images.

🎉 Thanks to TMLR Editors in Chief: Hugo Larochelle, @gautamkamath.com, Naila Murray, Nihar B. Shah, and Laurent Charlin!
November 25, 2025 at 4:12 PM
Reposted by Bastian Bunzeck
We began day 2 of our Large Language Models (LLM) for linguistics research workshop @UniKoeln with a fascinating keynote by Charlotte Pouw on "Interpreting models for speech generation and understanding using methods from #psycholinguistics". Charlotte shared […]

[Original post on fediscience.org]
November 25, 2025 at 8:54 AM
Reposted by Bastian Bunzeck
Tomorrow, we will show within the science festival #geniale how AI voice modification can help explaining the subtle differences between voice qualities, expressing personality, age, mood, gender, health, and much more! #trr318, #bielefeld #tts #xAI wissenswerkstadt.de/veranstaltun...
Sag was! | Wissenswerkstadt Bielefeld
Mit Hilfe von KI Stimmeigenschaften erklärbarer machen
wissenswerkstadt.de
November 21, 2025 at 1:15 PM
Reposted by Bastian Bunzeck
Interested in the evolution of human language? Check out our new paper in @science.org where we synthesize latest findings and outline a multifaceted, bio-cultural approach for studying how language evolved. Super proud of this work, and hoping it leads to exciting new research! tinyurl.com/ykacvanp
November 21, 2025 at 9:47 AM
Reposted by Bastian Bunzeck
📢Out now in NEJLT!📢

In each of these sentences, a verb that doesn't usually encode motion is being used to convey that an object is moving to a destination.

Given that these usages are rare, complex, and creative, we ask:

Do LLMs understand what's going on in them?

🧵1/7
November 19, 2025 at 1:57 PM
Reposted by Bastian Bunzeck
If a language model could dynamically optimise subword tokenisation, how would its subwords evolve during training? In our new paper we study the learning dynamics of subword segmentation:
arxiv.org/pdf/2511.09197
November 19, 2025 at 9:55 AM
Reposted by Bastian Bunzeck
Am I evil? Am I likeable?

Need a 10 minutes break? Like Fantasy? Loath it? Take part in our study and help us by rating images of fictional characters here:
bixprag.lili.uni-bielefeld.de/publix/0aSWK...
November 19, 2025 at 10:25 AM
Reposted by Bastian Bunzeck
Coming soon - Turner & Hoffmann on Creative Construction Grammar. In this, we argue that the domain-general process of Conceptual Blending is the cognitive operation that combines constructions. BTW this will be published open access!

www.cambridge.org/core/element...
November 17, 2025 at 10:45 AM
Reposted by Bastian Bunzeck
#CfP alert! 🚨 The call for papers and theme sessions for the next International Conference of the German Cognitive Linguistics Conference (Bielefeld, 31.08.-02.09.2026) is finally out 🎉 www.uni-bielefeld.de/fakultaeten/...
www.uni-bielefeld.de
November 15, 2025 at 3:27 PM
Reposted by Bastian Bunzeck
Less than 3 weeks to submit an abstract for ICCG14! please pass it on: iccg14.oa-event.com June 4-7, 2026
Home
iccg14.oa-event.com
November 11, 2025 at 2:17 PM
Reposted by Bastian Bunzeck
Our panel moderated by @danaarad.bsky.social
"Evaluating Interpretability Methods: Challenges and Future Directions" just started! 🎉 Come to learn more about the MIB benchmark and hear the takes of @michaelwhanna.bsky.social, Michal Golovanevsky, Nicolò Brunello and Mingyang Wang!
November 9, 2025 at 6:55 AM
Reposted by Bastian Bunzeck
#EMNLP2026 will be in Budapest 🇭🇺 24-29/October/2026 (earlier than ever?) #EMNLP2025 #nlp #nlproc
November 7, 2025 at 9:30 AM
Reposted by Bastian Bunzeck
I'm in Suzhou to present our work on MultiBLiMP, Friday @ 11:45 in the Multilinguality session (A301)!

Come check it out if your interested in multilingual linguistic evaluation of LLMs (there will be parse trees on the slides! There's still use for syntactic structure!)

arxiv.org/abs/2504.02768
November 6, 2025 at 7:08 AM
Reposted by Bastian Bunzeck
One of the great mysteries of #language is how it finds a balance between robust stability and endless flexibility. I believe this requires us to rethink #linguistic structures. In this article, I propose dynamic #tensegrity as a novel architectural metaphor
aclanthology.org/2025.cxgsnlp...
aclanthology.org
November 4, 2025 at 2:08 PM
As part of this year's BabyLM challenge, we (researchers from @gronlp.bsky.social and @clausebielefeld.bsky.social diverged from established pretraining paradigm by training only on dialogue data from CHILDES.
October 28, 2025 at 12:53 PM
Reposted by Bastian Bunzeck
With only a week left for #EMNLP2025, we are happy to announce all the works we 🐮 will present 🥳 - come and say "hi" to our posters and presentations during the Main and the co-located events (*SEM and workshops) See you in Suzhou ✈️
October 27, 2025 at 11:54 AM
Reposted by Bastian Bunzeck
"The capacity for language exists along a continuum [...]. The idea that language development does not require uniquely human properties becomes increasingly important as legal boundaries expand to include nonhuman species."
October 23, 2025 at 8:49 PM
Reposted by Bastian Bunzeck
🌍Introducing BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data!

LLMs learn from vastly more data than humans ever experience. BabyLM challenges this paradigm by focusing on developmentally plausible data

We extend this effort to 45 new languages!
October 15, 2025 at 10:53 AM
Preprint alert! We release BabyBabelLM, a multilingual benchmark of developmentally plausible training data. I was responsible for German and Polish data as well as various child-directed wikis. Immensely rewarding project with exceptionally cool co-authors. 🥳🚀
𝐃𝐨 𝐲𝐨𝐮 𝐫𝐞𝐚𝐥𝐥𝐲 𝐰𝐚𝐧𝐭 𝐭𝐨 𝐬𝐞𝐞 𝐰𝐡𝐚𝐭 𝐦𝐮𝐥𝐭𝐢𝐥𝐢𝐧𝐠𝐮𝐚𝐥 𝐞𝐟𝐟𝐨𝐫𝐭 𝐥𝐨𝐨𝐤𝐬 𝐥𝐢𝐤𝐞? 🇨🇳🇮🇩🇸🇪

Here’s the proof! 𝐁𝐚𝐛𝐲𝐁𝐚𝐛𝐞𝐥𝐋𝐌 is the first Multilingual Benchmark of Developmentally Plausible Training Data available for 45 languages to the NLP community 🎉

arxiv.org/abs/2510.10159
October 14, 2025 at 5:19 PM
Reposted by Bastian Bunzeck
Keynote at #COLM2025: Nicholas Carlini from Anthropic

"Are language models worth it?"

Explains that the prior decade of his work on adversarial images, while it taught us a lot, isn't very applied; it's unlikely anyone is actually altering images of cats in scary ways.
October 9, 2025 at 1:12 PM
Reposted by Bastian Bunzeck
i wrote a custom llm sampler for llama-3.1-8b so it could only say words that are in the bible
October 7, 2025 at 4:35 AM
Reposted by Bastian Bunzeck
Huge congrats to the envisionBOX team for the Open Science award nomination! 🎉

My tutorial on speech analysis tools in Python from the Unboxing Multimodality summer school (github.com/mdhk/unboxin...) is now also available at envisionbox.org

Thanks for the invitation & this great initiative! 👏
October 2, 2025 at 5:18 PM
Reposted by Bastian Bunzeck
Gentle reminder that the #CfP for #Evolang2026 @evolangconf.bsky.social is still open - deadline October 26! sites.google.com/york.ac.uk/e...
EVOLANG 2026 - Call for Papers
sites.google.com
October 2, 2025 at 11:32 AM