Bastian Bunzeck
@bbunzeck.bsky.social
340 followers 770 following 90 posts
Computational linguist trying to understand how humans and computers learn and use language 👶🧠🗣️🖥️💬 PhD @clausebielefeld.bsky.social, Bielefeld University https://bbunzeck.github.io
Posts Media Videos Starter Packs
Reposted by Bastian Bunzeck
mariaa.bsky.social
Keynote at #COLM2025: Nicholas Carlini from Anthropic

"Are language models worth it?"

Explains that the prior decade of his work on adversarial images, while it taught us a lot, isn't very applied; it's unlikely anyone is actually altering images of cats in scary ways.
Reposted by Bastian Bunzeck
vgel.me
i wrote a custom llm sampler for llama-3.1-8b so it could only say words that are in the bible
Reposted by Bastian Bunzeck
mdhk.net
Huge congrats to the envisionBOX team for the Open Science award nomination! 🎉

My tutorial on speech analysis tools in Python from the Unboxing Multimodality summer school (github.com/mdhk/unboxin...) is now also available at envisionbox.org

Thanks for the invitation & this great initiative! 👏
Reposted by Bastian Bunzeck
Reposted by Bastian Bunzeck
amuuueller.bsky.social
What's the right unit of analysis for understanding LLM internals? We explore in our mech interp survey (a major update from our 2024 ms).

We’ve added more recent work and more immediately actionable directions for future work. Now published in Computational Linguistics!
Reposted by Bastian Bunzeck
gboleda.bsky.social
New paper! 🚨 I argue that LLMs represent a synthesis between distributed and symbolic approaches to language, because, when exposed to language, they develop highly symbolic representations and processing mechanisms in addition to distributed ones.
arxiv.org/abs/2502.11856
Sigmoid function. Non-linearities in neural network allow it to behave in distributed and near-symbolic fashions.
Reposted by Bastian Bunzeck
pranav-nlp.bsky.social
I'm conducting research on how ACL's peer review policies impact NLP research quality, career trajectories, and inclusivity within our community. I am running a survey, which would take around 7-10 mins to complete: forms.cloud.microsoft/e/j2jr9nH3X0

I would really appreciate insights from y'all!
I'm conducting research on how ACL's peer reviewing policies impact NLP research quality, career trajectories, and inclusivity within our community. Your insights—whether you're a seasoned reviewer, early-career researcher, or anywhere in between—are invaluable.
The survey takes 7-10 minutes and covers topics like review quality, reviewer assignment, and accessibility barriers. All responses are confidential and will help inform evidence-based improvements to our peer review processes.
Reposted by Bastian Bunzeck
a-lauscher.bsky.social
🚨 Are you looking for a PhD in #NLProc dealing with #LLMs?
🎉 Good news: I am hiring! 🎉
The position is part of the “Contested Climate Futures" project. 🌱🌍 You will focus on developing next-generation AI methods🤖 to analyze climate-related concepts in content—including texts, images, and videos.
Reposted by Bastian Bunzeck
simphon.bsky.social
Attending the The Second International Workshop on Construction Grammars and NLP (CxGs+NLP 2025) in Düsseldorf, Germany? Check out the poster “Do Construction Distributions Shape Formal Language Learning In German BabyLMs?” by Bastian Bunzeck and colleagues! @bbunzeck.bsky.social #CRC1646 #LINCC
bbunzeck.bsky.social
From conference to conference: September ends with a trip to #IWCS in beautiful Düsseldorf. Hyped for two days of semantics (and two more days of construction grammar and NLP). 🥳
Slides of the first keynote speaker.
Reposted by Bastian Bunzeck
stefanhartmann.bsky.social
The first of the three corpora of German-English bilingual children's early speech that we've been working on for the last few years is finally publicly available! 🥳 🎉 talkbank.org/childes/acce...
CHILDES English-German MPI-EVA-Leipzig Corpus
talkbank.org
Reposted by Bastian Bunzeck
simphon.bsky.social
“Developmentally plausible pretraining, now also auf Deutsch: a BabyLM Dataset for German” — Today I had the pleasure to present our German BabyLM dataset together with the first author Bastian Bunzeck @bbunzeck.bsky.social‬ to an interested and engaging audience at #KONVENS2025 in Hildesheim.
Daniel Duran and Bastian Bunzeck at the poster presentation
bbunzeck.bsky.social
Our BabyLMs at #konvens 🥳
clausebielefeld.bsky.social
Happening now: Sina‘s keynote on our BabyLM work. 🥳
bbunzeck.bsky.social
From conference to conference — after last week’s #semdial I am at #konvens in Hildesheim this week. I will be presenting out German BabyLM Corpus (with @simphon.bsky.social) and our PI Sina Zarrieß will give a Keynote on BabyLMs tomorrow. 🥳
Reposted by Bastian Bunzeck
adelegoldberg.bsky.social
Abstract deadline changed to *December 1, 2025*
adelegoldberg.bsky.social
UPDATE: Abstract deadline: Nov 1, 25! Invited speakers: Corrine Occhino, Dagmar Divjak, Idan Blank, Randy Allen Harris, Gary Lupyan, Laura Michaelis, Kanishka Misra !
@dagmardivjak.bsky.social @randyallenharris.bsky.social @congramqueen.bsky.social @glupyan.bsky.social @kanishka.bsky.social
adelegoldberg.bsky.social
📌 👉 The 14th International Construction Grammar conference will be held at Princeton, June 4-7, 2026

Usage-based analyses and Empirical methods

Stay tuned for updates!
Reposted by Bastian Bunzeck
simphon.bsky.social
It's a wrap! Thanks to the organizers, presenters and all participants for an inspiring and engaging #semdial2025 #bialogue conference at Bielefeld University! I had fun.
In case you missed it, proceedings can be found on the website:
semdial2025.github.io
Members of the Local Organizing Committee at the closing of SemDial 2025.
bbunzeck.bsky.social
🗣️🗣️🗣️❗️❗️❗️
Reposted by Bastian Bunzeck
simphon.bsky.social
“Child-directed speech is fine-tuned to children’s developmental needs” — @bbunzeck.bsky.social from the A02 project of #CRC1646 #LINCC, presented a great poster today at #semdial2025 #bialogue based on earlier work in collaboration with Holger Diessel.
Bastian Bunzeck in front of his poster "Child-directed speech is fine-tuned to children’s developmental needs (Bastian Bunzeck, Holger Diessel)"
bbunzeck.bsky.social
I will present a poster on the First Language article I wrote with Holger Diessel now at #semdial 😁💬
Reposted by Bastian Bunzeck
clausebielefeld.bsky.social
Leonie Schade asks whether it takes two to do an articulatory tango 😁
bbunzeck.bsky.social
Now coming up: session 1 on naturalistic dialogue 👌