Emiel van Miltenburg
evanmiltenburg.bsky.social
Emiel van Miltenburg
@evanmiltenburg.bsky.social
Assistant professor in computational linguistics/nlp. Interested in research methodology, ethics, multimodality, and accessibility.
Some years ago, I bought Joseph Weizenbaum's book on Computer Power and Human Reason for a couple of dollars. Nice to have a classic work on computers and society.

Following the AI boom, the price of secondhand copies has skyrocketed: more than 100 dollars for a used copy!
September 24, 2025 at 8:54 AM
We just published a position paper on the evaluation of dialog systems, roughly asking:

1. Why don't we talk more about usage requirements for evaluation metrics?
2. Should we only adapt metrics to dialog, or can we also adapt dialog to suit metrics?

URL: aclanthology.org/2025.gem-1.18/

#NLProc
Measure only what is measurable: towards conversation requirements for evaluating task-oriented dialogue systems
Emiel Van Miltenburg, Anouck Braggaar, Emmelyn Croes, Florian Kunneman, Christine Liebrecht, Gabriella Martijn. Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²). 2025.
aclanthology.org
August 22, 2025 at 1:07 PM
I'm looking for studies on the mood/atmosphere/affective dimensions of photographs, but it's hard to find relevant work.

Any recommendations?
July 17, 2025 at 7:46 AM
Reposted by Emiel van Miltenburg
The first papers are already submitted, but for anyone hoping for an extra week to work on their paper, I have very good news!

The deadline for submitting to #INLG2025 has been extended until July 18th (end of the day anywhere on earth, AoE).

Submit your work on generation!

2025.inlgmeeting.org
INLG2025
The 18th International Natural Language Generation Conference is scheduled to be held in Hanoi, Vietnam from October 29 to November 2, 2025.
2025.inlgmeeting.org
July 14, 2025 at 9:39 AM
Does anyone know whether there are automatic solutions to detect either "ghost references" (academic sources that do not actually exist) or improper citations (misrepresenting claims in earlier work)?

Would be a very interesting #nlp #nlproc challenge, and useful to flag potential fraud.
June 26, 2025 at 11:05 AM
Just over a month left to submit to INLG2025, hosted in Hanoi, Vietnam!

INLG is the annual @siggen.bsky.social conference for research on any aspect of Natural Language Generation: (large) language models, rule-based generation, applications, human/automatic evaluation of output quality, etc.
June 1, 2025 at 10:48 PM
New open access textbook (in Dutch): tiu.trialanderror.org/projects/wer...

It's all about quantitative content analysis, and how to analyse existing data in a responsible manner.
Werkboek inhoudsanalyse | Open Press Tilburg University
Werkboek Inhoudsanalyse is een praktische handleiding om te leren hoe je op een betrouwbare en verantwoordelijke manier kunt analyseren hoe mensen met elkaar communiceren. Dit boek richt zich op het o...
tiu.trialanderror.org
May 18, 2025 at 9:14 PM
Curious new #NLProc/#NLP paper aiming to quantify the extent to which a language model can potentially generate all valid strings from a language without generating strings that do not belong to the language: arxiv.org/abs/2504.14370

Very similar to generative linguistics, but Chomsky is not cited.
Density Measures for Language Generation
The recent successes of large language models (LLMs) have led to a surge of theoretical research into language generation. A recent line of work proposes an abstract view, called language generation i...
arxiv.org
May 15, 2025 at 7:31 AM
Reposted by Emiel van Miltenburg
Today we strike! My first day off since 2017 - Thanks @eppobruins.bsky.social
April 10, 2025 at 7:12 AM
Reposted by Emiel van Miltenburg
Mijn opiniestuk over domme AI-hype-stukjes op (o.a.) televisie, perfect verbeeld door Klöppings verhaal over ChatGPT als psycholoog. Wat zulk commentaar dom maakt is niet eens zozeer een verkeerd begrip van de technologie zélf, maar totale onkunde over de echte wereld.

www.trouw.nl/opinie/opini...
April 7, 2025 at 8:08 PM
INLG2025 will be in Hanoi, Vietnam. If you work on Natural Language Generation #NLG, please consider submitting!

It's a really fun conference with attention for both #LLMs and more traditional natural language generation techniques.

More information via: 2025.inlgmeeting.org

#NLProc
INLG2025
The 18th International Natural Language Generation Conference is scheduled to be held in Hanoi, Vietnam from October 29 to November 2, 2025.
2025.inlgmeeting.org
April 4, 2025 at 1:10 PM
If you are organizing a workshop in the United States, please consider adding a statement on remote attendance.

Given the stories in the media about academia being under threat and grants being canceled based on their topic, it doesn't feel safe to go there if you work on those topics.
March 19, 2025 at 8:05 AM
Results:
Two weeks ago I was looking for Vision & Language models, and @dcatteeu.bsky.social helpfully suggested to look at the Open VLM Leaderboard.

At least two models are not on the list, even though they look really nice:
1. Pixtral, by @mistralai.bsky.social
2. Molmo, by @ai2.bsky.social
I'm looking for the current best Vision & Language models that can be run on a modern laptop, ideally through LMStudio.

I currently have Qwen2.5-VL 7B and LLaVA 1.5 7B. Any others I should consider?

#NLP #NLProc #ML #AI
March 14, 2025 at 12:39 PM
Reposted by Emiel van Miltenburg
i know people are coming here for catharsis and that's fine but when you are done screaming please pick something you can do that will help people and do it. you don't accomplish anything by rolling around in the fear all day
January 22, 2025 at 12:03 AM
I'm looking for the current best Vision & Language models that can be run on a modern laptop, ideally through LMStudio.

I currently have Qwen2.5-VL 7B and LLaVA 1.5 7B. Any others I should consider?

#NLP #NLProc #ML #AI
February 27, 2025 at 7:57 PM
I recently answered a question for the Dutch "AI helpdesk" on the future of LLMs: ikhebeenvraagoverai.nl/answers/wat-...

Interesting exercise to write a short answer in plain language. I really enjoyed this! At the same time it's hard to provide a nuanced answer in ~1000 words. I hope it's useful.
Wat zou een grotere prioriteit moeten krijgen om Large Language Models (LLM’s) zoals ChatGPT veilig en betrouwbaar verder te ontwikkelen?
Large Language Models (LLM’s), Confabulaties en Hallucinaties in LLM’s
ikhebeenvraagoverai.nl
February 12, 2025 at 2:18 PM
Reposted by Emiel van Miltenburg
What is a good pointer for a lecture/tutorial about the empirical process of LM/NLP/ML solution development? Rather than the technical details of models, optimization, etc. Basically, how to work with data, evaluation, measure generalization, overfitting, etc.
January 27, 2025 at 3:52 PM
Not sure how documents like these (i.e., reports on community matters) should be published.

I just went for ArXiv, at least for the first version of the report, but there should be a place to make these kinds of documents more findable.
During my time in the SIGGEN board, we received a request from the @aclmeeting.bsky.social executive board to create an overview of dual use issues in Natural Language Generation. In response, I carried out a survey. The results are here: arxiv.org/abs/2501.06636

Feedback is very welcome.
Dual use issues in the field of Natural Language Generation
This report documents the results of a recent survey in the SIGGEN community, focusing on Dual Use issues in Natural Language Generation (NLG). SIGGEN is the Special Interest Group (SIG) of the Associ...
arxiv.org
January 14, 2025 at 9:21 AM
During my time in the SIGGEN board, we received a request from the @aclmeeting.bsky.social executive board to create an overview of dual use issues in Natural Language Generation. In response, I carried out a survey. The results are here: arxiv.org/abs/2501.06636

Feedback is very welcome.
Dual use issues in the field of Natural Language Generation
This report documents the results of a recent survey in the SIGGEN community, focusing on Dual Use issues in Natural Language Generation (NLG). SIGGEN is the Special Interest Group (SIG) of the Associ...
arxiv.org
January 14, 2025 at 8:53 AM
Reposted by Emiel van Miltenburg
Good news everybody! You have an extra 7 days to prepare you submissions for #INLG2024! Please let everyone you know know!

The new deadline for Friday, 7th of June 11:59 PM AOE!

inlg2024.github.io/calls.html
INLG2024
The 17th International Natural Language Generation Conference is scheduled to be held in Tokyo, Japan from September 23rd to 27th, 2024.
inlg2024.github.io
May 29, 2024 at 2:19 PM
Do you have experience writing ALT-text or any other image descriptions for accessibility purposes? (On BlueSky or elsewhere.)

Please fill in our survey!

URL: tilburghumanities.eu.qualtrics.com/jfe/form/SV_...

Please share widely!

#a11y #accessibility
Survey on Image Description
Do you have any experience providing image descriptions (also known as ALT-text)? Please fill in our survey! This is part of the Experience Matters project, funded by ZonMw. Principal investigators: E...
tilburghumanities.eu.qualtrics.com
May 25, 2024 at 7:07 AM
New #NLProc paper on the evaluation of task-oriented dialogue systems: arxiv.org/abs/2312.13871

This took a while to complete (#slowscience), but it was really nice to be able to contribute to the paper.
December 22, 2023 at 11:22 AM
I’m interested to learn more about science and technology studies (#sts). What is the best place to start?

I’ve found this book by Sergio Sismondo: www.wiley.com/en-in/An+Int...

Is that the recommended starting point or should I look elsewhere?
An Introduction to Science and Technology Studies, 2nd Edition
An Introduction to Science and Technology Studies, Second Edition reflects the latest advances in the field while continuing to provide students with a road map to the complex interdisciplinary ter...
www.wiley.com
August 30, 2023 at 9:12 PM