David M. Schmidt
dmschmidt.bsky.social
David M. Schmidt
@dmschmidt.bsky.social
#NLProc PhD Student & Research Associate at Bielefeld University
Working on: Question Answering over Linked Data, Semantic Web, Lexical Knowledge & Compositionality in AI
https://davidmschmidt.de
- and listened to keynotes of well-known researchers like Frank van Harmelen, Natasha Noy and Enrico Motta
A huge thanks to everyone who made this week such a memorable experience! And, if you are Master's/PhD student or PostDoc, I cannot recommend too much to apply for the next iteration of #ISWS!
June 16, 2025 at 2:27 PM
- worked in a research task force on building a reliable LLM-based metadata enrichment pipeline for cultural heritage objects (special thanks to our tutor Valentina Presutti and our whole team), as well as writing a corresponding white paper and presenting our results in the final session
June 16, 2025 at 2:26 PM
During the last week, among many other things, I
- summarized the motivation of my work in a 45s "Minute Madness" session
- presented my work during a poster session, getting helpful feedback from students and tutors (special thanks to Aidan Hogan and Stefano De Giorgis)
June 16, 2025 at 2:26 PM
The #ISWS2025 experience really managed to combine lots of fun activities, working with leading figures of the Semantic Web field as well as intense networking in a unique, wonderful way! It felt like a month worth of program items had been compressed to one magnificent piece of art.
June 16, 2025 at 2:25 PM
Social media is acknowledged as an important source of patient experience data to learn about patients’ unmet needs, priorities, and preferences. The objective of this study was to evaluate to what extent SOTA LLMs can appropriately summarize posts shared by patients in web-based forums.
April 15, 2025 at 11:07 AM
🎓 Authors: Rakhi Asokkumar Subjagouri Nair, Matthias Hartung, Philipp Heinisch, Janik Jaskolski, Cornelius Starke-Knäusel, Susana Veríssimo, David M. Schmidt, Philipp Cimiano

🔗 Paper: doi.org/10.2196/62909
Summarizing Online Patient Conversations Using Generative Language Models: Experimental and Comparative Study
Background: Social media is acknowledged by regulatory bodies (eg, the Food and Drug Administration) as an important source of patient experience data to learn about patients’ unmet needs, priorities,...
doi.org
April 15, 2025 at 11:05 AM
NLP/Text Generation
EN: uni-bielefeld.hr4you.org/job/view/4054
DE: uni-bielefeld.hr4you.org/job/view/4053

NLP/Information Extraction
EN: uni-bielefeld.hr4you.org/job/view/4059
DE: uni-bielefeld.hr4you.org/job/view/4057

If you have any questions, do not hesitate to contact me or Philipp directly!
Research Position - Text Generation/Natural Langua...
<div style="text-align: justify;">The Faculty of Engineering at Bielefeld University is looking for a research assistant to work on th...
uni-bielefeld.hr4you.org
March 6, 2025 at 2:40 PM
We currently have two fully-funded open PhD positions in our group with a focus on #NLProc, #InformationExtraction and #TextGeneration. I can really recommend both the group as well as Philipp Cimiano as a supervisor, so take this opportunity!
March 6, 2025 at 2:40 PM
💡 Interested? Try it yourself!

Tool: ag-sc.techfak.uni-bielefeld.de/ctvis/
ag-sc.techfak.uni-bielefeld.de
February 3, 2025 at 12:49 PM
For selecting clinical trials to be compared in systematic reviews, it is important they measure the same outcomes. Therefore, we developed a tool that provides an overview of the clinical trial information about glaucoma and type 2 diabetes and enables users to group them by outcomes.
February 3, 2025 at 12:49 PM
💡 Interested? Try it yourself!

Zenodo artifact: doi.org/10.5281/zeno...

GitHub repository: github.com/ag-sc/clinic...
January 8, 2025 at 4:44 PM
In this work, we investigate the influence of grammar-constrained decoding (GCD) as well as pointer generators (PG) on the performance of a domain-specific information extraction (IE) system. We investigate whether the addition of GCD and PG improve IE results of fine-tuned encoder-decoder models.
January 8, 2025 at 4:43 PM
🧑‍💻 Additionally, you can find the code and data if our approach on Zenodo, GitHub and DockerHub:
Zenodo artifact: doi.org/10.5281/zeno...
GitHub repository: github.com/ag-sc/neodud...
DockerHub image: hub.docker.com/r/dvs23/neod...
ag-sc/neodudes: v1.1.2
Mentioning results folder in root README now.
doi.org
November 28, 2024 at 4:51 PM
At the main conference, I presented our paper "Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System" as well as an accompanying poster and demo illustrating the strengths of our lexicon-based, compositional question answering approach.
November 28, 2024 at 4:48 PM