Steffen Herbold
@sherbold.bsky.social
250 followers 130 following 66 posts
https://www.fim.uni-passau.de/ai-engineering/
Posts Media Videos Starter Packs
sherbold.bsky.social
Very interesting paper that shows that LLMs are good. But not good enough. I wish the following from the conclusion would have made it into the abstract:
sherbold.bsky.social
😂
merriam-webster.com
We are thrilled to announce that our NEW Large Language Model will be released on 11.18.25.
sherbold.bsky.social
Just accepted at TMLR:

We found evidence of copyright violations by LLMs even when we ask questions that were not part of the training. Indeed, we found that the amount of memorized content was independent from the questions being part of the training or not.

openreview.net/forum?id=ddo...
Studying memorization of large language models using answers to...
Large Language Models (LLMs) are capable of answering many software related questions and supporting developers by generating code snippets. These capabilities originate from training on massive...
openreview.net
Reposted by Steffen Herbold
michaeldorner.mastodon.social.ap.brid.gy
Scientific impact and achievement, redefined:
Huge congrats to #fraunhofer IIS on winning an #emmy for their JPEG XS compression standard 🏆🎉 […]
Original post on mastodon.social
mastodon.social
sherbold.bsky.social
re

(I miss IRC)

(Now I feel old)
sherbold.bsky.social
Dear all,

please enjoy your complementary "European Professor goes on Holiday" message.

See you in September.

Yours sincerely,
A European Professor
Reposted by Steffen Herbold
hadaskotek.bsky.social
Good news (for me!) my gender bias paper from 2023 still replicates with GPT-5.
Bad news (for everyone!) my gender bias paper from 2023 still replicates with GPT-5.
arxiv.org/pdf/2308.14921
hkotek.com/blog/gender-...
sherbold.bsky.social
I wonder what my PhD students will think, once they discover that "someone" glued the three laws to the wall in the hallway. 🙃
phdcomics.com
Newton's Laws of Graduation, Part 1
Reposted by Steffen Herbold
phdcomics.com
Newton's Laws of Graduation, Part 2 - The Second Law
Reposted by Steffen Herbold
phdcomics.com
Newton's Laws of Graduation, Part 3 - The Third Law 😆
sherbold.bsky.social
Success, a luxury problem, and its solution:
🎉 Our quiz is a huge success and incredibly popular on YouTube with now over 100,000 views.
😐 We cannot answer all the feedback and comments individually anymore.
😀 We write a follow up article to answer the most important questions.
sherbold.bsky.social
My debut as TV-Show moderator - now live on Youtube.

We had a lot of fun with how the five professors answered questions on topics ranging from 90's music, counting peas, size of Asian countries, etc.

The only drawback: it is only available in German.

P.S. The humans won.
unipassauresearch.bsky.social
Wie schlägt sich KI gegen professorale Expertise? Die Quiz-Show unter Moderation von @sherbold.bsky.social ist nun in voller Länge online. Wer sich vorab selbst mit der KI messen möchte, kann dies per Online-Quiz tun: www.digital.uni-passau.de/beitraege/20...

#KeepCALLM #5gegenKI
Quizshow 5 gegen KI - Professorenteam tritt gegen KI an
YouTube video by Universität Passau
www.youtube.com
Reposted by Steffen Herbold
unipassauresearch.bsky.social
Wie schlägt sich KI gegen professorale Expertise? Die Quiz-Show unter Moderation von @sherbold.bsky.social ist nun in voller Länge online. Wer sich vorab selbst mit der KI messen möchte, kann dies per Online-Quiz tun: www.digital.uni-passau.de/beitraege/20...

#KeepCALLM #5gegenKI
Quizshow 5 gegen KI - Professorenteam tritt gegen KI an
YouTube video by Universität Passau
www.youtube.com
sherbold.bsky.social
That was so much fun. I look forward to the video 😃
unipassauresearch.bsky.social
Wer kann besser Erbsen zählen - 🤖oder unsere Profs?
Gestern bei uns im Studio: die Quizshow #5gegenKI

Moderator @sherbold.bsky.social hat mit elf Fragen immer wieder für Überraschung gesorgt. Eine Erkenntnis: Auch KI kann faul sein.

Die Show stellen wir in Kürze online. #KeepCALLM #staytuned
Wie viele Erbsen sind auf diesem Bild? Moderator Steffen Herbold, Professor für AI Engineering (links im Bild zu sehen) hat mit seinen Fragen nicht nur die Professoren und die Professorin auf der Bühne ins Grübeln gebracht. Auf der Bühne rätselten: Florian Lemmerich, Martina Padmanabhan, Johann-Mattis List, Moritz Müller und Brian Valerius.
Reposted by Steffen Herbold
philippleitner.net
I'll just leave that quote here ...
sherbold.bsky.social
Yesterday: Let's try to ground AI models in reality.
Now: Let's try to ground reality on AI models.

Fixes a lot of issues. I am impressed. 😅

They should call it LLM as a Physicists, then it gets accepted by the community ... right? (Looking at you, everybody trusting LLM as a judge!)
sherbold.bsky.social
Happy to share that we published MAMUT @tmlrorg.bsky.social. We defined multiple data augmentation approaches to get more diverse mathematical data and show this improves pre-training.

Congrats to my student Jonathan Drechsel for his first publication! 🎉

www.fim.uni-passau.de/en/ai-engine...
Math Mutator (MAMUT) Accepted at TMLR
www.fim.uni-passau.de
sherbold.bsky.social
No need, I can already feel the @icseconf.bsky.social paper bidding approaching 🙃
sherbold.bsky.social
Starting the weekend on a Friday at 4pm with an empty inbox feels kind of strange. Good, but strange.
Reposted by Steffen Herbold
robertoverdecchia.bsky.social
How much energy is needed to generate an image? 🎨🧠⚡️
Up to 4.08 Wh — like charging your phone to 40%!
In our new study we tested 17 models & 9,000+ runs.

Other key finds:
⚡️ Model energy use varies up to 46x
📐 Resolution matters, prompts don't
🛠️ Quantization ≠ savings

📄 Preprint: lnkd.in/dKWWAETW
Reposted by Steffen Herbold
unipassauresearch.bsky.social
Können #LLMs einen neuen Zugang zum Recht eröffnen? Darüber spricht Brian Valerius, Professor für #KI im #Strafrecht, mit Rechtsanwalt Sven Galla, der KI bereits in der Praxis einsetzt.

📅 Donnerstag, 10. Juli, 18 Uhr
📍 Hörsaal 13

Mehr Infos: www.digital.uni-passau.de/generative-s...

#KeepCALLM
Ringvorlesung - Sprachmodelle: ein neuer Zugang zum Recht?
YouTube video by Universität Passau
youtu.be
sherbold.bsky.social
The truly impressive thing about Zoom is that whenever they update the UI, it gets worse.
sherbold.bsky.social
New pre-print: If you are wondering which models are good for non-code software engineering tasks, take a look at this work from my student Fabian Pena.

Also: Look at it if you want to know how to use Bayesian stats for ranking models.

arxiv.org/abs/2506.10833
Evaluating Large Language Models on Non-Code Software Engineering Tasks
Large Language Models (LLMs) have demonstrated remarkable capabilities in code understanding and generation; however, their effectiveness on non-code Software Engineering (SE) tasks remains underexplo...
arxiv.org