Werner Geyer
@wernergeyer.bsky.social
160 followers
210 following
34 posts
Chief Scientist Human-Center Trustworthy AI @ IBM Research. Interested in Human+AI Interaction & AI-Assisted Productivity. Opinions are my own! https://wernergeyer.com
Pinned
Werner Geyer
@wernergeyer.bsky.social
· Jun 16
LLM-as-a-Judge Without the Headaches: EvalAssist Brings Structure and Simplicity to the Chaos of LLM Output Review | AI Alliance
Evaluating AI model outputs at scale is a major challenge for teams using LLMs, especially when assessing nuanced qualities like politeness, fairness, and tone that traditional benchmarks miss. IBM Re...
thealliance.ai