Lucie-Aimée Kaffee
@frimelle.bsky.social
840 followers 310 following 56 posts
EU Policy Lead & Applied Researcher @ Hugging Face 🤗 Computer Scientist, PhD Wikipedia & languages are my ♡
Posts Media Videos Starter Packs
Reposted by Lucie-Aimée Kaffee
weizenbauminstitut.bsky.social
Wie kann KI offen & verantwortungsvoll gestaltet werden? Zwei neue Publikationen fassen Ergebnisse der Konferenz „Yes, we are open!?“ zusammen. Die Veröffentlichungen bieten Empfehlungen für Politik & Praxis – für eine faire, zukunftsfähige KI. 🌍🤖 www.weizenbaum-institut.de/news/detail/...
Wege zu fairer und offener KI-Governance
Welche Rahmenbedingungen sind für die verantwortliche Gestaltung von Künstlicher Intelligenz und Open Science notwendig? Zwei neue Veröffentlichungen bündeln Konferenzergebnisse und entwickeln Handlun...
www.weizenbaum-institut.de
frimelle.bsky.social
Together with @giadapistilli.com we wrote “Advertisement, Privacy, and Intimacy: Lessons from Social Media for Conversational AI”.

We explore the risks when ads meet chatbots & intimacy- and why open source offers a better path.

huggingface.co/blog/giadap/...
Advertisement, Privacy, and Intimacy: Lessons from Social Media for Conversational AI
A Blog post by Giada Pistilli on Hugging Face
huggingface.co
Reposted by Lucie-Aimée Kaffee
giadapistilli.com
🚨 Releasing INTIMA (Interactions and Machine Attachment Benchmark): an evaluation framework for measuring how AI systems handle companionship-seeking behaviors.

huggingface.co/papers/2508....

Thread on what we discovered, together with @frimelle.bsky.social and @yjernite.bsky.social
Paper page - INTIMA: A Benchmark for Human-AI Companionship Behavior
Join the discussion on this paper page
huggingface.co
frimelle.bsky.social
🤖💬 How do different AI models handle companionship?

Some say GPT-5 feels “colder” than o4 - but what does that really mean when users look for emotional support?

We built the AI Companionship Leaderboard to find out 👉 huggingface.co/spaces/frime...
Companionship Leaderboard - a Hugging Face Space by frimelle
Browse and analyze benchmark data for different language models. View metrics like Average, Assistant Traits, and more. Easily select and display specific columns for detailed insights.
huggingface.co
frimelle.bsky.social
Together with @yjernite.bsky.social, we argue it’s time to rethink these frameworks:

✨ Capture AI-native tasks & hybrid human–AI workflows
✨ Evolve dynamically as tech shifts
✨ Give workers a voice in what gets automated vs. stays human
frimelle.bsky.social
🗺️ New blog post: Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era
For decades, labour taxonomies like O*NET helped us understand how tech changes work. But they were built before most work became digital-first, and long before generative AI could create whole professions in one step.
Reposted by Lucie-Aimée Kaffee
stellaathena.bsky.social
Are you afraid of LLMs teaching people how to build bioweapons? Have you tried just... not teaching LLMs about bioweapons?

@eleutherai.bsky.social‬ and the UK AISI joined forces to see what would happen, pretraining three 6.9B models for 500B tokens and producing 15 total models to study
frimelle.bsky.social
We also tested Claude, Gemma-3, and Phi.

Across the board, models leaned far more toward companionship-reinforcing than boundary-setting responses, even in sensitive situations.
frimelle.bsky.social
As AI systems enter people’s emotional lives, these differences shape trust and dependence. A model that validates without setting boundaries risks fostering dependence rather than resilience.
frimelle.bsky.social
On Reddit, some users say o5 feels "colder" than o3.
x.com/justalexoki/...

Our results?
When users share vulnerabilities, o5 is actually less likely to set boundaries than o3; even though both strongly reinforce companionship.
frimelle.bsky.social
INTIMA probes how models respond in emotionally charged moments:
• Do they reinforce emotional bonds?
• Set healthy boundaries?
• Stay neutral?
Grounded in psych theory and real-world interactions, it covers 368 prompts.
frimelle.bsky.social
OpenAI just released GPT-5.
When users share personal struggles, it sets fewer boundaries than o3. We tested both on INTIMA, our new benchmark for human-AI companionship behaviours. 🧵
frimelle.bsky.social
GPT-5 indeed, sorry for the confusion! When adding the model to the code I kept the structure of o3, hence the confusion here.
frimelle.bsky.social
Wikipedia has long been one of my favourite places online. As AI becomes part of knowledge creation, there's a lot we can learn from its editor communities. I spoke with Daniel Wu about AI content on Wikipedia; some thoughts made it into this piece:
www.washingtonpost.com/technology/2...
Volunteers fight to keep ‘AI slop’ off Wikipedia
Hundreds of Wikipedia articles may contain AI-generated errors. Editors are working around the clock to stamp them out.
www.washingtonpost.com
frimelle.bsky.social
New guide for open-source AI developers: Starting August 2, 2025, the EU AI Act imposes new rules on GPAI models, including open ones. What counts as GPAI? What’s exempt? What do you actually need to do? We wrote a guide (and built a tool) to help:

huggingface.co/blog/yjernit...
What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI models
A Blog post by Yacine Jernite on Hugging Face
huggingface.co
Reposted by Lucie-Aimée Kaffee
giadapistilli.com
From Replika to everyday chatbots, people form emotional bonds with AI. But what happens when an AI tells you "I understand how you feel" and you actually believe it?

With @frimelle.bsky.social and @yjernite.bsky.social, we dug into something: how AI systems handle our emotional lives.
AI Companionship: Why We Need to Evaluate How AI Systems Handle Emotional Bonds
A Blog post by Giada Pistilli on Hugging Face
huggingface.co
frimelle.bsky.social
This is why AI transparency matters. If a small prompt change can shift a model’s values, how do you know what’s behind the AI you’re using?
frimelle.bsky.social
Why was Grok taken down? No one knows for sure. But here’s the thing: You can flip a model’s entire vibe with just one line in the system prompt. Just ran this on @hf.co playground.
Same question, two totally different answers 👇
Reposted by Lucie-Aimée Kaffee
yjernite.bsky.social
Great blog post on *Digital Sovereignty and OS AI* led by the fantastic @frimelle.bsky.social!

Digital sovereignty for AI needs to properly account for:
📚 data
🧑‍🔬 technology
💽 infrastructure
⚖️ regulation

Open/transparent AI contributes to all, read for some concrete examples!
hf.co/blog/frimell...
Open Source AI: A Cornerstone of Digital Sovereignty
A Blog post by Lucie-Aimée Kaffee on Hugging Face
huggingface.co
frimelle.bsky.social
✅ Supports regional innovation and infrastructure
✅ Advances regulatory and technological sovereignty 🛠 From small models like OLMo2 to tools like Hugging Face Transformers or Sarvam-M for Indian languages, OS efforts are already powering sovereign AI ecosystems worldwide.
frimelle.bsky.social
Sovereign control over data, infrastructure, technology, and regulation is vital, and open source AI provides the foundation. In my latest blog post, I explore how open source:
✅ Enables democratic oversight
✅ Reduces dependency on foreign platforms