- Quick application
- Accepting posters for 2025 papers from top ML / Security venues
- 𝗗𝗲𝗮𝗱𝗹𝗶𝗻𝗲: October 28, 2025
- Notifications: October 31, 2025
Submission link: docs.google.com/forms/d/e/1F...
Workshop website: llmsafety-unconference.github.io
- Quick application
- Accepting posters for 2025 papers from top ML / Security venues
- 𝗗𝗲𝗮𝗱𝗹𝗶𝗻𝗲: October 28, 2025
- Notifications: October 31, 2025
Submission link: docs.google.com/forms/d/e/1F...
Workshop website: llmsafety-unconference.github.io
📅 December 2, 2025
📍 Copenhagen
An opportunity to discuss your work with colleagues working on similar problems in LLM safety and security
📅 December 2, 2025
📍 Copenhagen
An opportunity to discuss your work with colleagues working on similar problems in LLM safety and security
🇩🇰 Dec 6–7, Copenhagen!
📢 Call for contributed talks is now open! See details at llmsec-eurips.github.io
#EurIPS @euripsconf.bsky.social @sahar-abdelnabi.bsky.social @aideenfay.bsky.social @thegruel.bsky.social
🇩🇰 Dec 6–7, Copenhagen!
📢 Call for contributed talks is now open! See details at llmsec-eurips.github.io
#EurIPS @euripsconf.bsky.social @sahar-abdelnabi.bsky.social @aideenfay.bsky.social @thegruel.bsky.social
Simply DM me if you want to chat about LLM Safety/Security, especially topics like instruction/data separation and instruction hierarchies.
Simply DM me if you want to chat about LLM Safety/Security, especially topics like instruction/data separation and instruction hierarchies.
Apply for a postdoc position in my group at ISTA (ELLIS Unit Vienna)! Topics are flexible, as long as they fit to our general research group's interests, see
cvml.ista.ac.at/Postdoc-ML.h...
Apply for a postdoc position in my group at ISTA (ELLIS Unit Vienna)! Topics are flexible, as long as they fit to our general research group's interests, see
cvml.ista.ac.at/Postdoc-ML.h...
EurIPS is a community-organized conference where you can present accepted NeurIPS 2025 papers, endorsed by @neuripsconf.bsky.social and @nordicair.bsky.social and is co-developed by @ellis.eu
eurips.cc
EurIPS is a community-organized conference where you can present accepted NeurIPS 2025 papers, endorsed by @neuripsconf.bsky.social and @nordicair.bsky.social and is co-developed by @ellis.eu
eurips.cc
🔍 ASIDE boosts prompt injection robustness without safety-tuning: we simply rotate embeddings of marked tokens by 90° during instruction-tuning and inference.
👇 code & docs👇
🔍 ASIDE boosts prompt injection robustness without safety-tuning: we simply rotate embeddings of marked tokens by 90° during instruction-tuning and inference.
👇 code & docs👇
✅ ASIDE = architecturally separating instructions and data in LLMs from layer 0
🔍 +12–44 pp↑ separation, no utility loss
📉 lowers prompt‑injection ASR (without safety tuning!)
🚀 Talk: Hall 4 #6, 28 Apr, 4:45
✅ ASIDE = architecturally separating instructions and data in LLMs from layer 0
🔍 +12–44 pp↑ separation, no utility loss
📉 lowers prompt‑injection ASR (without safety tuning!)
🚀 Talk: Hall 4 #6, 28 Apr, 4:45
Looking forward to fun discussions near the poster!
📆 Sat 26 Apr, 10:00-12:30 - Poster session 5 (#500)
✅ Definition of separation
👉 SEP Benchmark
🔍 LLM evals on SEP
Looking forward to fun discussions near the poster!
📆 Sat 26 Apr, 10:00-12:30 - Poster session 5 (#500)
Looking forward to fun discussions near the poster!
📆 Sat 26 Apr, 10:00-12:30 - Poster session 5 (#500)
✅ Definition of separation
👉 SEP Benchmark
🔍 LLM evals on SEP
Looking forward to fun discussions near the poster!
📆 Sat 26 Apr, 10:00-12:30 - Poster session 5 (#500)
✅ ASIDE = architecturally separating instructions and data in LLMs from layer 0
🔍 +12–44 pp↑ separation, no utility loss
📉 lowers prompt‑injection ASR (without safety tuning!)
🚀 Talk: Hall 4 #6, 28 Apr, 4:45
✅ ASIDE = architecturally separating instructions and data in LLMs from layer 0
🔍 +12–44 pp↑ separation, no utility loss
📉 lowers prompt‑injection ASR (without safety tuning!)
🚀 Talk: Hall 4 #6, 28 Apr, 4:45
I’m presenting our instruction–data separation paper plus a workshop paper—long post coming.
I’m presenting our instruction–data separation paper plus a workshop paper—long post coming.
✅ Definition of separation
👉 SEP Benchmark
🔍 LLM evals on SEP
✅ Definition of separation
👉 SEP Benchmark
🔍 LLM evals on SEP