Umar Iqbal
@umariqbal.bsky.social
78 followers
130 following
14 posts
Assistant professor at the Washington University in St. Louis. I research computer security and privacy.
Posts
Media
Videos
Starter Packs
Umar Iqbal
@umariqbal.bsky.social
· Feb 18
Umar Iqbal
@umariqbal.bsky.social
· Feb 18
Umar Iqbal
@umariqbal.bsky.social
· Feb 18
Umar Iqbal
@umariqbal.bsky.social
· Feb 18
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model's original instructions with their own malicious prompts.
openai.com
Umar Iqbal
@umariqbal.bsky.social
· Feb 18
Umar Iqbal
@umariqbal.bsky.social
· Feb 18
Umar Iqbal
@umariqbal.bsky.social
· Feb 18
Umar Iqbal
@umariqbal.bsky.social
· Feb 18
Reposted by Umar Iqbal