Aengus Lynch
aengusl.bsky.social
Aengus Lynch
@aengusl.bsky.social
AI safety researcher
NEW PAPER: Best-of-N Jailbreaking.

We modify LLM inputs with simple, randomly generated augmentations and jailbreak frontier models across text, vision, and audio modalities.

The algorithm is simple, scalable and highly effective.
December 13, 2024 at 5:27 PM