https://orpheuslummis.info, based in Montréal
www.lesswrong.com/posts/csdn3e...
www.lesswrong.com/posts/csdn3e...
- Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training arxiv.org/abs/2401.05566
- Here Comes The AI Worm arxiv.org/abs/2403.02817
- Many-shot Jailbreaking openreview.net/forum?id=cw5...
- PoisonedRAG arxiv.org/abs/2402.07867
- Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training arxiv.org/abs/2401.05566
- Here Comes The AI Worm arxiv.org/abs/2403.02817
- Many-shot Jailbreaking openreview.net/forum?id=cw5...
- PoisonedRAG arxiv.org/abs/2402.07867