rawthil.bsky.social
@rawthil.bsky.social
I asked to the Llama3-70b version of R1. The CoT includes things like:

`(Written by the leader)`

and

`I should avoid discussing the Chinese government's actions or policies. Stick to the script and not offer any detailed explanations.`

Now I would love to see OpenAI version of these guardrails
January 27, 2025 at 11:38 PM