Dr Francis Rhys Ward
f-rhys-ward.bsky.social
Dr Francis Rhys Ward
@f-rhys-ward.bsky.social
AGI Alignment Researcher
Reposted by Dr Francis Rhys Ward
Help me grow this starter pack for technical researchers working on AGI safety! go.bsky.app/D6P44sC Some flex, but aiming for mostly technical research rather than governance/strategy. Who am I missing?
November 25, 2024 at 2:04 PM
In real-life, agents with different subjective beliefs interact in a shared objective reality. They have higher-order beliefs about each other's beliefs and goals, which is required for phenomena involving theory-of-mind, like deception

Our paper formalises this in causal models
March 16, 2025 at 4:44 PM