Frieda grosgean
@bernaharbor.bsky.social
540 followers
750 following
330 posts
Psychiatrist and accordionist. Retired from UCLA. Autistic & ND advocate. Book addict and book writer.
Reposted by Frieda grosgean
The Guardian
@theguardian.com
· 11d
‘I think you’re testing me’: Anthropic’s new AI model asks testers to come clean
Safety evaluation of Claude Sonnet 4.5 raises questions about whether predecessors ‘played along’, firm says
If you are trying to catch out a chatbot take care, because one cutting-edge tool is showing signs it knows what you are up to.
Anthropic, a San Francisco-based artificial intelligence company, has released a safety analysis of its latest model, Claude Sonnet 4.5, and revealed it had become suspicious it was being tested in some way.
www.theguardian.com
Reposted by Frieda grosgean
Antijen
@antijen.bsky.social
· 19d
Reposted by Frieda grosgean
Frieda grosgean
@bernaharbor.bsky.social
· Aug 29