Lightnews — Scholar-powered news

@docandalf.bsky.social

47 followers 25 following 0 posts

Posts Replies Media Videos

Reposted

Yoshua Bengio

@yoshuabengio.bsky.social

Early signs of deception, cheating & self-preservation in top-performing models in terms of reasoning are extremely worrisome. We don't know how to guarantee AI won't have undesired behavior to reach goals & this must be addressed before deploying powerful autonomous agents.
time.com/7259395/ai-c...

When AI Thinks It Will Lose, It Sometimes Cheats

When sensing defeat in a match against a skilled chess bot, advanced models sometimes hack their opponent, a study found.

time.com

February 20, 2025 at 4:45 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news