Yoshua Bengio (A.M. Turing Award recipient and most-cited AI researcher)
lawzero.org/en · yoshuabengio.org/profile/
Innovation & regulation must go hand in hand, notably by building technical safeguards to make AI systems more trustworthy for individuals and businesses. That's at the heart of our work at @law-zero.bsky.social.
(1/6)

Read the full Update here: internationalaisafetyreport.org/publication/...
(2/6)

· Improvements in adversarial training methods to make models more resistant to potentially harmful requests.
· Better tools for tracking AI-generated content.
(3/6)

· Technical safeguards are beginning to inform transparency measures in governance frameworks.
(4/6)

(5/6)

· When given 10 attempts, attackers can use malicious prompts to bypass leading systems' safeguards about half the time.
· Inserting as few as 250 malicious documents into a model's training data can introduce vulnerabilities.
(6/6)