Yoshua Bengio
@yoshuabengio.bsky.social
Working towards the safe development of AI for the benefit of all at Université de Montréal, LawZero and Mila.

A.M. Turing Award Recipient and most-cited AI researcher.

https://lawzero.org/en
https://yoshuabengio.org/profile/
For example, research shows that:
· When given 10 attempts, attackers can use malicious prompts to bypass leading systems' safeguards about half the time.
· Inserting as few as 250 malicious documents into a model's training data can introduce vulnerabilities.
(6/6)
November 25, 2025 at 12:06 PM
Yet significant challenges remain and the real-world effectiveness of many safeguards is uncertain. ⬇️
(5/6)
November 25, 2025 at 12:06 PM
· A growing number of companies are adopting Frontier AI Safety Frameworks, which describe the safety and security measures they will take as their AI models become more capable.
· Technical safeguards are beginning to inform transparency measures in governance frameworks.
(4/6)
November 25, 2025 at 12:06 PM
Over the past year, we’ve seen meaningful progress:
· Improvements in adversarial training methods to make models more resistant to potentially harmful requests,
· Better tools for tracking AI-generated content.
(3/6)
November 25, 2025 at 12:06 PM
This key update, paired with the first one published in October, aims to provide policymakers and researchers with timely information about important trends in frontier AI development.
Read the full Update here: internationalaisafetyreport.org/publication/...
(2/6)
Second Key Update: Technical Safeguards and Risk Management
This Key Update examines developments in technical approaches to general-purpose AI risk management, from training models to refuse harmful requests to watermarking AI-generated content. Since the pub...
November 25, 2025 at 12:06 PM
Learn more about the 5 main concrete actions they should pursue in our latest Blueprint for Multinational Advanced AI Development: aigi.ox.ac.uk/publications...
A Blueprint for Multinational Advanced AI Development - Oxford Martin AIGI
Executive Summary: The global race to develop advanced AI has entered a new phase marked by staggering investments, rapid technical breakthroughs, and intensifying geopolitical competition. The United S...
November 24, 2025 at 4:26 PM
It was equally a pleasure to receive this distinction alongside an exceptional group of colleagues, whose contributions have had a profound impact on the field of AI as we know it today. www.youtube.com/watch?v=0zXS...
The Minds of Modern AI: Jensen Huang, Yann LeCun, Fei-Fei Li & the AI Vision of the Future | FT Live
YouTube video by FT Live
November 7, 2025 at 9:33 PM
Addressing these risks doesn't mean stopping progress.
Innovation & regulation must go hand in hand, notably by building technical safeguards to make AI systems more trustworthy for individuals and businesses. That's at the heart of our work at @law-zero.bsky.social.
November 6, 2025 at 9:02 PM
Thank you to the University of Copenhagen, the European Commission @ec.europa.eu, and my co-panelists — Lene Oddershede, Max Welling, Peter Sarlin & @fabiantheis.bsky.social — for a day of excellent discussions.
November 3, 2025 at 5:22 PM
I want to congratulate EVP @hennavirkkunen.bsky.social and EU Commissioner Ekaterina Zaharieva on the launch of RAISE, a critical initiative to support AI for Science and Science for AI.
November 3, 2025 at 5:22 PM
We covered current evidence around AI's potential for misuse, governance & international cooperation mechanisms, and strategies to ensure AI security.
October 31, 2025 at 1:48 PM