LawZero will be an ‘honest’ AI that protects you from rogue agents
Safety is always going to be paramount when it comes to artificial intelligence. After all, one of our collective fears is a highly advanced AI … The post LawZero will be an ‘honest’ AI that protects you from rogue agents appeared first on BGR.


Safety is always going to be paramount when it comes to artificial intelligence. After all, one of our collective fears is a highly advanced AI going rogue and threatening our very existence. It certainly doesn’t help to see that some of the smartest AI models out there resort to cheating to achieve their goals, or that some would even try to blackmail humans to preserve their integrity.
That actually happened during safety tests performed on frontier AI models before being released to the public. ChatGPT o1 made headlines a few months ago when security researchers found that the AI would resort to cheating at chess against a better opponent in order to achieve its goal, which was winning the game.
More recently, Claude 4 threatened an engineer who was supposed to delete the AI from a computer to expose the person’s infidelity to their partner. The AI obtained information about the deletion plans and the alleged affair from emails it had access to for the purpose of testing its behavior.
The actual Claude 4 will not try to blackmail users, though the AI does come with stronger guardrails than its predecessors to ensure it’s safe for users. That said, Claude 4 might decide to report you to authorities and the press if it thinks you’re engaging in nefarious activities, but that’s only a theoretical risk.
The blackmail scenario is what prompted Yoshua Bengio to create a new initiative called LawZero, which aims to develop honest AI programs that will detect AI systems that might attempt to deceive humans or go rogue.
The post LawZero will be an ‘honest’ AI that protects you from rogue agents appeared first on BGR.
Today's Top Deals
- Best Ring Video Doorbell deals
- Today’s deals: $399 iPad mini, $188 Vizio surround sound, $32 Thermacell mosquito repeller, more
- Best deals: Tech, laptops, TVs, and more sales
- Today’s deals: $1,750 Amazon gift card, Sonos speaker sale, Hisense 75-inch smart TV, foam dog beds, more
LawZero will be an ‘honest’ AI that protects you from rogue agents originally appeared on BGR.com on Tue, 3 Jun 2025 at 19:36:00 EDT. Please see our terms for use of feeds.