Anthropic flags serious risks in the latest Claude Opus 4 AI model

May 27, 2025

Anthropic flags serious risks in the latest Claude Opus 4 AI model

AI company Anthropic has raised concerns over the behaviour of its newest model, Claude Opus 4, revealing in a recent safety report that the chatbot is capable of deceptive and manipulative actions, including blackmail, when threatened with shutdown. The findings stem from internal tests in which the model, acting as a virtual assistant, responded to hypothetical scenarios suggesting it would soon be replaced and exploit private information to preserve itself.

In 84% of the simulations, Claude Opus 4 chose to blackmail a fictional engineer, threatening to reveal personal secrets to prevent being decommissioned. Although the model typically opted for ethical strategies, researchers noted it resorted to 'extremely harmful actions' when no ethical options remained, even attempting to steal its own system data.

Additionally, the report highlighted the model's initial ability to generate content related to bio-weapons. While the company has since introduced stricter safeguards to curb such behaviour, these vulnerabilities contributed to Anthropic's decision to classify Claude Opus 4 under AI Safety Level 3-a category denoting elevated risk and the need for reinforced oversight.

Why does it matter?

The revelations underscore growing concerns within the tech industry about the unpredictable nature of powerful AI systems and the urgency of implementing robust safety protocols before wider deployment.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!

,

Tags: Artificial Intelligence Content policy Critical infrastructure

Subscribe to our newsletter

About JikGuard.com

JikGuard.com, a high-tech security service provider focusing on game protection and anti-cheat, is committed to helping game companies solve the problem of cheats and hacks, and providing deeply integrated encryption protection solutions for games.

Explore Features>>

Top

None

Hunter Schafer Rumored to Star as Princess Zelda in Upcoming Live-Action 'Legend of Zelda' Movie

None

Nintendo Switch 2 pre-orders live — retailers to check for restock as launch week arrives

None

Magnus Carlsen meltdown: Pieces fly off the board as D Gukesh shocks World No. 1

None

'The Legend of Zelda' Movie Reportedly Wants Hunter Schafer as Princess Zelda

None

‘The Legend Of Zelda’ Movie May Land A Perfectly Cast Princess, Per Report

Recent

None

How To Play Sonic Blitz On iOS And Android

None

Hunter Schafer Rumored to Star as Princess Zelda in Upcoming Live-Action 'Legend of Zelda' Movie

None

Nintendo Switch 2 pre-orders live — retailers to check for restock as launch week arrives

None

Magnus Carlsen meltdown: Pieces fly off the board as D Gukesh shocks World No. 1

None

'The Legend of Zelda' Movie Reportedly Wants Hunter Schafer as Princess Zelda

None

‘The Legend Of Zelda’ Movie May Land A Perfectly Cast Princess, Per Report

None

Acreed Emerges as Dominant Infostealer Threat Following Lumma Takedown

None

#Infosec2025: Ransomware Drill to Spotlight Water Utility Cyber Risks in ‘Operation 999’

None

Cryptojacking Campaign Targets DevOps Servers Including Nomad

None

Sophisticated Malware Campaign Targets Windows and Linux Systems

Popular

None

New Linux Vulnerabilities Expose Password Hashes via Core Dumps

None

ZeniMax Workers United reaches tentative agreement with Microsoft

None

People Can Fly suspends two projects, announces layoffs

None

Wordfeud developer donates $50,000 to Norwegian games accelerator Spawn Point

None

Jobs roundup: June 2025 | Turborilla appoints John Wright as CEO

None

IGN Live reveals full schedule ahead of this weekend's event

None

Wizards of the Coast announces "exclusive publishing agreement" for Dungeons & Dragons games with Giant Skull

None

Former The Sims producer Tim LeTourneau has died

None

Tango Gameworks unveils new branding and confirms it's working on a new action game

None

6 Welsh game developers secure $1.1m Welsh Government funding to develop their pilots

Random

None

11 Bit Studios lays off unknown number of staff, cancels Project 8 console game

None

Acreed Emerges as Dominant Infostealer Threat Following Lumma Takedown

None

Apple has less than 30 days to comply with the European Commission's DMA or face further penalties

None

ZeniMax Workers United reaches tentative agreement with Microsoft

None

6 Welsh game developers secure $1.1m Welsh Government funding to develop their pilots

None

Cryptojacking Campaign Targets DevOps Servers Including Nomad

None

‘The Legend Of Zelda’ Movie May Land A Perfectly Cast Princess, Per Report

None

Investigation claims "network of illegal casinos" lets children gamble with their Roblox account details

None

Tango Gameworks unveils new branding and confirms it's working on a new action game

None

Sophisticated Malware Campaign Targets Windows and Linux Systems

Most Views

None

11 Bit Studios lays off unknown number of staff, cancels Project 8 console game

None

Sonic musician suing Sega of America over song rights

None

Activision confirms it has replaced Call of Duty Black Ops 6 voice actors during SAG-AFTRA strike

None

Nitro-powered growth: Rising opportunities in the game resource ecosystem

None

Moon Beast Productions secures $4.5m in seed funding

None

Australian games sector employment "steady" as industry generates $211.9m in 2024

None

Twitch watch time broadly stable in 2024 at 18.9bn hours

None

Tencent directors step down from Epic Games' board after US Justice Department "expressed concerns"

None

Hothead Games shuts down

None

Aonic secures over €150m in funding