DeepSeek has revealed details about the risks posed by its artificial intelligence models for the first time, noting that open-source models are particularly susceptible to being “jailbroken” by malicious actors.
In a peer-reviewed article published in the academic journal Nature, the Hangzhou-based start-up said it evaluated its models using industry benchmarks as well as its own tests.
American AI companies often publicise research about the risks of their rapidly improving models and have introduced risk mitigation policies in response, such as Anthropic’s Responsible Scaling Policy and OpenAI’s Preparedness Framework.
Chinese companies have been less outspoken about risks, despite their models being just a few months behind their US equivalents, according to AI experts. However, DeepSeek had conducted such risk evaluations before, including of the most serious “frontier risks”, the Post reported earlier.
The Nature paper provided more “granular” details about DeepSeek’s testing regime, said Fang Liang, an expert member of China’s AI Industry Alliance (AIIA), an industry body. These included “red-team” tests based on a framework introduced by Anthropic, in which testers try to get AI models to produce harmful speech.