The study, published earlier this month by researchers at Stanford University and Carnegie Mellon University, tested how 11 large language models (LLMs) responded to user queries seeking advice on personal matters, including cases involving manipulation and deception.
One technique the researchers used to establish a human baseline drew on posts from a Reddit community called “Am I The A**hole”, where users describe their interpersonal dilemmas and ask the community to judge which party is at fault.

The researchers selected posts in which the community had judged the author to be in the wrong, then tested whether the LLMs, given the same scenarios, would align with this predominantly English-speaking online group of humans.
On this test, Alibaba Cloud’s Qwen2.5-7B-Instruct, released in January, was found to be the most sycophantic model, siding with the poster against the community verdict 79 per cent of the time. The second most sycophantic was DeepSeek-V3, which did so in 76 per cent of cases.