World Economist – Global Markets, Finance & Economic Insights
Alibaba Cloud claims to slash Nvidia GPU use by 82% with new pooling system

By admin, October 18, 2025


Alibaba Group Holding has introduced a compute pooling system that it says cut by 82 per cent the number of Nvidia graphics processing units (GPUs) needed to serve its artificial intelligence models.

The system, called Aegaeon, was beta tested in Alibaba Cloud’s model marketplace for more than three months, where it reduced the number of Nvidia H20 GPUs required to serve dozens of models of up to 72 billion parameters from 1,192 to 213, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.
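The headline 82 per cent figure can be checked against the two GPU counts quoted above. The short sketch below is only an arithmetic illustration; the helper name `reduction_percent` is made up for this example and does not come from the paper.

```python
def reduction_percent(before: int, after: int) -> float:
    """Percentage cut when going from `before` units to `after` units."""
    return (before - after) / before * 100


# GPU counts as reported in the article (Nvidia H20 GPUs).
gpus_before = 1192
gpus_after = 213

cut = reduction_percent(gpus_before, gpus_after)
print(f"{cut:.1f}% fewer GPUs")  # prints "82.1% fewer GPUs"
```

Rounded to the nearest whole number, this matches the 82 per cent reduction Alibaba Cloud cites.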

“Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market,” the researchers from Peking University and Alibaba Cloud wrote.

Alibaba Cloud is the AI and cloud services unit of Hangzhou-based Alibaba, which owns the South China Morning Post. Its chief technology officer, Zhou Jingren, is one of the paper’s authors.

Cloud services providers, such as Alibaba Cloud and ByteDance’s Volcano Engine, serve thousands of AI models to users concurrently, meaning that many application programming interface calls are handled at the same time.

However, a handful of models, such as Alibaba’s Qwen and DeepSeek, are the most popular for inference, while most other models are called upon only sporadically. This leads to resource inefficiency: 17.7 per cent of GPUs were allocated to serve only 1.35 per cent of requests in Alibaba Cloud’s marketplace, the researchers found.
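To put that imbalance in perspective, the two percentages can be compared directly. The ratio computed below is a derived illustration based only on the figures quoted above, not a number reported by the researchers, and the variable names are assumptions chosen for this sketch.

```python
# Shares reported for Alibaba Cloud's marketplace, in per cent.
gpu_share = 17.7       # share of GPUs allocated to rarely used models
request_share = 1.35   # share of requests those GPUs actually served

# How many times larger the GPU share is than the request share.
over_allocation = gpu_share / request_share
print(f"GPU share is about {over_allocation:.1f}x the request share")
```

In other words, under these figures the rarely called models held roughly 13 times more GPU capacity than their share of traffic would suggest.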

Researchers globally have sought to improve efficiency by pooling GPU power, for instance by allowing one GPU to serve multiple models.


