Alibaba Cloud's New System Cuts Nvidia GPU Usage By 82%, Amid Trump's Flip Flop On AI Chip Ban On China

Benzinga
10/20

Alibaba Group Holding has introduced a compute-pooling system called Aegaeon that cuts the number of Nvidia GPUs needed to serve its AI models by 82%.

This innovation was tested in Alibaba Cloud's model marketplace for over three months, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.

The Aegaeon system reduced the number of Nvidia H20 GPUs required to serve models of up to 72 billion parameters from 1,192 to 213.
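The 82% headline figure follows directly from those two GPU counts. As a quick check, the short Python sketch below recomputes it; the variable names are illustrative and only the two counts come from the article.

```python
# GPU counts quoted in the article for models of up to 72 billion parameters.
gpus_before = 1192
gpus_after = 213

# Fraction of GPUs no longer needed, and how many old GPUs each remaining GPU replaces.
reduction = (gpus_before - gpus_after) / gpus_before
consolidation = gpus_before / gpus_after

print(f"GPU reduction: {reduction:.1%}")            # GPU reduction: 82.1%
print(f"Consolidation factor: {consolidation:.1f}x")  # Consolidation factor: 5.6x
```

Put differently, each GPU that remains does the work that roughly 5.6 GPUs did before pooling.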

"Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market," the researchers, from Peking University and Alibaba Cloud, stated in the paper.

Alibaba Cloud, the AI and cloud services division of the Hangzhou-based Alibaba, aims to boost efficiency by pooling GPU resources, allowing a single GPU to support multiple models.

The system targets a specific inefficiency: before Aegaeon, 17.7% of the GPUs in Alibaba Cloud's marketplace were allocated to serve only 1.35% of requests.
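To put that imbalance in perspective, the two percentages can be compared directly. The Python sketch below is a rough illustration only: it assumes request share is a fair proxy for load, which the article does not state, since individual requests can differ widely in size.

```python
# Shares quoted in the article for Alibaba Cloud's marketplace.
gpu_share = 0.177       # fraction of GPUs allocated to the rarely used models
request_share = 0.0135  # fraction of requests those models received

# Assumption: treating request share as a proxy for load, this is how many
# times more GPU capacity these models held than their traffic would suggest.
overprovision = gpu_share / request_share

print(f"Over-provisioning factor: {overprovision:.1f}x")  # Over-provisioning factor: 13.1x
```

Under that assumption, the least-used models held roughly 13 times more GPU capacity than their share of traffic.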

Cloud service providers like Alibaba Cloud and ByteDance's Volcano Engine manage thousands of AI models simultaneously, often leading to inefficiencies. The Aegaeon system seeks to optimize this process by reducing the number of GPUs needed.

This development comes amid growing concerns over Nvidia’s presence in China. Recently, China raised security concerns about Nvidia’s H20 chips, especially regarding potential backdoor risks. Separately, the Trump administration has struck a deal with Nvidia under which it receives a 15% share of revenue from the company's chip sales to China.

Nvidia CEO Jensen Huang stated that Nvidia’s market share in China has plummeted from 95% to zero. He expressed concerns over the impact of U.S. policies on Nvidia’s market presence in China.

Despite these challenges, Nvidia has financially insulated itself from potential escalations, as its guidance assumes zero revenue from China, according to Huang.

