NVIDIA CEO Declares Dominance in AI Inference Performance

Deep News
03/17

At the GTC 2026 keynote, Jensen Huang showcased a championship belt emblazoned with the evaluation result from third-party chip analysis firm SemiAnalysis: "InferenceMAX KING." This title was claimed by NVIDIA for its GB300 NVL72 system, signifying it as the most powerful in inference performance.

The data originates from independent benchmarks conducted by SemiAnalysis. Measured in tokens per watt, the GB300 NVL72 outperforms competitors by 50 times. When assessed by cost per token, it is 35 times more economical than rival solutions. On stage, Huang corrected NVIDIA's previously announced efficiency improvement figure of 30 times, stating, "The actual number is 50 times."

The GB300 NVL72 is the flagship inference configuration based on the Blackwell Ultra architecture. It connects 72 GPUs via NVLink 6, providing a total system bandwidth of 260 TB/s. Huang emphasized that inference efficiency directly determines the revenue of AI factories, making it the most critical performance metric currently. The GB300 series is already in delivery, with the next-generation Vera Rubin architecture anticipated to enter mass production in 2027.

免責聲明:投資有風險,本文並非投資建議,以上內容不應被視為任何金融產品的購買或出售要約、建議或邀請,作者或其他用戶的任何相關討論、評論或帖子也不應被視為此類內容。本文僅供一般參考,不考慮您的個人投資目標、財務狀況或需求。TTM對信息的準確性和完整性不承擔任何責任或保證,投資者應自行研究並在投資前尋求專業建議。

熱議股票

  1. 1
     
     
     
     
  2. 2
     
     
     
     
  3. 3
     
     
     
     
  4. 4
     
     
     
     
  5. 5
     
     
     
     
  6. 6
     
     
     
     
  7. 7
     
     
     
     
  8. 8
     
     
     
     
  9. 9
     
     
     
     
  10. 10