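Thanks to IT之家 reader 不一样的体验 for the tip! Is there another master among chipmakers? A newly launched chip has shot straight to the top of Silicon Valley's trending lists, with a peak inference speed of 17,000 tokens per second. To put that in perspective, Cerebras, currently regarded as the fastest, runs at about 2,000 tokens/s. The new chip is billed as 10 times faster (about 8.5 times by these figures), with cost cut 20-fold and power consumption reduced 10-fold. This means LLMs have truly reached instant, sub-millisecond response times. Yet the chip that swept Silicon Valley overnight does not come from Nvidia, ...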