8月19日晚间,DeepSeek官方悄然上线了全新的V3.1版本。 官方公告强调了上下文长度拓展至128k,但随着社区的深入挖掘和实测,这次“小更新”之下其实有着更多模型架构的变革和模型重点能力的微调,在编程能力上的提升也可圈可点,成本优势重回显著。 然而,模型融合的技术路线也引发激烈争论,部分用户反馈旧版模型的“顽疾”复现,对这次更新的评价呈现出两极分化的态势。 发布两天后,DeepSeek官方...
Source Link8月19日晚间,DeepSeek官方悄然上线了全新的V3.1版本。 官方公告强调了上下文长度拓展至128k,但随着社区的深入挖掘和实测,这次“小更新”之下其实有着更多模型架构的变革和模型重点能力的微调,在编程能力上的提升也可圈可点,成本优势重回显著。 然而,模型融合的技术路线也引发激烈争论,部分用户反馈旧版模型的“顽疾”复现,对这次更新的评价呈现出两极分化的态势。 发布两天后,DeepSeek官方...
Source LinkDisclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.