Ant Group Releases Open-Source Multimodal AI Model Ming-Flash-Omni 2.0

Stock News
02/11

Ant Group has officially open-sourced its latest multimodal large model, Ming-Flash-Omni 2.0. The model demonstrates exceptional performance across multiple public benchmarks, with particularly strong capabilities in visual-language understanding, controllable speech generation, and image generation and editing; on some metrics it surpasses Gemini 2.5 Pro.

Ming-Flash-Omni 2.0 is also the industry's first model capable of unified audio generation across all scenarios, enabling the synchronized synthesis of speech, ambient sound effects, and background music within a single audio track. Users can finely adjust parameters such as timbre, speech rate, tone, volume, emotion, and even dialect through natural-language instructions.

In terms of inference efficiency, the model achieves an extremely low inference frame rate of 3.1 Hz, allowing real-time generation of high-fidelity, minute-long audio while significantly reducing computational cost and response time.

Ant Group has been investing in multimodal research for several years, and the Ming-Omni series is now in its third iteration. By open-sourcing Ming-Flash-Omni 2.0, Ant releases its core capabilities as a reusable foundation, providing a unified entry point for end-to-end multimodal application development. Users can also try and access the model online through Ant's official platform, Ling Studio.

