Microsoft Releases DragonV2.1 Model, Making AI-Synthesized Speech More Natural and Expressive

Market News

31 Jul


(Source: IT之家)

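IT之家, July 31: Tech outlet NeoWin reported in a post today (July 31) that Microsoft has launched DragonV2.1Neural, a zero-shot learning model that can create more natural, more expressive voices from only a small amount of data and supports more than 100 languages.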

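Citing the post, IT之家 describes it as a zero-shot text-to-speech (TTS) model that promises more natural and expressive voices, along with better pronunciation accuracy and greater controllability.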

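The new model needs only a few seconds of sample speech to synthesize speech in more than 100 languages. By comparison, the earlier DragonV1 model had pronunciation problems with proper nouns. DragonV2.1 can be applied in a range of scenarios, including custom chatbot voices and dubbing video content across multiple languages.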

Microsoft says DragonV2.1 improves pronunciation accuracy: compared with DragonV1, its word error rate (WER) is 12.8% lower on average.
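For readers unfamiliar with the metric, the following is a minimal sketch of how a word error rate is commonly computed: the word-level edit distance between a reference text and a recognized transcript, divided by the number of reference words. For TTS systems the transcript typically comes from running a speech recognizer on the synthesized audio. The function names and sample sentences are illustrative only, and the article does not say whether the 12.8% figure is a relative or an absolute reduction; `relative_reduction` simply shows what the relative reading would mean.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (one fewer than the current row) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of an extra hypothesis word
                prev[j - 1] + cost,  # substitution, or a match when cost is 0
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(old_wer: float, new_wer: float) -> float:
    """Fraction by which new_wer is lower than old_wer (assumed reading)."""
    return (old_wer - new_wer) / old_wer


# Illustrative sentences, not data from the article.
wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
print(f"WER: {wer:.3f}")
```

Under the relative reading, a hypothetical baseline WER of 10% would drop to about 8.72%, since `relative_reduction(0.10, 0.0872)` is roughly 0.128.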

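The model also improves the naturalness of the voice. When using it, users can apply SSML phoneme tags and custom lexicons for fine-grained control over pronunciation and accent. To help users get started, Microsoft has built several voice profiles, including Andrew, Ava, and Brian, for testing.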
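As a rough illustration of what an SSML phoneme tag looks like, the sketch below builds a small SSML document in which one word carries an explicit IPA pronunciation. The `speak`, `voice`, and `phoneme` elements and the `alphabet` and `ph` attributes come from the SSML standard; the helper `build_ssml`, the voice name, and the sample sentence are hypothetical placeholders, not names taken from the article or from Microsoft's documentation. Custom lexicons are not shown.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr

SSML_NAMESPACE = "http://www.w3.org/2001/10/synthesis"


def build_ssml(voice_name: str, before: str, word: str, ipa: str,
               after: str, lang: str = "en-US") -> str:
    """Return SSML where `word` is pronounced according to the IPA string."""
    return (
        f'<speak version="1.0" xmlns="{SSML_NAMESPACE}" '
        f"xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice_name)}>"
        f"{escape(before)}"
        f'<phoneme alphabet="ipa" ph={quoteattr(ipa)}>{escape(word)}</phoneme>'
        f"{escape(after)}"
        "</voice></speak>"
    )


# "placeholder-voice-name" is a stand-in; substitute a real voice name
# from whichever speech service is being used.
ssml = build_ssml(
    voice_name="placeholder-voice-name",
    before="I say ",
    word="tomato",
    ipa="təˈmeɪtoʊ",
    after=".",
)

# Parsing the result confirms the markup is well-formed XML.
ET.fromstring(ssml)
print(ssml)
```

The string is built with `escape` and `quoteattr` so that characters such as `&` or `<` in the input text cannot break the markup.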


