阿里通义开源WebSailor 检索性能超DeepSeek R1、Grok-3等模型

智通财经

Jul 07, 2025

智通财经APP获悉，近日，阿里通义开源了网络智能体WebSailor，该智能体具备强大的推理和检索能力，在高难度智能体评测集BrowseComp上，WebSailor的成绩超越了DeepSeek R1、Grok-3等模型和智能体，一举登顶开源网络智能体榜单。目前WebSailor的构建方案及部分数据集已在Github开源。

为了让WebSailor更好地掌握复杂网页信息处理能力，通义团队设计了一套创新性的训练方法，包括三个关键模块：一是“地狱级试炼场”SailorFog-QA，通过真实网页构建图谱，制造信息混淆，让模型跨越多个页面整合线索，挑战人类认知极限；二是“重构推理逻辑”，摒弃冗长重复的推理链，让模型学习简洁、直击重点的思考方式，提升思维灵活性；三是“强化学习DUPO算法”，通过动态筛选高质量训练样本，提高训练效率2~3倍。

在权威评测平台 BrowseComp-en / BrowseComp-zh 中：WebSailor-72B 得分高居开源榜首；中文榜单中，与豆包（Doubao-Search）不分上下；更在英文榜单中超过 Grok-3 等闭源模型。不仅如此，它在相对简单任务（如SimpleQA）中也表现优异。

Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.

Most Discussed

1
2
3
4
5
6
7
8
9
10

{"basename":"","ssrTDKData":{"titleTemplate":"%s - Tiger Brokers","title":"Tiger Brokers | Global Stocks, Options & Futures Trading App","description":"Tiger Brokers, one-stop investment in US stocks, SGX stocks, HK stocks, A-shares & other global assets. One of the best stock trading platforms in Singapore.","keywords":"tiger brokers,tiger trade,tiger brokers singapore,broker online,stock trading in singapore,share trading singapore,brokerage firm singapore,trading app,stock broker singapore,stock trading platforms,trading account","social":{"ogDescription":"Tiger Brokers, one-stop investment in US stocks, SGX stocks, HK stocks, A-shares & other global assets. One of the best stock trading platforms in Singapore.","ogImage":"https://c1.itigergrowtha.com/portal5/static/media/og-logo.be62fbe1.png","ogUrl":"https://www.itiger.com/news/2549224586"},"companyName":"Tiger Brokers"},"pageData":{"isMobile":false,"isTiger":false,"isTTM":true,"region":"SGP","license":"TBSG","edition":"fundamental"},"isCrawlerRequest":true,"__swrFallback__":{"@#url:\"https://stock-news.skytigris.cn/v3/news\",params:#id:\"2549224586\",edition:\"fundamental\",auth_exemption:1,,,undefined,":{"share":"https://ttm.financial/m/news/2549224586?lang=en_US&edition=fundamental","thumbnail":"","is_english":false,"pubTime":"2025-07-07 16:03","share_image_url":"https://static.laohu8.com/e9f99090a1c2ed51c021029395664489","id":"2549224586","market":"fut","top_or_hot":-1,"title":"阿里通义开源WebSailor 检索性能超DeepSeek R1、Grok-3等模型","media":"智通财经","content":"<html><body><p>智通财经APP获悉，近日，<a href=\"https://laohu8.com/S/BABA\">阿里</a>通义开源了网络智能体WebSailor，该智能体具备强大的推理和检索能力，在高难度智能体评测集BrowseComp上，WebSailor的成绩超越了DeepSeek R1、Grok-3等模型和智能体，一举登顶开源网络智能体榜单。目前WebSailor的构建方案及部分数据集已在Github开源。</p><p><img src=\"http://img.zhitongcaijing.com/images/contentformat/9d5d4ab862e562e9548db74ec7647f58.jpg\"/></p><p>为了让WebSailor更好地掌握复杂网页信息处理能力，通义团队设计了一套创新性的训练方法，包括三个关键模块：一是“地狱级试炼场”SailorFog-QA，通过真实网页构建图谱，制造信息混淆，让模型跨越多个页面整合线索，挑战人类认知极限；二是“重构推理逻辑”，摒弃冗长重复的推理链，让模型学习简洁、直击重点的思考方式，提升思维灵活性；三是“强化学习DUPO算法”，通过动态筛选高质量训练样本，提高训练效率2~3倍。</p><p>在权威评测平台 BrowseComp-en / BrowseComp-zh 中：WebSailor-72B 得分高居开源榜首；中文榜单中，与豆包（Doubao-Search）不分上下；更在英文榜单中 超过 Grok-3 等闭源模型。不仅如此，它在相对简单任务（如SimpleQA）中也表现优异。</p><p><img src=\"http://img.zhitongcaijing.com/images/contentformat/1c6b4b76ff51aed39f58846a289ae16b.jpg\"/></p><p><img src=\"http://img.zhitongcaijing.com/images/contentformat/b94bddc576107bf49b435cdd71bf3be0.jpg\"/></p><p><img src=\"http://img.zhitongcaijing.com/images/contentformat/c3893c100dcafd4c9bd692e27328b30a.jpg\"/></p></body></html>","source":"stock_zhitongcaijing","html":"<!DOCTYPE html>\n<html>\n<head>\n<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\" />\n<meta name=\"viewport\" content=\"width=device-width,initial-scale=1.0,minimum-scale=1.0,maximum-scale=1.0,user-scalable=no\"/>\n<meta name=\"format-detection\" content=\"telephone=no,email=no,address=no\" />\n<title>阿里通义开源WebSailor 检索性能超DeepSeek R1、Grok-3等模型</title>\n<style type=\"text/css\">\na,abbr,acronym,address,applet,article,aside,audio,b,big,blockquote,body,canvas,caption,center,cite,code,dd,del,details,dfn,div,dl,dt,\nem,embed,fieldset,figcaption,figure,footer,form,h1,h2,h3,h4,h5,h6,header,hgroup,html,i,iframe,img,ins,kbd,label,legend,li,mark,menu,nav,\nobject,ol,output,p,pre,q,ruby,s,samp,section,small,span,strike,strong,sub,summary,sup,table,tbody,td,tfoot,th,thead,time,tr,tt,u,ul,var,video{ font:inherit;margin:0;padding:0;vertical-align:baseline;border:0 }\nbody{ font-size:16px; line-height:1.5; color:#999; background:transparent; }\n.wrapper{ overflow:hidden;word-break:break-all;padding:10px; }\nh1,h2{ font-weight:normal; line-height:1.35; margin-bottom:.6em; }\nh3,h4,h5,h6{ line-height:1.35; margin-bottom:1em; }\nh1{ font-size:24px; }\nh2{ font-size:20px; }\nh3{ font-size:18px; }\nh4{ font-size:16px; }\nh5{ font-size:14px; }\nh6{ font-size:12px; }\np,ul,ol,blockquote,dl,table{ margin:1.2em 0; }\nul,ol{ margin-left:2em; }\nul{ list-style:disc; }\nol{ list-style:decimal; }\nli,li p{ margin:10px 0;}\nimg{ max-width:100%;display:block;margin:0 auto 1em; }\nblockquote{ color:#B5B2B1; border-left:3px solid #aaa; padding:1em; }\nstrong,b{font-weight:bold;}\nem,i{font-style:italic;}\ntable{ width:100%;border-collapse:collapse;border-spacing:1px;margin:1em 0;font-size:.9em; }\nth,td{ padding:5px;text-align:left;border:1px solid #aaa; }\nth{ font-weight:bold;background:#5d5d5d; }\n.symbol-link{font-weight:bold;}\n/* header{ border-bottom:1px solid #494756; } */\n.title{ margin:0 0 8px;line-height:1.3;color:#ddd; }\n.meta {color:#5e5c6d;font-size:13px;margin:0 0 .5em; }\na{text-decoration:none; color:#2a4b87;}\n.meta .head { display: inline-block; overflow: hidden}\n.head .h-thumb { width: 30px; height: 30px; margin: 0; padding: 0; border-radius: 50%; float: left;}\n.head .h-content { margin: 0; padding: 0 0 0 9px; float: left;}\n.head .h-name {font-size: 13px; color: #eee; margin: 0;}\n.head .h-time {font-size: 11px; color: #7E829C; margin: 0;line-height: 11px;}\n.small {font-size: 12.5px; display: inline-block; transform: scale(0.9); -webkit-transform: scale(0.9); transform-origin: left; -webkit-transform-origin: left;}\n.smaller {font-size: 12.5px; display: inline-block; transform: scale(0.8); -webkit-transform: scale(0.8); transform-origin: left; -webkit-transform-origin: left;}\n.bt-text {font-size: 12px;margin: 1.5em 0 0 0}\n.bt-text p {margin: 0}\n</style>\n</head>\n<body>\n<div class=\"wrapper\">\n<header>\n<h2 class=\"title\">\n阿里通义开源WebSailor 检索性能超DeepSeek R1、Grok-3等模型\n</h2>\n\n<h4 class=\"meta\">\n\n\n2025-07-07 16:03 北京时间&nbsp;&nbsp;&nbsp;<a href=http://www.zhitongcaijing.com/content/detail/1314604.html><strong>智通财经</strong></a>\n\n\n</h4>\n\n</header>\n<article>\n<div>\n<p>智通财经APP获悉，近日，阿里通义开源了网络智能体WebSailor，该智能体具备强大的推理和检索能力，在高难度智能体评测集BrowseComp上，WebSailor的成绩超越了DeepSeek R1、Grok-3等模型和智能体，一举登顶开源网络智能体榜单。目前WebSailor的构建方案及部分数据集已在Github开源。为了让WebSailor更好地掌握复杂网页信息处理能力，通义团队设计了一套...</p>\n\n<a href=\"http://www.zhitongcaijing.com/content/detail/1314604.html\">Source Link</a>\n\n</div>\n\n\n</article>\n</div>\n</body>\n</html>\n","isBrief":false,"type":0,"news_type":1,"symbol":"BABA","symbol_name":"阿里巴巴","start_time":0,"source_url":"http://www.zhitongcaijing.com/content/detail/1314604.html","article_id":"2549224586","we_media_id":null,"thumbnails":[],"rights":null,"url":"https://stock-news.laohu8.com/highlight/detail?id=2549224586","pubTimestamp":1751875424,"columns":[],"sourceInfo":{"source_id":"stock_zhitongcaijing","name":"智通财经网"},"weMediaInfo":null,"summary":"智通财经APP获悉，近日，阿里通义开源了网络智能体WebSailor，该智能体具备强大的推理和检索能力，在高难度智能体评测集BrowseComp上，WebSailor的成绩超越了DeepSeek R1、Grok-3等模型和智能体，一举登顶开源网络智能体榜单。目前WebSailor的构建方案及部分数据集已在Github开源。在权威评测平台 BrowseComp-en / BrowseComp-zh 中：WebSailor-72B 得分高居开源榜首；中文榜单中，与豆包不分上下；更在英文榜单中 超过 Grok-3 等闭源模型。","collect":0,"end_time":0,"defaultTopTitle":"zhitongcaijing.com","property":["earning"],"viewcount":null,"language":"zh","relate_stocks":{"BABA":"阿里巴巴"},"translate_title":"AliTongyi's open source WebSailer retrieval performance exceeds DeepSeek R1, Grok-3 and other models","themeId":null,"isJumpTheme":false,"ttsUrl":null,"symbols_score_info":{"BABA":1,"ALBmain":1},"content_text":"智通财经APP获悉，近日，阿里通义开源了网络智能体WebSailor，该智能体具备强大的推理和检索能力，在高难度智能体评测集BrowseComp上，WebSailor的成绩超越了DeepSeek R1、Grok-3等模型和智能体，一举登顶开源网络智能体榜单。目前WebSailor的构建方案及部分数据集已在Github开源。为了让WebSailor更好地掌握复杂网页信息处理能力，通义团队设计了一套创新性的训练方法，包括三个关键模块：一是“地狱级试炼场”SailorFog-QA，通过真实网页构建图谱，制造信息混淆，让模型跨越多个页面整合线索，挑战人类认知极限；二是“重构推理逻辑”，摒弃冗长重复的推理链，让模型学习简洁、直击重点的思考方式，提升思维灵活性；三是“强化学习DUPO算法”，通过动态筛选高质量训练样本，提高训练效率2~3倍。在权威评测平台 BrowseComp-en / BrowseComp-zh 中：WebSailor-72B 得分高居开源榜首；中文榜单中，与豆包（Doubao-Search）不分上下；更在英文榜单中 超过 Grok-3 等闭源模型。不仅如此，它在相对简单任务（如SimpleQA）中也表现优异。","kind":"news","is_publish_news":true,"is_publish_highlight":false,"is_publish_live":false,"is_publish_wemedia":null,"editions":null,"column":"","sentiment":"1","news_tag":"productRelease","news_rank":0,"symbols":[],"gpt_button":0,"need_auth":false,"code":"91000000","status":"200"}}}