AgiBot Releases GO-1, a General-Purpose Embodied Foundation Model That Can Learn from Human Videos

Sina Tech

Mar 10, 2025

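Sina Tech, morning of March 10: AgiBot (智元机器人), founded by Zhihui Jun (Peng Zhihui), a former member of Huawei's "Genius Youth" program, today released GO-1, which it describes as the first general-purpose embodied foundation large model.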

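Reportedly, the model introduces the Vision-Language-Latent-Action (ViLLA) architecture, which combines a VLM (multimodal large model) with an MoE (mixture of experts). The VLM draws on massive internet image-text data to acquire general scene perception and language understanding. The Latent Planner within the MoE draws on large volumes of cross-embodiment and human manipulation video data to acquire general action understanding. The Action Expert within the MoE draws on million-scale real-robot data to acquire fine-grained action execution. The three components are tightly coupled, allowing the model to learn from human videos and generalize quickly from a small number of samples, which lowers the barrier to embodied intelligence. The model has been deployed on several AgiBot robot platforms. (Wen Meng)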
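As a rough illustration of the data flow described above (VLM, then Latent Planner, then Action Expert), here is a minimal toy sketch. It is not AgiBot's implementation: every function name, the feature vector, the discrete latent tokens, and the arithmetic inside each stage are hypothetical stand-ins chosen only to show how the three stages could hand data to one another.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Observation:
    image: List[float]   # stand-in for camera pixels (assumption)
    instruction: str     # natural-language command


def vlm_encode(obs: Observation) -> List[float]:
    """Hypothetical VLM stage: fuse image and text into one feature vector."""
    image_feature = sum(obs.image) / len(obs.image)
    text_feature = float(len(obs.instruction.split()))
    return [image_feature, text_feature]


def latent_planner(features: List[float], horizon: int = 3) -> List[int]:
    """Hypothetical Latent Planner: emit a short plan of latent action tokens.

    Discrete tokens in the range 0..7 are an illustrative assumption.
    """
    seed = int(sum(features) * 10)
    return [(seed + step) % 8 for step in range(horizon)]


def action_expert(features: List[float], tokens: List[int], dof: int = 2) -> List[List[float]]:
    """Hypothetical Action Expert: decode each latent token into a joint command."""
    bias = features[0]
    return [[bias + (token + joint) / 10.0 for joint in range(dof)] for token in tokens]


def villa_step(obs: Observation) -> List[List[float]]:
    """Chain the three stages: perceive, plan in latent space, then act."""
    features = vlm_encode(obs)
    tokens = latent_planner(features)
    return action_expert(features, tokens)


if __name__ == "__main__":
    obs = Observation(image=[0.2, 0.4, 0.6], instruction="pour water")
    for command in villa_step(obs):
        print(command)
```

The point of the sketch is the separation of concerns: in the architecture as reported, each stage is trained on a different data source, so the interfaces between stages (features, latent plan, motor commands) are what allow human video to inform the planner without requiring robot action labels.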

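Reportedly, compared with the best existing models, GO-1 raised the average success rate by 32 percentage points (from 46% to 78%). It performed especially well on the "Pour Water", "Table Bussing", and "Restock Beverage" tasks.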
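The reported 32% gain is the absolute difference between the two success rates, not a relative one. A short worked calculation using only the figures quoted above (46% and 78%) makes the distinction explicit; the function name is just for illustration.

```python
def improvement(before_pct: float, after_pct: float):
    """Return (absolute gain in percentage points, relative gain in percent)."""
    points = after_pct - before_pct
    relative = points / before_pct * 100
    return points, relative


points, relative = improvement(46, 78)
print(f"{points:.0f} percentage points, {relative:.1f}% relative")
# prints "32 percentage points, 69.6% relative"
```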


Editor: Wei Yihan (尉旖涵)
