OpenAI发布最新论文：提及DeepSeek和Kimi发现了o1秘密

新浪科技

Feb 12, 2025

　　新浪科技讯 2月12日晚间消息，在中国AI公司的影响下，OpenAI 公开了O系列强化学习的秘密。今天（2月12日），OpenAI发布了关于推理模型在竞技编程中应用的研究论文报告《Competitive Programming with Large Reasoning Models》，文中放出了OpenAI三个推理模型：o1、o1-ioi、o3在IOI（国际信息学奥林匹克竞赛）和CodeForces（全球知名在线编程竞赛）中的成绩。

　　论文显示，在IOI 2024中，o3在严格规则下拿到395.64分，达成金牌成就，并且在CodeForces上的表现与人类精英选手相当。论文中特别提到，中国的DeepSeek-R1和Kimi k1.5通过独立研究显示，利用思维链学习（COT）方法，可显著提升模型在数学解题与编程挑战中的综合表现。R1、k1.5是DeepSeek和Kimi在1月20日同时发布的新型推理模型。

　　该论文通过强化学习（RL）训练的大型语言模型在复杂编码和推理任务上的性能提升，比较了通用推理模型与针对特定领域优化的系统在竞技编程中的表现。研究结果表明，增加强化学习训练计算和测试时计算可显著提升模型性能，使其接近世界顶尖人类选手，这些模型将在科学、编码、数学等领域的AI应用中解锁新的应用体验。（文猛）

海量资讯、精准解读，尽在新浪财经APP

责任编辑：王若云

Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.

Most Discussed

1
2
3
4
5
6
7
8
9
10

{"basename":"","ssrTDKData":{"titleTemplate":"%s - Tiger Brokers","title":"Tiger Brokers | Global Stocks, Options & Futures Trading App","description":"Tiger Brokers, one-stop investment in US stocks, SGX stocks, HK stocks, A-shares & other global assets. One of the best stock trading platforms in Singapore.","keywords":"tiger brokers,tiger trade,tiger brokers singapore,broker online,stock trading in singapore,share trading singapore,brokerage firm singapore,trading app,stock broker singapore,stock trading platforms,trading account","social":{"ogDescription":"Tiger Brokers, one-stop investment in US stocks, SGX stocks, HK stocks, A-shares & other global assets. One of the best stock trading platforms in Singapore.","ogImage":"https://c1.itigergrowtha.com/portal5/static/media/og-logo.be62fbe1.png","ogUrl":"https://www.itiger.com/news/2510322293"},"companyName":"Tiger Brokers"},"pageData":{"isMobile":false,"isTiger":false,"isTTM":true,"region":"SGP","license":"TBSG","edition":"fundamental"},"isCrawlerRequest":true,"__swrFallback__":{"@#url:\"https://stock-news.skytigris.cn/v3/news\",params:#id:\"2510322293\",edition:\"fundamental\",auth_exemption:1,,,undefined,":{"share":"https://ttm.financial/m/news/2510322293?lang=en_US&edition=fundamental","thumbnail":"","is_english":false,"pubTime":"2025-02-12 19:02","share_image_url":"https://static.laohu8.com/b0d1b7e8843deea78cc308b15114de44","id":"2510322293","market":"sh","top_or_hot":-1,"title":"OpenAI发布最新论文：提及DeepSeek和Kimi发现了o1秘密","media":"新浪科技","content":"<html><body><div>\n<p>　　新浪科技讯 2月12日晚间消息，在中国AI公司的影响下，OpenAI 公开了O系列强化学习的秘密。今天（2月12日），OpenAI发布了关于推理模型在竞技编程中应用的研究论文报告《Competitive Programming with Large Reasoning Models》，文中放出了OpenAI三个推理模型：o1、o1-ioi、o3在IOI（国际信息学奥林匹克竞赛）和CodeForces（全球知名在线编程竞赛）中的成绩。</p>\n<p>　　论文显示，在IOI 2024中，o3在严格规则下拿到395.64分，达成金牌成就，并且在CodeForces上的表现与人类精英选手相当。论文中特别提到，中国的DeepSeek-R1和Kimi k1.5通过独立研究显示，利用思维链学习（COT）方法，可显著提升模型在数学解题与编程挑战中的综合表现。R1、k1.5是DeepSeek和Kimi在1月20日同时发布的新型推理模型。</p>\n<p>　　该论文通过强化学习（RL）训练的大型语言模型在复杂编码和推理任务上的性能提升，比较了通用推理模型与针对特定领域优化的系统在竞技编程中的表现。研究结果表明，增加强化学习训练计算和测试时计算可显著提升模型性能，使其接近世界顶尖人类选手，这些模型将在科学、编码、数学等领域的AI应用中解锁新的应用体验。（文猛）</p>\n<div><img src=\"http://n.sinaimg.cn/finance/transform/573/w550h823/20250212/2cb0-846c6c487520a22bcbb3e992b5c9558b.png\"/><span></span></div>\n<div>\n<div><img src=\"\"/></div>\n<div>海量资讯、精准解读，尽在新浪财经APP</div>\n</div>\n<p>责任编辑：王若云 </p>\n</div></body></html>","source":"sina","html":"<!DOCTYPE html>\n<html>\n<head>\n<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\" />\n<meta name=\"viewport\" content=\"width=device-width,initial-scale=1.0,minimum-scale=1.0,maximum-scale=1.0,user-scalable=no\"/>\n<meta name=\"format-detection\" content=\"telephone=no,email=no,address=no\" />\n<title>OpenAI发布最新论文：提及DeepSeek和Kimi发现了o1秘密</title>\n<style type=\"text/css\">\na,abbr,acronym,address,applet,article,aside,audio,b,big,blockquote,body,canvas,caption,center,cite,code,dd,del,details,dfn,div,dl,dt,\nem,embed,fieldset,figcaption,figure,footer,form,h1,h2,h3,h4,h5,h6,header,hgroup,html,i,iframe,img,ins,kbd,label,legend,li,mark,menu,nav,\nobject,ol,output,p,pre,q,ruby,s,samp,section,small,span,strike,strong,sub,summary,sup,table,tbody,td,tfoot,th,thead,time,tr,tt,u,ul,var,video{ font:inherit;margin:0;padding:0;vertical-align:baseline;border:0 }\nbody{ font-size:16px; line-height:1.5; color:#999; background:transparent; }\n.wrapper{ overflow:hidden;word-break:break-all;padding:10px; }\nh1,h2{ font-weight:normal; line-height:1.35; margin-bottom:.6em; }\nh3,h4,h5,h6{ line-height:1.35; margin-bottom:1em; }\nh1{ font-size:24px; }\nh2{ font-size:20px; }\nh3{ font-size:18px; }\nh4{ font-size:16px; }\nh5{ font-size:14px; }\nh6{ font-size:12px; }\np,ul,ol,blockquote,dl,table{ margin:1.2em 0; }\nul,ol{ margin-left:2em; }\nul{ list-style:disc; }\nol{ list-style:decimal; }\nli,li p{ margin:10px 0;}\nimg{ max-width:100%;display:block;margin:0 auto 1em; }\nblockquote{ color:#B5B2B1; border-left:3px solid #aaa; padding:1em; }\nstrong,b{font-weight:bold;}\nem,i{font-style:italic;}\ntable{ width:100%;border-collapse:collapse;border-spacing:1px;margin:1em 0;font-size:.9em; }\nth,td{ padding:5px;text-align:left;border:1px solid #aaa; }\nth{ font-weight:bold;background:#5d5d5d; }\n.symbol-link{font-weight:bold;}\n/* header{ border-bottom:1px solid #494756; } */\n.title{ margin:0 0 8px;line-height:1.3;color:#ddd; }\n.meta {color:#5e5c6d;font-size:13px;margin:0 0 .5em; }\na{text-decoration:none; color:#2a4b87;}\n.meta .head { display: inline-block; overflow: hidden}\n.head .h-thumb { width: 30px; height: 30px; margin: 0; padding: 0; border-radius: 50%; float: left;}\n.head .h-content { margin: 0; padding: 0 0 0 9px; float: left;}\n.head .h-name {font-size: 13px; color: #eee; margin: 0;}\n.head .h-time {font-size: 11px; color: #7E829C; margin: 0;line-height: 11px;}\n.small {font-size: 12.5px; display: inline-block; transform: scale(0.9); -webkit-transform: scale(0.9); transform-origin: left; -webkit-transform-origin: left;}\n.smaller {font-size: 12.5px; display: inline-block; transform: scale(0.8); -webkit-transform: scale(0.8); transform-origin: left; -webkit-transform-origin: left;}\n.bt-text {font-size: 12px;margin: 1.5em 0 0 0}\n.bt-text p {margin: 0}\n</style>\n</head>\n<body>\n<div class=\"wrapper\">\n<header>\n<h2 class=\"title\">\nOpenAI发布最新论文：提及DeepSeek和Kimi发现了o1秘密\n</h2>\n\n<h4 class=\"meta\">\n\n\n2025-02-12 19:02 北京时间&nbsp;&nbsp;&nbsp;<a href=https://finance.sina.com.cn/tech/shenji/2025-02-12/doc-inekfnmc2701361.shtml><strong>新浪科技</strong></a>\n\n\n</h4>\n\n</header>\n<article>\n<div>\n<p>新浪科技讯 2月12日晚间消息，在中国AI公司的影响下，OpenAI 公开了O系列强化学习的秘密。今天（2月12日），OpenAI发布了关于推理模型在竞技编程中应用的研究论文报告《Competitive Programming with Large Reasoning Models》，文中放出了OpenAI三个推理模型：o1、o1-ioi、o3在IOI（国际信息学奥林匹克竞赛）和...</p>\n\n<a href=\"https://finance.sina.com.cn/tech/shenji/2025-02-12/doc-inekfnmc2701361.shtml\">Source Link</a>\n\n</div>\n\n\n</article>\n</div>\n</body>\n</html>\n","isBrief":false,"type":0,"news_type":1,"symbol":"BK4202","symbol_name":"服装、服饰与奢侈品","start_time":0,"source_url":"https://finance.sina.com.cn/tech/shenji/2025-02-12/doc-inekfnmc2701361.shtml","article_id":"2510322293","we_media_id":null,"thumbnails":[],"rights":null,"url":"https://stock-news.laohu8.com/highlight/detail?id=2510322293","pubTimestamp":1739358120,"columns":[],"sourceInfo":{"source_id":"sina","name":"sina"},"weMediaInfo":null,"summary":"新浪科技讯 2月12日晚间消息，在中国AI公司的影响下，OpenAI 公开了O系列强化学习的秘密。今天，OpenAI发布了关于推理模型在竞技编程中应用的研究论文报告《Competitive Programming with Large Reasoning Models》，文中放出了OpenAI三个推理模型：o1、o1-ioi、o3在IOI和CodeForces中的成绩。　　论文显示，在IOI 2024中，o3在严格规则下拿到395.64分，达成金牌成就，并且在CodeForces上的表现与人类精英选手相当。R1、k1.5是DeepSeek和Kimi在1月20日同时发布的新型推理模型。","collect":0,"end_time":0,"defaultTopTitle":"sina.com.cn","property":[],"viewcount":null,"language":"zh","relate_stocks":{"BK4202":"服装、服饰与奢侈品","RL":"拉夫劳伦","LU0006061336.USD":"Blackrock US Small and MidCap Opportunities A2 USD","BK4585":"ETF&股票定投概念","BK4588":"碎股"},"translate_title":"OpenAI releases latest paper: mentioning that DeepSeek and Kimi discovered o1 secrets","themeId":null,"isJumpTheme":false,"ttsUrl":null,"symbols_score_info":{"RL":1},"content_text":"新浪科技讯 2月12日晚间消息，在中国AI公司的影响下，OpenAI 公开了O系列强化学习的秘密。今天（2月12日），OpenAI发布了关于推理模型在竞技编程中应用的研究论文报告《Competitive Programming with Large Reasoning Models》，文中放出了OpenAI三个推理模型：o1、o1-ioi、o3在IOI（国际信息学奥林匹克竞赛）和CodeForces（全球知名在线编程竞赛）中的成绩。\n　　论文显示，在IOI 2024中，o3在严格规则下拿到395.64分，达成金牌成就，并且在CodeForces上的表现与人类精英选手相当。论文中特别提到，中国的DeepSeek-R1和Kimi k1.5通过独立研究显示，利用思维链学习（COT）方法，可显著提升模型在数学解题与编程挑战中的综合表现。R1、k1.5是DeepSeek和Kimi在1月20日同时发布的新型推理模型。\n　　该论文通过强化学习（RL）训练的大型语言模型在复杂编码和推理任务上的性能提升，比较了通用推理模型与针对特定领域优化的系统在竞技编程中的表现。研究结果表明，增加强化学习训练计算和测试时计算可显著提升模型性能，使其接近世界顶尖人类选手，这些模型将在科学、编码、数学等领域的AI应用中解锁新的应用体验。（文猛）\n\n\n\n海量资讯、精准解读，尽在新浪财经APP\n\n责任编辑：王若云","kind":"news","is_publish_news":true,"is_publish_highlight":false,"is_publish_live":false,"is_publish_wemedia":null,"editions":null,"column":"","sentiment":"0","news_tag":"","news_rank":0,"symbols":[],"gpt_button":1,"need_auth":false,"code":"91000000","status":"200"}}}