Consumer Electronics: DeepSeek-R1 Cuts Costs and Raises Efficiency; Bullish on the ASIC Track and on Upside in Applications

Tianfeng Securities (天风证券股份有...)

Feb 09, 2025

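1. DeepSeek-R1 has recently drawn worldwide attention for its low training cost and strong performance. This stems mainly from several cost-cutting, efficiency-raising innovations in its V3 base model, and from the second-stage reinforcement learning training added in R1, which substantially improves reasoning ability. Pre-trained model V3: the key innovations are 1) a Multi-head Latent Attention (MLA) mechanism that cuts the KV cache required per query by about 93.3%, reducing the hardware needed per query and thereby sharply lowering inference cost; 2) Multi-Token Prediction (MTP), which adds attention modules that predict the next several tokens; these markedly improve model performance during training and can be removed at inference, so the gain comes at low compute cost; 3) as a mixture-of-experts model, a gating network that routes tokens to suitable experts in a balanced way without hurting model performance, which raises training efficiency and also lowers inference cost. R1, which adds post-training: it uses reinforcement-learning fine-tuning rather than supervised fine-tuning, and shows the ability to learn reasoning from scratch.

2. Under the new AI innovation paradigm, iteration in post-training and inference may bring major growth opportunities for ASICs. 1) Scaling law vs. the new reasoning paradigm: from 2020 to 2023, models were trained on massive amounts of internet text and needed only a small amount of additional training. That earlier paradigm relied on pre-training: other things being equal, scaling up the training of an AI system produced steady performance gains across cognitive tasks. This approach has become increasingly expensive and no longer delivers robust progress. In 2024, using reinforcement learning (RL) to train models to generate chains of thought became the new focus of model scaling. It concentrates on improving reasoning through synthetic data generation and RL in post-training on existing models; it iterates faster, is still early on the scaling curve, and yields significant gains with relatively little compute. 2) ASICs: in AI, ASICs achieve efficient inference and computation through customized optimization for specific algorithms. Their characteristics suit the new paradigm, in which models are trained for performance on specific, objectively measurable tasks (such as math and programming competitions) and similar reasoning tasks, and they help break the GPU monopoly and reduce costs. According to the Elecfans (电子发烧友网) official account and Marvell's forecast, ASICs accounted for 16% of data-center accelerated computing chips in 2023, a market of about USD 6.6 billion; as AI compute demand grows, the ASIC share is expected to rise to 25%, with the data-center ASIC market forecast to reach USD 42.9 billion in 2028, a CAGR of 45.4%. Broadcom says Google, Meta and Amazon are all major customers for its custom AI chips. Its CEO says the company's AI revenue from hyperscale customers will reach USD 60-90 billion in 2027, nearly doubling every year, and expects that perhaps 50% of future compute will be ASIC. 3) The latest quarterly results of overseas AI-chain companies META, Microsoft and CLS (Celestica) diverged, confirming the industry trend of slowing pre-training growth and rapid ASIC growth driven by post-training and inference. Meta's revenue hit a record high, driven by its AI advertising business; it expects that in 2025 it may build an AI agent with the coding and problem-solving ability of a mid-level engineer, which could become one of the most important innovations in history and grow into a very large market. Microsoft's cloud growth slowed; DeepSeek-R1 is already available through Microsoft's AI platform and will soon run on Microsoft's Copilot+ PCs. CLS may be benefiting from the customization trend: it reported strong demand in its CCS business, with 24Q4 revenue up 30% year over year and 3% quarter over quarter, and its share of revenue up 6 percentage points to 68%. CCS (Connectivity & Cloud Solutions) provides customized HPS products and hardware platform solutions to customers in the storage, server and communications markets; Amazon, Google Cloud, Microsoft or Meta may be among its large customers.

3. By analogy with the Jevons paradox, we remain positive on compute investment and on the growth trend in compute demand. 1) Compute demand may shift from pre-training to post-training and inference while continuing to grow rapidly: under the Jevons paradox, when a resource is used more efficiently, each use consumes less, but because the cost falls and use becomes more convenient, people may use more of it, so total consumption rises. We believe the development of large models follows the same pattern. Anthropic's CEO argues that companies keep increasing their spending on training powerful AI models even though the cost curve shifts down periodically and the cost of training a model at a given level of intelligence is falling quickly; the savings are reinvested in developing smarter models with the same very large budgets. 2) Compute investment continues: on META's earnings call, Zuckerberg projected that Meta's capital expenditure this year will be between USD 60 billion and USD 65 billion, with a strong push into AI. Over the next few years Meta will also invest hundreds of billions of dollars in AI infrastructure. Microsoft expects to spend more than USD 80 billion on AI data centers in fiscal 2025.

4. As the cost of AI applications falls, growth elasticity may emerge, and we are positive on the release of application-side potential. Overseas, we favor META and others with the soft capability to convert AI into vertical applications; in China, we favor the Apple supply chain, with its mature ecosystem capabilities, and smart hardware innovation. 1) Apple supply chain: hardware and software innovation continue to act as catalysts, and Apple is entering a new product cycle; we remain positive on the lift this cycle gives to the valuations and earnings of Apple suppliers. 2) Smart hardware: we are positive on diversified innovation in consumer electronics enabled by AI. According to wellsennXR's forecast, starting in 2025 AI smart glasses will rapidly penetrate the traditional eyewear market against a backdrop of steady growth in traditional eyewear sales; annual sales of AI smart glasses may reach 55 million pairs in 2029 and 1.4 billion pairs by 2035. We are positive on AI devices driving hardware demand.

Suggested focus: Apple supply chain: Luxshare Precision (立讯精密), Lingyi iTech (领益智造), Lens Technology (蓝思科技), Innovation New Material (创新新材, jointly covered with the metals and materials team and the machinery team), Foxconn Industrial Internet (工业富联), Avary Holding (鹏鼎控股), Dongshan Precision (东山精密), Zhuhai CosMX (珠海冠宇, jointly covered with the power equipment and new energy team), BYD Electronic (比亚迪电子, Hong Kong-listed), Cowell e Holdings (高伟电子, Hong Kong-listed), Sunway Communication (信维通信), Sunwoda (欣旺达, jointly covered with the power equipment and new energy team), Crystal-Optech (水晶光电), JCET (长电科技), Lante Optics (蓝特光学), Jones Tech (中石科技), among others; AI SoC: Bestechnic (恒玄), SigmaStar (星宸), Rockchip (瑞芯微), Amlogic (晶晨), Allwinner (全志), Espressif (乐鑫), Bluetrum (中科蓝讯), Actions Technology (炬芯), Fullhan Microelectronics (富瀚微), among others; domestic compute: Foxconn Industrial Internet (工业富联), SMIC (中芯国际), Cambricon (寒武纪), Hygon (海光), Loongson (龙芯中科), among others; storage: GigaDevice (兆易创新), Longsys (江波龙).

Risk warning: geopolitical risk; slower-than-expected iteration of new AI technology; weaker-than-expected downstream compute demand.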
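As a quick arithmetic check on the ASIC market forecast quoted in the note (about USD 6.6 billion in 2023 rising to USD 42.9 billion in 2028), the sketch below recomputes the implied compound annual growth rate. The function name `cagr` and the constant names are illustrative choices, not taken from the source, and the check assumes five compounding periods between 2023 and 2028.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate as a fraction (0.10 means 10%)."""
    if start_value <= 0 or end_value <= 0 or years <= 0:
        raise ValueError("values and years must be positive")
    return (end_value / start_value) ** (1.0 / years) - 1.0


# Figures quoted in the note, in billions of US dollars.
ASIC_MARKET_2023 = 6.6
ASIC_MARKET_2028 = 42.9

# Assumption: 2023 is the base year, so 2028 is five compounding periods later.
implied = cagr(ASIC_MARKET_2023, ASIC_MARKET_2028, 2028 - 2023)
print(f"Implied 2023-2028 CAGR: {implied:.1%}")
```

Under that assumption the result rounds to the 45.4% stated in the note, so the two endpoint figures and the quoted growth rate are mutually consistent.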
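The note describes the mixture-of-experts gating network only in one sentence: tokens are routed to suitable experts in a balanced way. To make the idea concrete, here is a minimal sketch of generic top-k softmax gating with a simple count of tokens per expert. It is not DeepSeek-V3's actual router; the function names, the expert count, the vector size and the random gate weights are all assumptions made for illustration, and the sketch only measures load, it does not implement any balancing mechanism.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(token_vec, gate_weights, top_k=2):
    """Return [(expert_index, weight), ...] for a single token.

    gate_weights holds one vector per expert; each expert's logit is the
    dot product of that vector with the token vector.
    """
    logits = [sum(w * x for w, x in zip(row, token_vec)) for row in gate_weights]
    probs = softmax(logits)
    # Keep the top_k most probable experts and renormalise their weights.
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]


def expert_load(tokens, gate_weights, top_k=2):
    """Count how many tokens are routed to each expert."""
    counts = [0] * len(gate_weights)
    for token_vec in tokens:
        for expert_index, _ in route_token(token_vec, gate_weights, top_k):
            counts[expert_index] += 1
    return counts


# Hypothetical toy setup: 4 experts, 8-dimensional tokens, random gate weights.
rng = random.Random(0)
NUM_EXPERTS, DIM = 4, 8
gate = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]
tokens = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(1000)]

load = expert_load(tokens, gate)
print("tokens routed per expert:", load)
```

Because each token activates only `top_k` of the experts, the compute per token scales with `top_k` rather than with the total number of experts, which is the cost argument the note makes. An uneven `load` list would indicate the imbalance that a production router has to correct.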

Source Link: http://gu.qq.com/resources/shy/news/detail-v2/index.html#/?id=nesSN20250209190320961cdccb&s=b

Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation to acquire or dispose of any financial products, and any associated discussions, comments, or posts by the author or other users should not be regarded as such either. It is for general information purposes only and does not consider your own investment objectives, financial situation or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information. Investors should do their own research and may seek professional advice before investing.


