Intellifusion Bets Big on Inference AI Chips as Demand Surges, Eyes 2026 Lau...

Time: 2025-07-25 18:01:06

  
 

  AsianFin -- Intellifusion, one of China’s earliest AI chip developers, is making a strategic pivot toward AI model inference—betting that the era of training dominance is giving way to inference-led growth in computing demand.

  The Shenzhen-based firm, listed on Shanghai’s STAR Market, unveiled its latest suite of inference-focused products on July 25, ahead of the 2025 World Artificial Intelligence Conference. Among them is the DeepQiong X6000 Mesh inference accelerator card, which offers 256 TOPS of compute and is optimized for high-throughput workloads such as decoding 256 video streams in real time and supporting large models with hundreds of billions of parameters.

  Intellifusion’s new all-in-one servers (Shenmu 6203, Tianzhou 6408, and Tianzhou 680G) extend this performance into data centers and edge environments, delivering up to 4 PFLOPS of inference capacity. CEO Chen Ning says these products mark a turning point for the company, which is now “fully committed” to inference computing chips after 11 years of neural processing unit development.

  “2025 will be a defining year for AI. Large models are maturing, costs are falling, and inference is about to outpace training in both growth and application,” Chen told TMTPost.

  
 

  AI development is typically divided into two stages: training, which demands massive datasets and compute, and inference, where trained models are deployed to solve real-world problems. As AI adoption broadens—from chatbots to autonomous vehicles—cloud-based inference is quickly taking center stage.

  According to IDC, cloud-based inference accounted for 58.5% of AI computing power in 2022 and is projected to hit 62.2% by 2026. AMD CEO Lisa Su forecasts AI inference compute demand will grow over 80% annually—potentially surpassing training as the primary driver for data center expansion.

  “The inference chip market remains a blue ocean,” Chen said. “While the training chip sector is worth hundreds of billions, inference is just beginning. We believe it will outpace training within five years.”

  At the heart of Intellifusion’s new offerings is the DeepQiong X6000 Mesh accelerator card, powered by the firm’s self-developed fourth-generation NPU optimized for Transformer-based models. The card uses a D2D Chiplet design and C2C mesh architecture—an innovation in China’s AI chip ecosystem. Intellifusion claims it is the first company to mass-produce such chips using fully domestic fabrication and packaging processes.

  Complementing the chip, Intellifusion is rolling out inference servers and integrated machines for data centers and smart city deployments. Customers include municipal computing centers, telecom carriers, research institutes, and major Chinese internet firms.

  “The DeepSeek all-in-one machines break the ‘last mile’ in closed-loop AI deployment,” Chen said, adding that the cooling of AI hype is not a retreat but a rational reshuffling toward real-world use cases.

  Intellifusion’s shift is already showing results. The company reported 2024 revenue of more than 900 million yuan, up 81.3% year-on-year. Q1 2025 revenue surged 168.2% to 264 million yuan, a record for the period.

  A deal with Deyuan Fanghui to provide 4,000 PFLOPS in inference compute over three years is expected to contribute 1.6 billion yuan in revenue. Payments began in early 2025, with roughly 200 million yuan booked in the first half.

  On the consumer side, Intellifusion is seeing strong uptake of its Qiancheng AI technologies in wearables, supplying Huawei, Honor, and OPPO, while its “Dr. Luka” hardware line continues to gain traction. The company expects its consumer business to grow by more than 50% in H1 2025.

  Looking ahead, Intellifusion is preparing to launch its next-generation inference chip architecture—“Computing Power Building Blocks 2.0”—by late 2026, featuring:

  Native FP8/FP4, custom operators for large models, 5× compute efficiency, 3× energy efficiency.

  10× bandwidth and memory efficiency.

  Full-mesh, all-reduce, memory semantic access.

  Heterogeneous die, UCIe D2D chiplets.

  PCIe interface with CPU-NPU shared memory access.

  CTO Li Aijun says the upgrades will support embedded, edge, and cloud inference for models such as MoE and edge-scale large models.

  Founded in 2014, Intellifusion has invested heavily in edge computing chips and has already shipped five generations of NPUs. In 2023, it launched its DeepEdge10 platform, targeting scenarios from IoT to intelligent computing centers.

  Now, the company is placing its biggest bet yet on inference.

  “Most inventions in the U.S. stay in labs,” said Chen. “But in China, the value is in large-scale implementation. AI inference chips will become the core infrastructure enabling AI to reshape all hardware—from glasses to robots—over the next five years.”

  Chen believes that by linking data, algorithms, and chip development through China’s vast application scenarios, Intellifusion can drive a “data flywheel” of continuous innovation. He sees AI inference chips as China’s opportunity to gain a foothold in the Fourth Industrial Revolution.

  “Our biggest asset isn’t chips. It’s our team,” he said. “With the right DNA, we’ll overcome challenges—from supply chains to ecosystems—and continue building a globally competitive inference chip company.”
