Infinigence's 'Token Factory' Model Drives 20x Growth, Optimizing AI Inference

🤖 AI 📰 China 🕐 2 hr ago

Infinigence, a Chinese AI infrastructure company with deep ties to Tsinghua University's Department of Electronic Engineering, has carved out a distinct position in the AI value chain as a neutral "token factory" between chip manufacturers and model developers. According to data disclosed in May, token call volume on the company's Agentic MaaS platform grew more than 20x from December to April, reflecting a structural shift in the AI industry.

Token call volume on Infinigence's Agentic MaaS platform has grown more than 20x since December, reflecting a shift toward inference as the dominant AI workload. As a neutral 'token factory,' the company optimizes compute resources between chip manufacturers and model developers. Its strategy centers on token economics and on cost-performance gains from techniques such as prefill-decode separation. This approach supports efficient deployment of domestic Chinese chips and lets smaller teams achieve significant AI-driven productivity gains, a shift likened to the mobile internet's transition from 3G to 4G.

Infinigence's innovative 'token factory' model addresses the growing demand for AI inference, optimizing resource allocation and driving significant growth by bridging the gap between hardware and software.

#llm #tech #chip #software #hardware

This summary is auto-compiled from XML. Visit the original article for the full text.