Moonshot AI's Kimi K2 outperforms GPT-4 in key benchmarks – and it's free

10 months ago

Moonshot AI released Kimi K2, an open-source language model with 1 trillion parameters, optimized for coding and autonomous agent tasks.
Kimi K2 outperforms proprietary models like OpenAI's GPT-4.1 and Anthropic's Claude in benchmarks, especially in coding (53.7% accuracy on LiveCodeBench) and math (97.4% on MATH-500).
The model uses a mixture-of-experts architecture with 32 billion active parameters and introduces the MuonClip optimizer, reducing training instability and costs.
Moonshot AI's pricing strategy ($0.15 per million input tokens) undercuts competitors while offering comparable performance, leveraging open-source adoption for market expansion.
Kimi K2 excels in autonomous task execution, handling multi-step workflows like data analysis and travel planning without human intervention.
The release signals a convergence between open-source and proprietary AI models, challenging incumbents' business models and technological advantages.

Hasty Briefsbeta