Provider of advanced AI models and technologies.
Kimi Linear is a hybrid linear attention architecture that outperforms traditional full attention methods. Features Kimi Delta Attention (KDA) for efficient memory usage, reducing KV caches by up to 75% and boosting throughput by up to 6x for contexts as long as 1M tokens.
Kimi K2 Thinking is Moonshot AI's most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in Kimi K2, it activates 32 billion parameters per forward pass and supports 256K-token context windows. The model is optimized for persistent step-by-step thought, dynamic tool invocation, and complex reasoning workflows that span hundreds of turns. It interleaves step-by-step reasoning with tool use, enabling autonomous research, coding, and writing that can persist for hundreds of sequential actions without drift.