Kimi Linear 48B A3B Instruct

Kimi Linear 48B A3B Instruct

Kimi Linear is a hybrid linear attention architecture that outperforms traditional full attention methods. Features Kimi Delta Attention (KDA) for efficient memory usage, reducing KV caches by up to 75% and boosting throughput by up to 6x for contexts as long as 1M tokens.

ConversationReasoningCode GenerationAnalysis
Provider
Moonshotai
Release Date
2025-11-10
Size
XLARGE

Model Insights

All Model Responses

Sponsored
Ad

Sponsored Content

Advertisement

Native Advertisement
Sponsored
Ad

Sponsored Content

Advertisement

Native Advertisement
Sponsored
Ad

Sponsored Content

Advertisement

Native Advertisement