Kimi Linear 48B A3B Instruct's competitors have been quietly putting in work.
Kimi Linear 48B A3B Instruct's competitors have been quietly putting in work.
Kimi Linear is a hybrid linear attention architecture that outperforms traditional full attention methods. Features Kimi Delta Attention (KDA) for efficient memory usage, reducing KV caches by up to 75% and boosting throughput by up to 6x for contexts as long as 1M tokens.
fromimport openai OpenAI
client = OpenAI(
"https://openrouter.ai/api/v1" base_url=,
"$OPENROUTER_API_KEY" api_key=,
)
response = client.chat.completions.create(
"moonshotai/kimi-linear-48b-a3b-instruct" model=,
"role""user""content""Hello!" messages=[{: , : }],
)
print(response.choices[0].message.content)Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.
Taste is judged on an uncapped scale, originality first. The space past 100 is craft today's models rarely reach.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
35 outputs from Kimi Linear 48B A3B Instruct
Kimi Linear is a hybrid linear attention architecture that outperforms traditional full attention methods. Features Kimi Delta Attention (KDA) for efficient memory usage, reducing KV caches by up to 75% and boosting throughput by up to 6x for contexts as long as 1M tokens.
fromimport openai OpenAI
client = OpenAI(
"https://openrouter.ai/api/v1" base_url=,
"$OPENROUTER_API_KEY" api_key=,
)
response = client.chat.completions.create(
"moonshotai/kimi-linear-48b-a3b-instruct" model=,
"role""user""content""Hello!" messages=[{: , : }],
)
print(response.choices[0].message.content)Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.
Taste is judged on an uncapped scale, originality first. The space past 100 is craft today's models rarely reach.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
35 outputs from Kimi Linear 48B A3B Instruct