Google: Gemma 4 26B A4B's competitors have been quietly putting in work.
Google: Gemma 4 26B A4B's competitors have been quietly putting in work.
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at a fraction of the compute cost. Supports multimodal input including text, images, and video (up to 60s at 1fps). Features a 256K token context window, native function calling, configurable thinking/reasoning mode, and structured output support. Released under Apache 2.0.
fromimport openai OpenAI
client = OpenAI(
"https://openrouter.ai/api/v1" base_url=,
"$OPENROUTER_API_KEY" api_key=,
)
response = client.chat.completions.create(
"google/gemma-4-26b-a4b-it" model=,
"role""user""content""Hello!" messages=[{: , : }],
)
print(response.choices[0].message.content)Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.
Also on Cloudflare Workers AI
Taste is judged on an uncapped scale, originality first. The space past 100 is craft today's models rarely reach.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
53 outputs from Google: Gemma 4 26B A4B
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at a fraction of the compute cost. Supports multimodal input including text, images, and video (up to 60s at 1fps). Features a 256K token context window, native function calling, configurable thinking/reasoning mode, and structured output support. Released under Apache 2.0.
fromimport openai OpenAI
client = OpenAI(
"https://openrouter.ai/api/v1" base_url=,
"$OPENROUTER_API_KEY" api_key=,
)
response = client.chat.completions.create(
"google/gemma-4-26b-a4b-it" model=,
"role""user""content""Hello!" messages=[{: , : }],
)
print(response.choices[0].message.content)Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.
Also on Cloudflare Workers AI
Taste is judged on an uncapped scale, originality first. The space past 100 is craft today's models rarely reach.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
53 outputs from Google: Gemma 4 26B A4B