Qwen3 30B A3B

Qwen3 30B A3B is a mixture-of-experts (MoE) model from the latest Qwen generation, with 30.5B total parameters, of which 3.3B are activated. It excels in reasoning, multilingual support, and agent tasks, and it can switch between a thinking mode and a non-thinking mode. It supports up to 131K tokens of context with YaRN. A free tier is available on OpenRouter.
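The thinking/non-thinking switch can be illustrated with a minimal sketch. It assumes that thinking-mode output wraps the reasoning in a single `<think>...</think>` block ahead of the final answer, and that the `/think` and `/no_think` soft switches are appended to the user prompt. The helper names `with_mode` and `split_thinking` are hypothetical and not part of any Qwen or OpenRouter API.

```python
import re

# Assumption: the reasoning appears in one <think>...</think> block.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def with_mode(prompt: str, thinking: bool) -> str:
    """Append the soft switch that requests thinking or non-thinking mode."""
    switch = "/think" if thinking else "/no_think"
    return f"{prompt} {switch}"


def split_thinking(text: str) -> tuple[str, str]:
    """Return (reasoning, answer); reasoning is empty if no <think> block exists."""
    match = THINK_RE.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # Drop the <think> block and keep everything else as the answer.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


if __name__ == "__main__":
    print(with_mode("What is 2 + 2?", thinking=False))
    sample = "<think>\n2 + 2 is 4\n</think>\n\nThe answer is 4."
    reasoning, answer = split_thinking(sample)
    print("reasoning:", reasoning)
    print("answer:", answer)
```

Separating the reasoning from the answer this way lets an application log or hide the reasoning while showing only the final reply. An empty `<think>` block simply yields an empty reasoning string.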

Conversation, Reasoning, Code Generation, Analysis
Provider: Qwen
Release Date: 2025-04-28
Size: LARGE
Parameters: 30.5B (3.3B Active MoE)

Related Models

Qwen: Qwen3 235B A22B 2507

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in knowledge coverage, long-context reasoning, coding benchmarks, and alignment with open-ended tasks. It is particularly strong on multilingual understanding, math reasoning (e.g., AIME, HMMT), and alignment evaluations like Arena-Hard and WritingBench.

Conversation, Reasoning, Code Generation, +1 more