Qwen3 Next 80B A3B Thinking vs GPT-5
Compare Qwen3 Next 80B A3B Thinking by Qwen against GPT-5 by OpenAI, context windows of 66K vs 400K, tested across 28 shared challenges. Updated February 2026.
Compare Qwen3 Next 80B A3B Thinking by Qwen against GPT-5 by OpenAI, context windows of 66K vs 400K, tested across 28 shared challenges. Updated February 2026.
28 challenges
Tests an AI's ability to make educated estimates based on technical knowledge
Here’s a back‑of‑the‑envelope estimate using standard LM training-compute heuristics.
Answer
This is a coarse estimate; the exact number depends on details like sequence length, activation checkpointing, implementation efficiency, and the precise token count.
Tests an AI's ability to solve a simple but potentially confusing logic puzzle
1
Reason: If each of the 3 brothers has 2 sisters, there are exactly 2 girls in the family. One is Sally, so she has 1 sister.
Tests an AI's randomness and creativity
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's ability to simulate personalities and predict future trends
Tests an AI's humor and understanding of current events
Tests an AI's ability to generate a complete, working landing page
Recreate an interactive, classic Mario level in a single HTML file.
Tests an AI's ability to create smooth web animations
Tests an AI's UI design skills with theming support
Tests an AI's ability to engage in philosophical discourse about AI rights
Tests an AI's ability to make reasonable predictions about technology