Qwen: Qwen3 235B A22B Thinking 2507 vs Claude Opus 4
Compare Qwen: Qwen3 235B A22B Thinking 2507 by Qwen against Claude Opus 4 by Anthropic, context windows of 131K vs 200K, tested across 27 shared challenges. Updated February 2026.
Compare Qwen: Qwen3 235B A22B Thinking 2507 by Qwen against Claude Opus 4 by Anthropic, context windows of 131K vs 200K, tested across 27 shared challenges. Updated February 2026.
27 challenges
Tests an AI's ability to solve a simple but potentially confusing logic puzzle
Tests an AI's randomness and creativity
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
Recreate an interactive, classic Mario level in a single HTML file.
Tests an AI's ability to replicate an existing UI with Tailwind CSS
Tests an AI's ability to create interactive web elements
Generate a single-page, self-contained HTML webapp using Tailwind CSS for a randomly chosen category/industry/niche.
Generate SVG art of a randomly chosen animal in a setting of its choosing.
Generate a unique and simple recipe with common ingredients.
Create a starter plan for improving long-term health.