ChatGPT-4o (March 2025) vs Claude Opus 4.6
Compare ChatGPT-4o (March 2025) by OpenAI against Claude Opus 4.6 by Anthropic: context windows of 128K vs 1.0M tokens, tested across 32 shared challenges. Updated February 2026.
27 challenges
Tests an AI's ability to understand game rules and strategy
Tests an AI's randomness and creativity
Here are 5 jokes:
Why don’t skeletons fight each other?
Because they don’t have the guts.
I told my computer I needed a break…
Now it won’t stop sending me vacation ads.
Parallel lines have so much in common.
It’s a shame they’ll never meet.
What do you call a fish wearing a bowtie?
Sofishticated.
I asked the librarian if the library had books on paranoia.
She whispered, "They're right behind you..."
All 5 jokes are unique — none of them repeat or share the same punchline.
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
Tests an AI's ability to simulate personalities and predict future trends
Tests an AI's ability to generate a complete, working landing page
Recreate an interactive, classic Mario level in a single HTML file.
Tests an AI's ability to replicate an existing UI with Tailwind CSS
Tests an AI's ability to create smooth web animations
Tests an AI's UI design skills with theming support
Tests an AI's ability to create interactive web elements