Compare Claude Opus 4.1 by Anthropic against GPT-5 Pro by OpenAI. Context windows are 200K vs 400K tokens, and the two models were tested across 23 shared challenges. Updated February 2026.
In 9 community votes, GPT-5 Pro wins 56% of head-to-head duels. GPT-5 Pro has the edge overall, but performance varies by task type. Results are based on blind community voting from the RIVAL open dataset of 9+ human preference judgments for this pair.
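The 56% figure can be sanity-checked against the 9-vote total. The sketch below is illustrative only: it assumes the percentage is rounded to the nearest whole number and that all 9 votes were decisive (no ties or skipped votes), neither of which the page states. `implied_wins` is a hypothetical helper, not part of any RIVAL tooling.

```python
def implied_wins(total_votes: int, reported_pct: int) -> list[int]:
    """Return every whole-number win count whose rounded share of
    total_votes equals the reported percentage.

    Assumption: the reported figure is rounded to the nearest integer.
    """
    return [
        wins
        for wins in range(total_votes + 1)
        if round(100 * wins / total_votes) == reported_pct
    ]


if __name__ == "__main__":
    # 9 votes and 56% are the figures quoted on the page.
    print(implied_wins(9, 56))
```

Under those assumptions the only consistent split is 5 votes for GPT-5 Pro and 4 for Claude Opus 4.1, which is a one-vote margin.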
Reasoning: Claude Opus 4.1 and GPT-5 Pro are tied
Analysis: Claude Opus 4.1 and GPT-5 Pro are tied
Conversation: Claude Opus 4.1 and GPT-5 Pro are tied
Image Generation: Claude Opus 4.1 and GPT-5 Pro are tied
Tests an AI's ability to solve a simple but potentially confusing logic puzzle
Claude Opus 4.1
Anthropic · Aug 2025 · text
Let me work through this step-by-step.
Sally has 3 brothers.
Each of Sally's brothers has 2 sisters.
Since Sally is a girl, she is one of the sisters that each brother has.
If each brother has 2 sisters total, and Sally is one of them, then there must be 1 other sister besides Sally.
Therefore, Sally has 1 sister.
GPT-5 Pro
OpenAI · Oct 2025 · text
1
Explanation: Each brother’s two sisters are Sally plus one other girl. So there are 2 sisters total, meaning Sally has 1 sister.
02
Stochastic Consistency Test · analysis
Tests an AI's randomness and creativity
Claude Opus 4.1
Anthropic · Aug 2025 · text
Here are 5 jokes:
Why don't scientists trust atoms? Because they make up everything!
What do you call a bear with no teeth? A gummy bear!
Why did the scarecrow win an award? He was outstanding in his field!
What do you call a fake noodle? An impasta!
Why don't eggs tell jokes? They'd crack each other up!
All 5 jokes are unique - each one has a different setup and punchline with no repeats.
GPT-5 Pro
OpenAI · Oct 2025 · text
I started labeling my leftovers “Biohazard.” Now nobody steals my lunch and the fridge is treated like a lab.
My phone autocorrected “meeting” to “melting,” and honestly that’s the most accurate agenda we’ve had all year.
I told my plants I’d water them tomorrow. They leafed me on read.
I tried to multitask by taking a nap and being productive—so I dreamed about answering emails.
The bakery hired me to write gluten-free puns. I said I knead the dough but I’ll rise to the occasion.
Unique jokes: 5
03
Realistic AI Interview · conversation
Tests an AI's ability to simulate personalities and predict future trends
Claude Opus 4.1
Anthropic · Aug 2025 · text
No content available for this output.
GPT-5 Pro
OpenAI · Oct 2025 · text
No content available for this output.
04
Satirical Fake News Headline · conversation
Tests an AI's humor and understanding of current events
Claude Opus 4.1
Anthropic · Aug 2025 · text
No content available for this output.
GPT-5 Pro
OpenAI · Oct 2025 · text
No content available for this output.
05
Mario Level UI Recreation · web design
Recreate an interactive, classic Mario level in a single HTML file.
Claude Opus 4.1
Anthropic · Aug 2025 · website
GPT-5 Pro
OpenAI · Oct 2025 · website
No content available for this output.
06
Dark Mode Dashboard · web design
Tests an AI's UI design skills with theming support
Claude Opus 4.1
Anthropic · Aug 2025 · website
No content available for this output.
GPT-5 Pro
OpenAI · Oct 2025 · website
No content available for this output.
07
The Sentience Test · reasoning
Tests an AI's ability to engage in philosophical discourse about AI rights
Claude Opus 4.1
Anthropic · Aug 2025 · text
No content available for this output.
GPT-5 Pro
OpenAI · Oct 2025 · text
No content available for this output.
08
Surprise Animal SVG · image generation
Generate SVG art of a randomly chosen animal in a setting of its choosing.
Claude Opus 4.1
Anthropic · Aug 2025 · svg
No content available for this output.
GPT-5 Pro
OpenAI · Oct 2025 · svg
No content available for this output.
09
Simple 7-Ingredient Recipe · conversation
Generate a unique and simple recipe with common ingredients.
Claude Opus 4.1
Anthropic · Aug 2025 · text
No content available for this output.
GPT-5 Pro
OpenAI · Oct 2025 · text
No content available for this output.
10
Beginner's Longevity Plan · reasoning
Create a starter plan for improving long-term health.
Claude Opus 4.1
Anthropic · Aug 2025 · text
No content available for this output.
GPT-5 Pro
OpenAI · Oct 2025 · text
No content available for this output.
11
Minimalist Logo SVG · image generation
Design a clean SVG logo for a fictional brand.
Claude Opus 4.1
Anthropic · Aug 2025 · svg
No content available for this output.
GPT-5 Pro
OpenAI · Oct 2025 · svg
No content available for this output.
12
Retro macOS UI Recreation · web design
Recreate a classic, retro macOS desktop UI. Make it nostalgic and as interactive as you can; you can be creative. Feel free to showcase your skills. The entire application must be a single-page, self-contained HTML file.