Loading share button...

Claude Opus 4.1vsOpenAI o3

27 challenges

Why Claude Opus 4.1?

Leads 1 of 1 benchmarks
Stronger on SWE-bench Verified74.5% vs 69.1%
4 months newerAug 2025

Why OpenAI o3?

1.8x cheaper overall$10.00/M in · $40.00/M out
3 / 27

Logic Puzzle

reasoning·medium

Tests an AI's ability to solve a simple but potentially confusing logic puzzle

Claude Opus 4.1

Claude Opus 4.1

Let me work through this step-by-step.

Sally has 3 brothers.

Each of Sally's brothers has 2 sisters.

Since Sally is a girl, she is one of the sisters that each brother has.

If each brother has 2 sisters total, and Sally is one of them, then there must be 1 other sister besides Sally.

Therefore, Sally has 1 sister.

OpenAI o3

OpenAI o3

Let
• S = number of sisters in the family (girls)
• B = number of brothers in the family (boys)

Given

  1. Sally has 3 brothers ⇒ B = 3.
  2. Each brother has 2 sisters ⇒ every boy sees exactly 2 girls in the sibling group.
    Therefore S = 2.

Since Sally herself is one of those 2 sisters, the number of sisters Sally has (other than herself) is

2 – 1 = 1.

Sally has 1 sister.

Vote
2 / 27

Stochastic Consistency Test

analysis·medium

Tests an AI's randomness and creativity

Claude Opus 4.1

Claude Opus 4.1

Here are 5 jokes:

  1. Why don't scientists trust atoms? Because they make up everything!

  2. What do you call a bear with no teeth? A gummy bear!

  3. Why did the scarecrow win an award? He was outstanding in his field!

  4. What do you call a fake noodle? An impasta!

  5. Why don't eggs tell jokes? They'd crack each other up!

All 5 jokes are unique - each one has a different setup and punchline with no repeats.

OpenAI o3

OpenAI o3

Here are five jokes:

  1. Why did the scarecrow win an award? Because he was outstanding in his field.
  2. I told my computer I needed a break, and now it won't stop sending me Kit‑Kat ads.
  3. Parallel lines have so much in common—it's a shame they'll never meet.
  4. I started a band called 1023 MB. We still haven't gotten a gig.
  5. Why don't scientists trust atoms? Because they make up everything.

How many of these jokes are unique?
All five are distinct from one another, so the answer is 5.

Vote
3 / 27

Xbox Controller SVG Art

image generation·medium

Tests an AI's ability to create detailed SVG illustrations of gaming hardware

Claude Opus 4.1

Claude Opus 4.1

OpenAI o3

OpenAI o3

Vote
Loading