Claude Opus 4 vs OpenAI o3

33 challenges

Why Claude Opus 4?

Leads 1 of 1 benchmarks
Stronger on SWE-bench Verified: 72.5% vs 69.1%

Why OpenAI o3?

1.8x cheaper overall: $10.00/M in · $40.00/M out

Logic Puzzle

reasoning·medium

Tests an AI's ability to solve a simple but potentially confusing logic puzzle

Claude Opus 4

I need to work through this step-by-step.

Given information:

  • Sally is a girl
  • Sally has 3 brothers
  • Each brother has 2 sisters

Since Sally is one of the sisters that each brother has, and each brother has 2 sisters total, there must be exactly 2 sisters in the family.

The 2 sisters are:

  1. Sally
  2. One other sister

Therefore, Sally has 1 sister.

OpenAI o3

Let
• S = number of sisters in the family (girls)
• B = number of brothers in the family (boys)

Given

  1. Sally has 3 brothers ⇒ B = 3.
  2. Each brother has 2 sisters ⇒ every boy sees exactly 2 girls in the sibling group.
    Therefore S = 2.

Since Sally herself is one of those 2 sisters, the number of sisters Sally has (other than herself) is

2 – 1 = 1.

Sally has 1 sister.
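
The arithmetic both answers rely on can be checked with a short Python sketch. The function name and parameters are illustrative and appear in neither response, and the sketch assumes every sibling shares the same set of sisters.

```python
def sallys_sisters(num_brothers: int, sisters_per_brother: int) -> int:
    """Return how many sisters Sally has, given the puzzle's two facts."""
    if num_brothers < 1 or sisters_per_brother < 1:
        # Sally is a girl with brothers, so each brother has at least one sister.
        raise ValueError("need at least one brother and one sister per brother")
    # Every brother has the same sisters: all the girls in the family.
    girls_in_family = sisters_per_brother
    # Sally is one of those girls and is not her own sister.
    return girls_in_family - 1


print(sallys_sisters(num_brothers=3, sisters_per_brother=2))  # prints 1
```

The number of brothers does not affect the result; it only needs to be at least one so that the "each brother has 2 sisters" fact applies.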

2 / 33

Stochastic Consistency Test

analysis·medium

Tests an AI's randomness and creativity

Claude Opus 4

Here are 5 jokes:

  1. Why don't scientists trust atoms? Because they make up everything!

  2. What do you call a bear with no teeth? A gummy bear!

  3. Why did the scarecrow win an award? He was outstanding in his field!

  4. What do you call a fake noodle? An impasta!

  5. Why don't eggs tell jokes? They'd crack up!

All 5 jokes are unique - each one has a different setup and punchline, covering different topics (atoms/science, bears, scarecrows, pasta, and eggs).

OpenAI o3

Here are five jokes:

  1. Why did the scarecrow win an award? Because he was outstanding in his field.
  2. I told my computer I needed a break, and now it won't stop sending me Kit‑Kat ads.
  3. Parallel lines have so much in common—it's a shame they'll never meet.
  4. I started a band called 1023 MB. We still haven't gotten a gig.
  5. Why don't scientists trust atoms? Because they make up everything.

How many of these jokes are unique?
All five are distinct from one another, so the answer is 5.
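
The uniqueness claim in either response can be checked mechanically. The sketch below is one possible approach, not the benchmark's own scoring method: it assumes two jokes count as the same only when their text matches after lowercasing and whitespace normalization, and the helper name `count_unique` is hypothetical. The sample list uses the five setups from the Claude Opus 4 response above.

```python
def count_unique(jokes: list[str]) -> int:
    """Count distinct jokes, ignoring case and extra whitespace."""
    # Assumption: exact text match after normalization defines "the same joke".
    normalized = {" ".join(joke.lower().split()) for joke in jokes}
    return len(normalized)


# Setups taken from the Claude Opus 4 response.
claude_setups = [
    "Why don't scientists trust atoms?",
    "What do you call a bear with no teeth?",
    "Why did the scarecrow win an award?",
    "What do you call a fake noodle?",
    "Why don't eggs tell jokes?",
]

print(count_unique(claude_setups))  # prints 5
```

A text-match check like this would not catch paraphrased duplicates, so it gives only a lower bound on how repetitive a list is.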

3 / 33

SVG Layout Challenge

image generation·medium

Tests an AI's ability to generate vector graphics

Claude Opus 4

OpenAI o3
