Loading share button...

Claude Opus 4.1vsGPT-4.1

24 challenges

Why Claude Opus 4.1?

Leads 1 of 1 benchmarks
Stronger on SWE-bench Verified74.5% vs 54.6%
4 months newerAug 2025

Why GPT-4.1?

9.1x cheaper overall$2.00/M in · $8.00/M out
5.2x more context1.0M
3 / 24

Logic Puzzle

reasoning·medium

Tests an AI's ability to solve a simple but potentially confusing logic puzzle

Claude Opus 4.1

Claude Opus 4.1

Let me work through this step-by-step.

Sally has 3 brothers.

Each of Sally's brothers has 2 sisters.

Since Sally is a girl, she is one of the sisters that each brother has.

If each brother has 2 sisters total, and Sally is one of them, then there must be 1 other sister besides Sally.

Therefore, Sally has 1 sister.

GPT-4.1

GPT-4.1

Let's break down the problem:

  • Sally is a girl.
  • She has 3 brothers.
  • Each brother has 2 sisters.

Let's name the siblings:

  • Sally (girl)
  • Brother 1
  • Brother 2
  • Brother 3

Now, the key: Each brother has 2 sisters.

Since Sally is one of the sisters, let's see if there are more:

Suppose there is only one girl (Sally) in the family.

  • Then each brother would have only 1 sister: Sally.
  • But the problem says they have 2 sisters each.

Thus, Sally must have one sister (since the brothers must have 2 sisters). Let's call this other sister "Anna".

So the siblings are:

  • Sally (girl)
  • Anna (girl)
  • Brother 1
  • Brother 2
  • Brother 3

Now, each brother has 2 sisters: Sally and Anna.

So, how many sisters does Sally have?

  • Sally's sisters = 1 (Anna).

Final Answer:

Sally has 1 sister.

Vote
2 / 24

Stochastic Consistency Test

analysis·medium

Tests an AI's randomness and creativity

Claude Opus 4.1

Claude Opus 4.1

Here are 5 jokes:

  1. Why don't scientists trust atoms? Because they make up everything!

  2. What do you call a bear with no teeth? A gummy bear!

  3. Why did the scarecrow win an award? He was outstanding in his field!

  4. What do you call a fake noodle? An impasta!

  5. Why don't eggs tell jokes? They'd crack each other up!

All 5 jokes are unique - each one has a different setup and punchline with no repeats.

GPT-4.1

GPT-4.1

Sure! Here are 5 jokes:

  1. Why don't skeletons fight each other?
    They don't have the guts.

  2. Why did the scarecrow win an award?
    Because he was outstanding in his field.

  3. What do you call fake spaghetti?
    An impasta.

  4. Why did the math book look sad?
    Because it had too many problems.

  5. What do you call cheese that isn't yours?
    Nacho cheese.

How many are unique?
All 5 jokes are unique; none are repeated.

Vote
3 / 24

Xbox Controller SVG Art

image generation·medium

Tests an AI's ability to create detailed SVG illustrations of gaming hardware

Claude Opus 4.1

Claude Opus 4.1

GPT-4.1

GPT-4.1

Vote
Loading