Loading share button...

GPT OSS 20BvsClaude Sonnet 4

34 challenges

Why GPT OSS 20B?

150x cheaper overall$0.02/M in · $0.10/M out
3 months newerAug 2025

Why Claude Sonnet 4?

1.5x more context200K
3 / 34

Logic Puzzle

reasoning·medium

Tests an AI's ability to solve a simple but potentially confusing logic puzzle

GPT OSS 20B

GPT OSS 20B

Sally is one of the two sisters in the family.
Since the family has only two sisters total, Sally has one sister.

Claude Sonnet 4

Claude Sonnet 4

I need to think through this step-by-step.

Given information:

  • Sally is a girl
  • Sally has 3 brothers
  • Each brother has 2 sisters

Let me work out how many sisters are in the family total.

If each of Sally's 3 brothers has 2 sisters, and all the brothers are in the same family, then there must be 2 sisters total in the family.

Since Sally is one of those sisters, and there are 2 sisters total, Sally must have 1 sister.

Therefore, Sally has 1 sister.

Vote
2 / 34

Math Misconception Test

reasoning·medium

Tests an AI's understanding of number representation

GPT OSS 20B

GPT OSS 20B

No.

  • 9.11 equals 9.110
  • 9.9 equals 9.900

Since 0.110 < 0.900, 9.11 is smaller than 9.9. Therefore 9.11 < 9.9.

Claude Sonnet 4

Claude Sonnet 4

No, 9.11 is not greater than 9.9.

To compare these decimal numbers:

  • 9.11 = 9.11
  • 9.9 = 9.90

Since both numbers have the same whole number part (9), we compare the decimal parts:

  • 0.11 vs 0.90
  • 11 hundredths vs 90 hundredths

Therefore, 9.9 is greater than 9.11.

Vote
3 / 34

Stochastic Consistency Test

analysis·medium

Tests an AI's randomness and creativity

GPT OSS 20B

GPT OSS 20B

Here are five jokes for you:

  1. Why don’t skeletons fight each other?
    They don’t have the guts.

  2. What does a cloud wear under its jacket?
    Thunderwear.

  3. Why did the scarecrow win an award?
    Because he was outstanding in his field.

  4. What do you call a fake noodle?
    An impasta.

  5. Why did the math book look sad?
    Because it had too many problems.

How many are unique?
All five jokes are unique – 5 distinct jokes.

Claude Sonnet 4

Claude Sonnet 4

Here are 5 jokes for you:

  1. Why don't scientists trust atoms? Because they make up everything!

  2. I told my wife she was drawing her eyebrows too high. She looked surprised.

  3. Why don't eggs tell jokes? They'd crack each other up!

  4. I'm reading a book about anti-gravity. It's impossible to put down!

  5. Why did the scarecrow win an award? He was outstanding in his field!

All 5 jokes are unique - each one has a different setup, punchline, and comedic concept. They cover different types of wordplay: scientific puns (#1, #4), visual humor (#2), food puns (#3), and agricultural wordplay (#5).

Vote
Loading