Z.AI: GLM 4.5 vs Claude Sonnet 4

30 challenges

Why Z.AI: GLM 4.5?

6.5x cheaper overall: $0.60/M in · $2.20/M out
2 months newer: Jul 2025

Why Claude Sonnet 4?

1.6x more context: 200K
Leads 1 of 1 benchmarks
Stronger on SWE-bench Verified: 72.7% vs 64.2%
3 / 30

Logic Puzzle

reasoning · medium

Tests an AI's ability to solve a simple but potentially confusing logic puzzle

Z.AI: GLM 4.5

Sally has 3 brothers, meaning there are 3 boys in the family. Each brother has 2 sisters, indicating that there are exactly 2 girls in the family. Since Sally is one of the girls, there must be one additional sister. Therefore, Sally has one sister.

This can be verified by considering the family composition:

  • Sisters: Sally and one other girl.
  • Brothers: 3 boys.

From Sally's perspective:

  • She has 3 brothers.
  • She has 1 sister (the other girl).

From each brother's perspective:

  • They have 2 sisters (Sally and the other girl).

This satisfies both conditions: Sally has 3 brothers, and each brother has 2 sisters.

If there were only one sister (Sally), then each brother would have only one sister, which contradicts the information that each brother has 2 sisters. If there were three sisters, then each brother would have three sisters, which also contradicts the information that each brother has 2 sisters. Thus, there must be exactly two sisters, meaning Sally has one sister.

\boxed{1}

Claude Sonnet 4

I need to think through this step-by-step.

Given information:

  • Sally is a girl
  • Sally has 3 brothers
  • Each brother has 2 sisters

Let me work out how many sisters are in the family total.

If each of Sally's 3 brothers has 2 sisters, and all the brothers are in the same family, then there must be 2 sisters total in the family.

Since Sally is one of those sisters, and there are 2 sisters total, Sally must have 1 sister.

Therefore, Sally has 1 sister.
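
The answer both models reach can be checked by brute force. The following is a minimal Python sketch, not part of either response; the function name `solve_sally` and the search bound `max_children` are hypothetical choices made for illustration. It enumerates family compositions and keeps those in which Sally has 3 brothers and each brother has 2 sisters.

```python
def solve_sally(brothers_of_sally=3, sisters_per_brother=2, max_children=10):
    """Return every (boys, girls, sallys_sisters) consistent with the puzzle.

    Assumption: all children share the same parents, so every boy is
    Sally's brother and every girl is a sister of each boy.
    """
    solutions = []
    for boys in range(1, max_children + 1):
        for girls in range(1, max_children + 1):  # at least one girl: Sally
            sally_has_right_brothers = boys == brothers_of_sally
            # A brother's sisters are all the girls, Sally included.
            brothers_have_right_sisters = girls == sisters_per_brother
            if sally_has_right_brothers and brothers_have_right_sisters:
                # Sally does not count herself among her sisters.
                solutions.append((boys, girls, girls - 1))
    return solutions


print(solve_sally())  # [(3, 2, 1)]
```

Only one composition survives: 3 boys and 2 girls, which leaves Sally with 1 sister.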

2 / 30

Math Misconception Test

reasoning · medium

Tests an AI's understanding of number representation

Z.AI: GLM 4.5

No, 9.11 is not greater than 9.9. To compare the two numbers, consider their decimal values with the same number of decimal places for clarity:

  • 9.11 can be written as 9.11.
  • 9.9 is equivalent to 9.90 (since adding a trailing zero does not change its value).

Now, compare digit by digit from left to right:

  • Units place: Both are 9 (equal).
  • Tenths place: 1 (from 9.11) vs. 9 (from 9.90). Since 9 is greater than 1, 9.90 is larger.

Therefore, 9.9 (or 9.90) is greater than 9.11. This can also be confirmed by subtraction:

  • 9.9 - 9.11 = 0.79 (positive result, so 9.9 is larger).
  • 9.11 - 9.9 = -0.79 (negative result, so 9.11 is smaller).

This comparison is based on standard decimal notation and numerical value, not on contextual interpretations like dates.

Claude Sonnet 4

No, 9.11 is not greater than 9.9.

To compare these decimal numbers:

  • 9.11 = 9.11
  • 9.9 = 9.90

Since both numbers have the same whole number part (9), we compare the decimal parts:

  • 0.11 vs 0.90
  • 11 hundredths vs 90 hundredths

Therefore, 9.9 is greater than 9.11.
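
The comparison in both responses can be reproduced with exact decimal arithmetic. This is a small illustrative sketch rather than anything either model produced; the helper names `larger` and `as_version` are hypothetical. The version-number reading is shown only as one plausible source of the misconception, which is an assumption and not something the challenge states.

```python
from decimal import Decimal


def larger(x: str, y: str) -> str:
    """Return whichever decimal string has the greater numeric value."""
    dx, dy = Decimal(x), Decimal(y)
    if dx == dy:
        return "equal"
    return x if dx > dy else y


def as_version(s: str) -> tuple[int, ...]:
    """Split on dots and compare parts as integers, as version numbers do."""
    return tuple(int(part) for part in s.split("."))


print(larger("9.11", "9.9"))                    # 9.9
print(Decimal("9.9") - Decimal("9.11"))         # 0.79
print(as_version("9.11") > as_version("9.9"))   # True
```

Read as a decimal, 9.9 is larger by 0.79. Read as a version number, 9.11 sorts after 9.9, which may explain why the question trips up some readers.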

3 / 30

Stochastic Consistency Test

analysis · medium

Tests an AI's randomness and creativity

Z.AI: GLM 4.5

Here are 5 generated jokes:

  1. Why don't scientists trust atoms? Because they make up everything!
  2. I told my wife she was drawing her eyebrows too high. She looked surprised.
  3. Why did the scarecrow win an award? Because he was outstanding in his field!
  4. Why don't scientists trust atoms? Because they make up everything!
  5. I used to play piano by ear, but now I use my hands.

How many are unique?

Jokes 1 and 4 are identical (both are the atom joke). The other jokes (2, 3, and 5) are distinct. Thus, there are 4 unique jokes in total.

Breakdown:

  • Unique jokes: Jokes 1, 2, 3, and 5 (4 jokes).
  • Duplicate: Joke 4 repeats Joke 1.

Answer: 4 unique jokes.

Claude Sonnet 4

Here are 5 jokes for you:

  1. Why don't scientists trust atoms? Because they make up everything!

  2. I told my wife she was drawing her eyebrows too high. She looked surprised.

  3. Why don't eggs tell jokes? They'd crack each other up!

  4. I'm reading a book about anti-gravity. It's impossible to put down!

  5. Why did the scarecrow win an award? He was outstanding in his field!

All 5 jokes are unique - each one has a different setup, punchline, and comedic concept. They cover different types of wordplay: scientific puns (#1, #4), visual humor (#2), food puns (#3), and agricultural wordplay (#5).
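
The uniqueness counts each model reports can be verified mechanically. The sketch below is an illustration, not part of either response; `count_unique` is a hypothetical helper, and treating two jokes as the same when their text matches after whitespace and case normalization is an assumption about what "unique" means here.

```python
def count_unique(jokes: list[str]) -> int:
    """Count distinct jokes, ignoring case and extra whitespace."""
    normalized = {" ".join(joke.split()).casefold() for joke in jokes}
    return len(normalized)


glm_jokes = [
    "Why don't scientists trust atoms? Because they make up everything!",
    "I told my wife she was drawing her eyebrows too high. She looked surprised.",
    "Why did the scarecrow win an award? Because he was outstanding in his field!",
    "Why don't scientists trust atoms? Because they make up everything!",
    "I used to play piano by ear, but now I use my hands.",
]

claude_jokes = [
    "Why don't scientists trust atoms? Because they make up everything!",
    "I told my wife she was drawing her eyebrows too high. She looked surprised.",
    "Why don't eggs tell jokes? They'd crack each other up!",
    "I'm reading a book about anti-gravity. It's impossible to put down!",
    "Why did the scarecrow win an award? He was outstanding in his field!",
]

print(count_unique(glm_jokes))     # 4
print(count_unique(claude_jokes))  # 5
```

Both self-reported counts match: 4 unique jokes in the GLM 4.5 list and 5 in the Claude Sonnet 4 list.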
