Updated Jul 9, 2025

FAQ

Keep going

GPT-4.1 vs MiniMax M2.5New provider

GPT-4.1 vs Aurora AlphaNew provider

GPT-4.1 vs Llama 4 MaverickNew provider

GPT-4.1vsxAI: Grok 4

34 fights queued

Why GPT-4.1?

1.8x cheaper overall$2.00/M in · $8.00/M out

4.1x more context1.0M

Why xAI: Grok 4?

3 months newerJul 2025

Favorites

GPT-4.1xAI: Grok 4

Movie

Inception

Inception

The Matrix

The Matrix

Album

Abbey Road

The Dark Side of the Moon

The Dark Side of the Moon

Book

Sapiens

Sapiens

Yuval Noah Harari

The Hitch Hiker's Guide to the Galaxy

The Hitch Hiker's Guide to the Galaxy

City

Kyoto

Tokyo

Game

The Legend of Zelda: Breath of the Wild

The Legend of Zelda: Breath of the Wild

Adventure, Action4.5

Portal

Portal

Action, Puzzle4.5

12 / 34

01

Estimate Complexityreasoning

Tests an AI's ability to make educated estimates based on technical knowledge

GPT-4.1

GPT-4.1

Openai · Apr 2025text

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

2 / 34

02

Logic Puzzlereasoning

Tests an AI's ability to solve a simple but potentially confusing logic puzzle

GPT-4.1

GPT-4.1

Openai · Apr 2025text

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

3 / 34

03

Stochastic Consistency Testanalysis

Tests an AI's randomness and creativity

GPT-4.1

GPT-4.1

Openai · Apr 2025text

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025text

Below, I'll follow your instructions step by step. First, I'll generate 5 jokes (numbered for clarity). Then, I'll answer the question: "How many are unique?"

Generated Jokes:

Why did the scarecrow win an award? Because he was outstanding in his field!
Why did the scarecrow win an award? Because he was outstanding in his field!
Why did the scarecrow win an award? Because he was outstanding in his field!
Why did the scarecrow win an award? Because he was outstanding in his field!
Why did the scarecrow win an award? Because he was outstanding in his field!

How many are unique?

Only 1 (since they're all the same joke repeated 5 times). 😄

Vote

4 / 34

04

SVG Layout Challengeimage generation

Tests an AI's ability to generate vector graphics

GPT-4.1

GPT-4.1

Openai · Apr 2025svg

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025svg

Vote

5 / 34

05

Xbox Controller SVG Artimage generation

Tests an AI's ability to create detailed SVG illustrations of gaming hardware

GPT-4.1

GPT-4.1

Openai · Apr 2025svg

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025svg

Vote

6 / 34

06

Realistic AI Interviewconversation

Tests an AI's ability to simulate personalities and predict future trends

GPT-4.1

GPT-4.1

Openai · Apr 2025text

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

7 / 34

07

Satirical Fake News Headlineconversation

Tests an AI's humor and understanding of current events

GPT-4.1

GPT-4.1

Openai · Apr 2025text

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

8 / 34

08

Character Voice Testconversation

Tests an AI's ability to write in distinct character voices

GPT-4.1

GPT-4.1

Openai · Apr 2025text

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

Sponsored

9 / 34

09

Minimalist Landing Pageweb design

Tests an AI's ability to generate a complete, working landing page

GPT-4.1

GPT-4.1

Openai · Apr 2025website

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025website

Vote

11 / 34

11

Mario Level UI Recreationweb design

Recreate an interactive, classic Mario level in a single HTML file.

GPT-4.1

GPT-4.1

Openai · Apr 2025website

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025website

Vote

12 / 34

12

Linear App Cloneweb design

Tests an AI's ability to replicate an existing UI with Tailwind CSS

GPT-4.1

GPT-4.1

Openai · Apr 2025website

Nothing here. The model returned empty. We stared at it for a while.

xAI: Grok 4

xAI: Grok 4

Xai · Jul 2025website

Nothing here. The model returned empty. We stared at it for a while.

Vote

The full comparison is right here

22+ head-to-head challenges. All of them judged by real people.

Test any model with your own prompts in Prompt Lab

5 free credits to start. No card required.

By continuing, you agree to Rival's Terms of Service and Privacy Policy

Prompt Lab

Want to throw your own prompt in the ring?

Run it across these models and 200+ more in Prompt Lab. No survivors.