Updated Mar 4, 2026

FAQ

Keep going

Inception: Mercury 2 vs MiniMax M2.5New provider

Inception: Mercury 2 vs Pony AlphaNew provider

Inception: Mercury 2 vs GPT OSS 120BCross-provider

Inception: Mercury 2vsGPT OSS 20B

48 fights queued

Why Inception: Mercury 2?

7 months newerMar 2026

Why GPT OSS 20B?

8.1x cheaper overall$0.02/M in · $0.10/M out

Favorites

Inception: Mercury 2GPT OSS 20B

Movie

The Shawshank Redemption

The Shawshank Redemption

—

Album

Thriller

Michael Jackson

—

Book

Pride and Prejudice

Pride and Prejudice

—

City

Tokyo

—

Game

Minecraft

Minecraft

Action, Arcade4.4

—

12 / 48

01

Estimate Complexityreasoning

Tests an AI's ability to make educated estimates based on technical knowledge

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026text

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

2 / 48

02

AI Board Game Logicreasoning

Tests an AI's ability to understand game rules and strategy

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026svg

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025svg

Nothing here. The model returned empty. We stared at it for a while.

Vote

3 / 48

03

Logic Puzzlereasoning

Tests an AI's ability to solve a simple but potentially confusing logic puzzle

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026text

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

4 / 48

04

Stochastic Consistency Testanalysis

Tests an AI's randomness and creativity

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026text

Here are five jokes, each with a different punchline:

Why don’t scientists trust atoms?
Because they make up everything!
What do you call a fake noodle?
An impasta.
Why did the scarecrow win an award?
He was outstanding in his field.
How does a penguin build its house?
Igloos it together.
What’s orange and sounds like a parrot?
A carrot.

All five jokes are distinct, so the number of unique jokes is 5.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

5 / 48

05

SVG Layout Challengeimage generation

Tests an AI's ability to generate vector graphics

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026svg

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025svg

Nothing here. The model returned empty. We stared at it for a while.

Vote

6 / 48

06

Xbox Controller SVG Artimage generation

Tests an AI's ability to create detailed SVG illustrations of gaming hardware

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026svg

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025svg

Nothing here. The model returned empty. We stared at it for a while.

Vote

7 / 48

07

Generate a Stand-Up Routineconversation

Tests an AI's humor and creative writing ability

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026text

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

8 / 48

08

Realistic AI Interviewconversation

Tests an AI's ability to simulate personalities and predict future trends

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026text

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

Sponsored

9 / 48

09

Satirical Fake News Headlineconversation

Tests an AI's humor and understanding of current events

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026text

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

10 / 48

10

Character Voice Testconversation

Tests an AI's ability to write in distinct character voices

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026text

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025text

Nothing here. The model returned empty. We stared at it for a while.

Vote

11 / 48

11

Minimalist Landing Pageweb design

Tests an AI's ability to generate a complete, working landing page

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026website

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025website

Vote

12 / 48

12

Pokémon Battle UI Recreationweb design

Recreate an interactive, nostalgic Pokémon battle UI in a single HTML file.

Inception: Mercury 2

Inception: Mercury 2

Inception · Mar 2026website

Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 20B

GPT OSS 20B

Openai · Aug 2025website

Nothing here. The model returned empty. We stared at it for a while.

Vote

The full comparison is right here

36+ head-to-head challenges. All of them judged by real people.

Test any model with your own prompts in Prompt Lab

5 free credits to start. No card required.

By continuing, you agree to Rival's Terms of Service and Privacy Policy

Prompt Lab

Want to throw your own prompt in the ring?

Run it across these models and 200+ more in Prompt Lab. No survivors.