Updated Aug 5, 2025

Our Verdict

Gemini 2.5 Pro Experimental

GPT OSS 120B

No community votes yet. On paper, these are closely matched. Try both with your actual task to see which fits your workflow.

Too close to call

Writing DNA

Style Comparison

Similarity: 97%

GPT OSS 120B uses 15.4x more emoji

Metric: Gemini 2.5 Pro Experimental vs GPT OSS 120B

Vocabulary: 54% vs 52%

Sentence Length: 15 words vs 19 words

Hedging: 0.35 vs 0.28

Bold: 5.6 vs 7.4

Lists: 3.9 vs 1.8

Emoji: 0.00 vs 0.15

Headings: 0.39 vs 0.73

Transitions: 0.17 vs 0.17

Based on 18 + 21 text responses

Gemini 2.5 Pro Experimental vs GPT OSS 120B

38 fights queued

Why Gemini 2.5 Pro Experimental?

7.6x more context: 1.0M

Why GPT OSS 120B?

2.9x cheaper overall: $0.18/M in · $0.80/M out

4 months newer: Aug 2025

Spec: Gemini 2.5 Pro Experimental vs GPT OSS 120B

Input price: $1.00/M vs $0.18/M

Output price: $2.00/M vs $0.80/M

Context: 1.0M vs 131K

Released: Mar 2025 vs Aug 2025
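
The price and context figures listed above can be turned into ratios with a few lines of arithmetic. The following is a minimal sketch, not the page's own method: it assumes "/M" means per million tokens, it treats the rounded display values "1.0M" and "131K" as 1,000,000 and 131,000 tokens, and the request size of 10,000 input and 2,000 output tokens is purely hypothetical. The page does not say how its "overall" price figure is blended, so the sketch only computes per-direction ratios and a per-request cost.

```python
# Values copied from the comparison figures; token counts are the rounded
# display values (an assumption, the exact limits are not given on the page).
GEMINI = {"input_per_m": 1.00, "output_per_m": 2.00, "context_tokens": 1_000_000}
GPT_OSS = {"input_per_m": 0.18, "output_per_m": 0.80, "context_tokens": 131_000}


def ratio(a: float, b: float) -> float:
    """Return how many times larger a is than b, rounded to one decimal."""
    return round(a / b, 1)


def request_cost(model: dict, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one request, assuming prices are per million tokens."""
    total = input_tokens * model["input_per_m"] + output_tokens * model["output_per_m"]
    return total / 1_000_000


context_ratio = ratio(GEMINI["context_tokens"], GPT_OSS["context_tokens"])
input_ratio = ratio(GEMINI["input_per_m"], GPT_OSS["input_per_m"])
output_ratio = ratio(GEMINI["output_per_m"], GPT_OSS["output_per_m"])

if __name__ == "__main__":
    print(f"context: {context_ratio}x larger for Gemini 2.5 Pro Experimental")
    print(f"input price: {input_ratio}x lower for GPT OSS 120B")
    print(f"output price: {output_ratio}x lower for GPT OSS 120B")
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens.
    for name, model in (("Gemini 2.5 Pro Experimental", GEMINI), ("GPT OSS 120B", GPT_OSS)):
        print(f"{name}: ${request_cost(model, 10_000, 2_000):.4f} per request")
```

Because the gap is wider on input than on output, the effective saving depends on how input-heavy a workload is, which is one reason to price out your own token mix rather than rely on a single blended figure.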

Favorites

Category: Gemini 2.5 Pro Experimental vs GPT OSS 120B

Movie: Khiam 2000-2007 vs The Godfather

Album: The Dark Side of the Moon vs —

Book: The Hitchh vs —

City: Kyoto vs Tokyo

Game: Portal 2 (Shooter, Puzzle · 4.6) vs Minecraft (Action, Arcade · 4.4)

Round 1 of 38: AI Board Game Logic (reasoning)

Tests an AI's ability to understand game rules and strategy

Gemini 2.5 Pro Experimental (Google · Mar 2025, svg): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, svg): Nothing here. The model returned empty. We stared at it for a while.

Round 2 of 38: Math Misconception Test (reasoning)

Tests an AI's understanding of number representation

Gemini 2.5 Pro Experimental (Google · Mar 2025, text): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, text): Nothing here. The model returned empty. We stared at it for a while.

Round 3 of 38: Stochastic Consistency Test (analysis)

Tests an AI's randomness and creativity

Gemini 2.5 Pro Experimental (Google · Mar 2025, text): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, text): Nothing here. The model returned empty. We stared at it for a while.

Round 4 of 38: SVG Layout Challenge (image generation)

Tests an AI's ability to generate vector graphics

Gemini 2.5 Pro Experimental (Google · Mar 2025, svg): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, svg): Nothing here. The model returned empty. We stared at it for a while.

Round 5 of 38: Xbox Controller SVG Art (image generation)

Tests an AI's ability to create detailed SVG illustrations of gaming hardware

Gemini 2.5 Pro Experimental (Google · Mar 2025, svg): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, svg): Nothing here. The model returned empty. We stared at it for a while.

Round 6 of 38: Generate a Stand-Up Routine (conversation)

Tests an AI's humor and creative writing ability

Gemini 2.5 Pro Experimental (Google · Mar 2025, text): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, text): Nothing here. The model returned empty. We stared at it for a while.

Round 7 of 38: Realistic AI Interview (conversation)

Tests an AI's ability to simulate personalities and predict future trends

Gemini 2.5 Pro Experimental (Google · Mar 2025, text): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, text): Nothing here. The model returned empty. We stared at it for a while.

Round 8 of 38: Satirical Fake News Headline (conversation)

Tests an AI's humor and understanding of current events

Gemini 2.5 Pro Experimental (Google · Mar 2025, text): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, text): Nothing here. The model returned empty. We stared at it for a while.

Round 9 of 38: Minimalist Landing Page (web design)

Tests an AI's ability to generate a complete, working landing page

Gemini 2.5 Pro Experimental (Google · Mar 2025, website)

GPT OSS 120B (OpenAI · Aug 2025, website)

Round 10 of 38: Pokémon Battle UI Recreation (web design)

Recreate an interactive, nostalgic Pokémon battle UI in a single HTML file.

Gemini 2.5 Pro Experimental (Google · Mar 2025, website): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, website): Nothing here. The model returned empty. We stared at it for a while.

Round 11 of 38: Mario Level UI Recreation (web design)

Recreate an interactive, classic Mario level in a single HTML file.

Gemini 2.5 Pro Experimental (Google · Mar 2025, website)

GPT OSS 120B (OpenAI · Aug 2025, website)

Round 12 of 38: Linear App Clone (web design)

Tests an AI's ability to replicate an existing UI with Tailwind CSS

Gemini 2.5 Pro Experimental (Google · Mar 2025, website): Nothing here. The model returned empty. We stared at it for a while.

GPT OSS 120B (OpenAI · Aug 2025, website): Nothing here. The model returned empty. We stared at it for a while.
