Compare Gemini 2.0 Flash Thinking by Google AI against GPT-5.1-Codex by OpenAI, context windows of 500K vs 400K, tested across 21 shared challenges. Updated April 2026.
Gemini 2.0 Flash Thinking and GPT-5.1-Codex are both competitive models. Gemini 2.0 Flash Thinking costs $0.25/M input tokens vs $1.25/M for GPT-5.1-Codex. Context windows: 500K vs 400K tokens. Compare their real outputs side by side below.
Gemini 2.0 Flash Thinking is made by google while GPT-5.1-Codex is from openai. Gemini 2.0 Flash Thinking has a 500K token context window compared to GPT-5.1-Codex's 400K. On pricing, Gemini 2.0 Flash Thinking costs $0.25/M input tokens vs $1.25/M for GPT-5.1-Codex.
No community votes yet. On paper, GPT-5.1-Codex has the edge — bigger model tier, newer.
Gemini 2.0 Flash Thinking is 20x cheaper per token — worth considering if cost matters.
GPT-5.1-Codex uses 15.0x more headings
Ask them anything yourself
Some models write identically. You are paying for the brand.
178 models fingerprinted across 32 writing dimensions. Free research.
185x
price gap between models that write identically
178
models
12
clone pairs
32
dimensions
279 AI models invented the same fake scientist.
We read every word. 250 models. 2.14 million words. This is what we found.

Compare Gemini 2.0 Flash Thinking by Google AI against GPT-5.1-Codex by OpenAI, context windows of 500K vs 400K, tested across 21 shared challenges. Updated April 2026.
Gemini 2.0 Flash Thinking and GPT-5.1-Codex are both competitive models. Gemini 2.0 Flash Thinking costs $0.25/M input tokens vs $1.25/M for GPT-5.1-Codex. Context windows: 500K vs 400K tokens. Compare their real outputs side by side below.
Gemini 2.0 Flash Thinking is made by google while GPT-5.1-Codex is from openai. Gemini 2.0 Flash Thinking has a 500K token context window compared to GPT-5.1-Codex's 400K. On pricing, Gemini 2.0 Flash Thinking costs $0.25/M input tokens vs $1.25/M for GPT-5.1-Codex.
No community votes yet. On paper, GPT-5.1-Codex has the edge — bigger model tier, newer.
Gemini 2.0 Flash Thinking is 20x cheaper per token — worth considering if cost matters.
GPT-5.1-Codex uses 15.0x more headings
Ask them anything yourself
Some models write identically. You are paying for the brand.
178 models fingerprinted across 32 writing dimensions. Free research.
185x
price gap between models that write identically
178
models
12
clone pairs
32
dimensions
279 AI models invented the same fake scientist.
We read every word. 250 models. 2.14 million words. This is what we found.

21 fights queued
9+ more head-to-head results. Free. Not a trick.
Free account. No card required. By continuing, you agree to Rival's Terms and Privacy Policy
21 fights queued
9+ more head-to-head results. Free. Not a trick.
Free account. No card required. By continuing, you agree to Rival's Terms and Privacy Policy