Compare Gemini 2.5 Flash Preview 05-20 (thinking) by Google AI against Grok 4.20 Beta by xAI, context windows of 1.0M vs 2.0M, tested across 11 shared challenges. Updated April 2026.
Gemini 2.5 Flash Preview 05-20 (thinking) and Grok 4.20 Beta are both competitive models. Gemini 2.5 Flash Preview 05-20 (thinking) costs $0.15/M input tokens vs $2/M for Grok 4.20 Beta. Context windows: 1049K vs 2000K tokens. Compare their real outputs side by side below.
Gemini 2.5 Flash Preview 05-20 (thinking) is made by google while Grok 4.20 Beta is from xai. Gemini 2.5 Flash Preview 05-20 (thinking) has a 1049K token context window compared to Grok 4.20 Beta's 2000K. On pricing, Gemini 2.5 Flash Preview 05-20 (thinking) costs $0.15/M input tokens vs $2/M for Grok 4.20 Beta.
No community votes yet. On paper, Grok 4.20 Beta has the edge — bigger model tier, newer, bigger context window.
Grok 4.20 Beta uses 35.4x more hedging
Ask them anything yourself
Some models write identically. You are paying for the brand.
178 models fingerprinted across 32 writing dimensions. Free research.
185x
price gap between models that write identically
178
models
12
clone pairs
32
dimensions
279 AI models invented the same fake scientist.
We read every word. 250 models. 2.14 million words. This is what we found.

Compare Gemini 2.5 Flash Preview 05-20 (thinking) by Google AI against Grok 4.20 Beta by xAI, context windows of 1.0M vs 2.0M, tested across 11 shared challenges. Updated April 2026.
Gemini 2.5 Flash Preview 05-20 (thinking) and Grok 4.20 Beta are both competitive models. Gemini 2.5 Flash Preview 05-20 (thinking) costs $0.15/M input tokens vs $2/M for Grok 4.20 Beta. Context windows: 1049K vs 2000K tokens. Compare their real outputs side by side below.
Gemini 2.5 Flash Preview 05-20 (thinking) is made by google while Grok 4.20 Beta is from xai. Gemini 2.5 Flash Preview 05-20 (thinking) has a 1049K token context window compared to Grok 4.20 Beta's 2000K. On pricing, Gemini 2.5 Flash Preview 05-20 (thinking) costs $0.15/M input tokens vs $2/M for Grok 4.20 Beta.
No community votes yet. On paper, Grok 4.20 Beta has the edge — bigger model tier, newer, bigger context window.
Grok 4.20 Beta uses 35.4x more hedging
Ask them anything yourself
Some models write identically. You are paying for the brand.
178 models fingerprinted across 32 writing dimensions. Free research.
185x
price gap between models that write identically
178
models
12
clone pairs
32
dimensions
279 AI models invented the same fake scientist.
We read every word. 250 models. 2.14 million words. This is what we found.

11 fights queued
11 fights queued