Compare Claude 3.7 Thinking Sonnet by Anthropic against GPT-5.1-Codex by OpenAI, context windows of 200K vs 400K, tested across 53 shared challenges. Updated April 2026.
Claude 3.7 Thinking Sonnet and GPT-5.1-Codex are both competitive models. Claude 3.7 Thinking Sonnet costs $6/M input tokens vs $1.25/M for GPT-5.1-Codex. Context windows: 200K vs 400K tokens. Compare their real outputs side by side below.
Claude 3.7 Thinking Sonnet is made by Anthropic, while GPT-5.1-Codex is from OpenAI. Claude 3.7 Thinking Sonnet has a 200K-token context window compared to GPT-5.1-Codex's 400K. On pricing, Claude 3.7 Thinking Sonnet costs $6/M input tokens vs $1.25/M for GPT-5.1-Codex.
No community votes yet. On paper, GPT-5.1-Codex has the edge: it sits in a higher model tier, is newer, and has a larger context window.
GPT-5.1-Codex is listed as 3.0x cheaper per token, which is worth considering if cost matters.
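The page does not break down how the 3.0x figure is derived. As a rough sketch, the input-side cost can be worked out from the two input prices quoted above alone. The 2-million-token workload below is a hypothetical figure chosen for illustration, and output-token prices are left out because the page does not list them, so the ratio printed here covers input tokens only and need not match the 3.0x figure.

```python
# Input prices quoted on this page, in USD per million input tokens.
INPUT_PRICE_PER_M = {
    "Claude 3.7 Thinking Sonnet": 6.00,
    "GPT-5.1-Codex": 1.25,
}


def input_cost(model: str, input_tokens: int) -> float:
    """Return the input-side cost in USD for a given number of input tokens."""
    return INPUT_PRICE_PER_M[model] * input_tokens / 1_000_000


# Hypothetical workload (an assumption, not a figure from the page).
tokens = 2_000_000

claude = input_cost("Claude 3.7 Thinking Sonnet", tokens)
codex = input_cost("GPT-5.1-Codex", tokens)

print(f"Claude 3.7 Thinking Sonnet: ${claude:.2f}")
print(f"GPT-5.1-Codex: ${codex:.2f}")
print(f"Input price ratio: {claude / codex:.1f}x")
```

A full comparison would also need each model's output-token price and the expected mix of input and output tokens for the workload in question.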
In its responses, Claude 3.7 Thinking Sonnet uses 3.6x more headings than GPT-5.1-Codex.