Claude 3.7 Sonnet vs GPT-5.1-Codex-Mini

Compare Claude 3.7 Sonnet (Anthropic) against GPT-5.1-Codex-Mini (OpenAI): context windows of 200K vs 400K tokens, tested across 54 shared challenges. Updated April 2026.

Which is better, Claude 3.7 Sonnet or GPT-5.1-Codex-Mini?

Claude 3.7 Sonnet and GPT-5.1-Codex-Mini are both competitive models. Claude 3.7 Sonnet costs $3/M input tokens vs $1.5/M for GPT-5.1-Codex-Mini, and its context window is 200K tokens vs 400K. Compare their real outputs side by side below.

Key Differences Between Claude 3.7 Sonnet and GPT-5.1-Codex-Mini

Claude 3.7 Sonnet is made by Anthropic, while GPT-5.1-Codex-Mini is from OpenAI. Claude 3.7 Sonnet has a 200K-token context window compared to GPT-5.1-Codex-Mini's 400K. On pricing, Claude 3.7 Sonnet costs $3/M input tokens vs $1.5/M for GPT-5.1-Codex-Mini.
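To make the pricing gap concrete, here is a minimal sketch that converts the per-million input prices quoted above into a dollar figure for a given prompt size. The function name and the 150,000-token workload are hypothetical, chosen only for illustration, and output-token prices are not covered because this page does not list them.

```python
# Input-token prices in USD per million tokens, as quoted on this page.
INPUT_PRICE_PER_M = {
    "Claude 3.7 Sonnet": 3.00,
    "GPT-5.1-Codex-Mini": 1.50,
}


def input_cost_usd(model: str, input_tokens: int) -> float:
    """Return the input-side cost in USD for the given number of tokens."""
    return INPUT_PRICE_PER_M[model] * input_tokens / 1_000_000


# Hypothetical workload: a 150,000-token prompt, which fits both
# the 200K and the 400K context window.
for model in INPUT_PRICE_PER_M:
    print(f"{model}: ${input_cost_usd(model, 150_000):.3f}")
```

Because both prices scale linearly with token count, the 2x ratio holds at any prompt size; only the absolute difference grows.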

Our Verdict
GPT-5.1-Codex-Mini
Claude 3.7 Sonnet (Runner-up)

No community votes yet. On paper, GPT-5.1-Codex-Mini has the edge: it is the newer model, sits in a bigger model tier, and has a larger context window.

Slight edge
Writing DNA

Style Comparison

Similarity: 98%

Claude 3.7 Sonnet uses 23.0x more headings

Metric                    Claude 3.7 Sonnet    GPT-5.1-Codex-Mini
Vocabulary                62%                  65%
Sentence Length (words)   35                   20
Hedging                   0.99                 0.56
Bold                      1.2                  3.3
Lists                     4.3                  3.4
Emoji                     0.00                 0.00
Headings                  1.78                 0.08
Transitions               0.23                 0.05
Based on 13 + 13 text responses

FAQ

Common questions