Compare Claude 3.7 Sonnet by Anthropic against GPT-5.1-Codex by OpenAI, in 9 community votes, claude 3.7 sonnet wins 57% of head-to-head duels, context windows of 200K vs 400K, tested across 53 shared challenges. Updated May 2026.
Claude 3.7 Sonnet is the better choice overall, winning 57% of 9 blind community votes on Rival. Claude 3.7 Sonnet costs $3/M input tokens vs $1.25/M for GPT-5.1-Codex. Context windows: 200K vs 400K tokens. Compare their real outputs side by side below.
Claude 3.7 Sonnet is made by anthropic while GPT-5.1-Codex is from openai. Claude 3.7 Sonnet has a 200K token context window compared to GPT-5.1-Codex's 400K. On pricing, Claude 3.7 Sonnet costs $3/M input tokens vs $1.25/M for GPT-5.1-Codex. In community voting, In 9 community votes, Claude 3.7 Sonnet wins 57% of head-to-head duels.
In 9 community votes, Claude 3.7 Sonnet wins 57% of head-to-head duels. Claude 3.7 Sonnet leads in Web Design. Based on blind community voting from the Rival open dataset of 9+ human preference judgments for this pair.
GPT-5.1-Codex is cheaper on both — 2.4× input, 1.5× output
Claude 3.7 Sonnet has the edge overall. In 9 blind votes, Claude 3.7 Sonnet wins 57% of the time.
Claude 3.7 Sonnet particularly excels in Web Design.
Claude 3.7 Sonnet uses 3.5x more headings
Compare Claude 3.7 Sonnet by Anthropic against GPT-5.1-Codex by OpenAI, in 9 community votes, claude 3.7 sonnet wins 57% of head-to-head duels, context windows of 200K vs 400K, tested across 53 shared challenges. Updated May 2026.
Claude 3.7 Sonnet is the better choice overall, winning 57% of 9 blind community votes on Rival. Claude 3.7 Sonnet costs $3/M input tokens vs $1.25/M for GPT-5.1-Codex. Context windows: 200K vs 400K tokens. Compare their real outputs side by side below.
Claude 3.7 Sonnet is made by anthropic while GPT-5.1-Codex is from openai. Claude 3.7 Sonnet has a 200K token context window compared to GPT-5.1-Codex's 400K. On pricing, Claude 3.7 Sonnet costs $3/M input tokens vs $1.25/M for GPT-5.1-Codex. In community voting, In 9 community votes, Claude 3.7 Sonnet wins 57% of head-to-head duels.
In 9 community votes, Claude 3.7 Sonnet wins 57% of head-to-head duels. Claude 3.7 Sonnet leads in Web Design. Based on blind community voting from the Rival open dataset of 9+ human preference judgments for this pair.
GPT-5.1-Codex is cheaper on both — 2.4× input, 1.5× output
Claude 3.7 Sonnet has the edge overall. In 9 blind votes, Claude 3.7 Sonnet wins 57% of the time.
Claude 3.7 Sonnet particularly excels in Web Design.
Claude 3.7 Sonnet uses 3.5x more headings