Claude 3.7 Sonnet vs GPT-5.1-Codex

Compare Claude 3.7 Sonnet by Anthropic against GPT-5.1-Codex by OpenAI, in 9 community votes, claude 3.7 sonnet wins 57% of head-to-head duels, context windows of 200K vs 400K, tested across 40 shared challenges. Updated February 2026.

In 9 community votes, Claude 3.7 Sonnet wins 57% of head-to-head duels. Claude 3.7 Sonnet leads in Web Design. Based on blind community voting from the RIVAL open dataset of 9+ human preference judgments for this pair.

Web Design: Claude 3.7 Sonnet wins 67% of votes
Reasoning: Claude 3.7 Sonnet and GPT-5.1-Codex are tied
Image Generation: Claude 3.7 Sonnet and GPT-5.1-Codex are tied
FAQ