Claude 3.7 Sonnet vs GPT OSS 120B

Compare Claude 3.7 Sonnet by Anthropic against GPT OSS 120B by OpenAI, in 16 community votes, gpt oss 120b wins 55% of head-to-head duels, context windows of 200K vs 131K, tested across 35 shared challenges. Updated February 2026.

In 16 community votes, GPT OSS 120B wins 55% of head-to-head duels. Claude 3.7 Sonnet leads in Image Generation, while GPT OSS 120B leads in Reasoning. Based on blind community voting from the RIVAL open dataset of 16+ human preference judgments for this pair.

Reasoning: GPT OSS 120B wins 83% of votes
Image Generation: Claude 3.7 Sonnet wins 67% of votes
FAQ