Compare Claude 3.7 Sonnet by Anthropic against GPT OSS 120B by OpenAI, in 16 community votes, gpt oss 120b wins 55% of head-to-head duels, context windows of 200K vs 131K, tested across 54 shared challenges. Updated June 2026.
GPT OSS 120B is the better choice overall, winning 55% of 16 blind community votes on Rival. Claude 3.7 Sonnet costs $3/M input tokens vs $0.18/M for GPT OSS 120B. Context windows: 200K vs 131K tokens. Compare their real outputs side by side below.
Claude 3.7 Sonnet is made by anthropic while GPT OSS 120B is from openai. Claude 3.7 Sonnet has a 200K token context window compared to GPT OSS 120B's 131K. On pricing, Claude 3.7 Sonnet costs $3/M input tokens vs $0.18/M for GPT OSS 120B. In community voting, In 16 community votes, GPT OSS 120B wins 55% of head-to-head duels.
In 16 community votes, GPT OSS 120B wins 55% of head-to-head duels. Claude 3.7 Sonnet leads in Image Generation, while GPT OSS 120B leads in Reasoning. Based on blind community voting from the Rival open dataset of 16+ human preference judgments for this pair.
GPT OSS 120B has the edge overall. In 16 blind votes, GPT OSS 120B wins 55% of the time.
Pick Claude 3.7 Sonnet for Image Generation. Pick GPT OSS 120B for Reasoning. GPT OSS 120B is 19x cheaper per token — worth considering if cost matters.
GPT OSS 120B is cheaper on both — 17× input, 19× output
GPT OSS 120B uses 15.4x more emoji
Compare Claude 3.7 Sonnet by Anthropic against GPT OSS 120B by OpenAI, in 16 community votes, gpt oss 120b wins 55% of head-to-head duels, context windows of 200K vs 131K, tested across 54 shared challenges. Updated June 2026.
GPT OSS 120B is the better choice overall, winning 55% of 16 blind community votes on Rival. Claude 3.7 Sonnet costs $3/M input tokens vs $0.18/M for GPT OSS 120B. Context windows: 200K vs 131K tokens. Compare their real outputs side by side below.
Claude 3.7 Sonnet is made by anthropic while GPT OSS 120B is from openai. Claude 3.7 Sonnet has a 200K token context window compared to GPT OSS 120B's 131K. On pricing, Claude 3.7 Sonnet costs $3/M input tokens vs $0.18/M for GPT OSS 120B. In community voting, In 16 community votes, GPT OSS 120B wins 55% of head-to-head duels.
In 16 community votes, GPT OSS 120B wins 55% of head-to-head duels. Claude 3.7 Sonnet leads in Image Generation, while GPT OSS 120B leads in Reasoning. Based on blind community voting from the Rival open dataset of 16+ human preference judgments for this pair.
GPT OSS 120B has the edge overall. In 16 blind votes, GPT OSS 120B wins 55% of the time.
Pick Claude 3.7 Sonnet for Image Generation. Pick GPT OSS 120B for Reasoning. GPT OSS 120B is 19x cheaper per token — worth considering if cost matters.
GPT OSS 120B is cheaper on both — 17× input, 19× output
GPT OSS 120B uses 15.4x more emoji