Grok 3 Thinking vs GPT OSS 120B - AI Model Comparison
Compare capabilities and responses between Grok 3 Thinking by xai and GPT OSS 120B by openai.
Common Challenges (14)
- Estimate Complexity: Tests an AI's ability to make educated estimates based on technical knowledge...
- Math Misconception Test: Tests an AI's understanding of number representation...
- Stochastic Consistency Test: Tests an AI's randomness and creativity...
- SVG Layout Challenge: Tests an AI's ability to generate vector graphics...
- Xbox Controller SVG Art: Tests an AI's ability to create detailed SVG illustrations of gaming hardware...
- And 9 more challenges...
About Grok 3 Thinking
Provider: xai
Release Date: 2025-02-19
Context Window: 128000
About GPT OSS 120B
Provider: openai
Release Date: 2025-08-05
Context Window: 131072
Grok 3 ThinkingvsGPT OSS 120B
Comparing across 14 challenges
conversationreasoningcode generation+1
117B total (5.1B active per forward pass) 131,072k
conversationreasoningcode generation+3
Battle Arena
0/14 challenges loaded
Loading more challenges...0/14