Grok 3 Thinking

Grok 3 Thinking

Grok 3 Thinking exposes the full chain-of-thought process during problem-solving, including error backtracking and alternative solution exploration. Scores 84.6% on GPQA Diamond benchmark for expert-level Q&A.

ConversationReasoningCode GenerationAnalysis
Provider
Xai
Release Date
2025-02-19
Size
XLARGE
Parameters
2.7T

Benchmark Performance

Performance metrics on industry standard AI benchmarks that measure capabilities across reasoning, knowledge, and specialized tasks.

MMLU

86.2%
View Source

GPQA Diamond

84.6%
View Source

MATH

80.5%
View Source

Model Insights

All Model Responses