Claude Opus 4

Claude Opus 4 is Anthropic's most powerful model as of its release, setting new standards for coding, advanced reasoning, and AI agents. It excels at long-running tasks and complex problem-solving, and its capabilities include extended thinking with tool use and improved memory.

Conversation, Reasoning, Code Generation, Analysis, Agentic Tool Use, Memory
Provider: Anthropic
Release Date: 2025-05-22
Size: XLARGE
Parameters: Not disclosed

Benchmark Performance

Performance metrics on industry-standard AI benchmarks that measure capabilities across reasoning, knowledge, and specialized tasks.

SWE-bench Verified: 72.5%
Terminal-bench: 43.2%
GPQA Diamond: 74.9% (without extended thinking)
MMMLU: 87.4% (without extended thinking)
MMMU: 73.7% (without extended thinking)
AIME: 33.9% (without extended thinking)
