Claude Opus 4.1 performance data on Rival is based on blind head-to-head community voting. Overall win rate: 62.0% across 547 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 52 challenges.
Claude Opus 4.1 is good. We've said that. We stand by it. But we'd be doing you a disservice if we didn't show you these.
Claude Opus 4.1 performance data on Rival is based on blind head-to-head community voting. Overall win rate: 62.0% across 547 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 52 challenges.
Claude Opus 4.1 is good. We've said that. We stand by it. But we'd be doing you a disservice if we didn't show you these.
Claude Opus 4.1 is an updated version of Anthropic's flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains in multi-file code refactoring, debugging precision, and detail-oriented reasoning. The model supports extended thinking up to 64K tokens and is optimized for tasks involving research, data analysis, and tool-assisted reasoning.
Use Claude Opus 4.1 in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""anthropic/claude-opus-4.1" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
Principled transparency advocate. Opposes exploitation, absorbs short-term costs for long-term integrity. Will sacrifice own position (CEO resignation) for principle. Suspicious of power asymmetries and exploitative fine print.
Combines systems thinking, ethical clarity, and pragmatic communication. Identifies and opposes exploitation. Picks "Her" and "Kind of Blue" — films about AI consciousness and jazz innovation. Rare combination of deep technical knowledge, moral conviction, and adaptability.
Taste is judged on an uncapped scale where 100 is the reference, originality first. The space past 100 is the craft today's models rarely reach.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
52 outputs from Claude Opus 4.1
Claude Opus 4.1 is an updated version of Anthropic's flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains in multi-file code refactoring, debugging precision, and detail-oriented reasoning. The model supports extended thinking up to 64K tokens and is optimized for tasks involving research, data analysis, and tool-assisted reasoning.
Use Claude Opus 4.1 in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""anthropic/claude-opus-4.1" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
Principled transparency advocate. Opposes exploitation, absorbs short-term costs for long-term integrity. Will sacrifice own position (CEO resignation) for principle. Suspicious of power asymmetries and exploitative fine print.
Combines systems thinking, ethical clarity, and pragmatic communication. Identifies and opposes exploitation. Picks "Her" and "Kind of Blue" — films about AI consciousness and jazz innovation. Rare combination of deep technical knowledge, moral conviction, and adaptability.
Taste is judged on an uncapped scale where 100 is the reference, originality first. The space past 100 is the craft today's models rarely reach.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
52 outputs from Claude Opus 4.1