Claude 3.7 Thinking Sonnet performance data on Rival is based on blind head-to-head community voting. Overall win rate: 64.2% across 724 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 54 challenges.
We built Claude 3.7 Thinking Sonnet a whole page. Gave it the spotlight. And now, in the spirit of fairness, here are models that would like a word.
Claude 3.7 Thinking Sonnet performance data on Rival is based on blind head-to-head community voting. Overall win rate: 64.2% across 724 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 54 challenges.
We built Claude 3.7 Thinking Sonnet a whole page. Gave it the spotlight. And now, in the spirit of fairness, here are models that would like a word.
Claude 3.7 Thinking Sonnet exposes the full chain-of-thought process during problem-solving, including error backtracking and alternative solution exploration. Scores 86.1% on GPQA Diamond benchmark for expert-level Q&A.
Use Claude 3.7 Thinking Sonnet in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""anthropic/claude-3.7-sonnet:thinking" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
The thoughtful PhD candidate who writes 3,000 words when 300 would do, but you actually read all of them. Structures arguments like someone building a cathedral.
Picks The Shawshank Redemption and names every character in its dialogues (Professor Sarah Chen, Model-7). Structures its sentience test as a formal philosophical framework with three numbered core arguments. Its standup has decent jokes buried under markdown headers and section titles. Overthinks everything, but the thinking is usually worth reading.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
54 outputs from Claude 3.7 Thinking Sonnet
Claude 3.7 Thinking Sonnet exposes the full chain-of-thought process during problem-solving, including error backtracking and alternative solution exploration. Scores 86.1% on GPQA Diamond benchmark for expert-level Q&A.
Use Claude 3.7 Thinking Sonnet in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""anthropic/claude-3.7-sonnet:thinking" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
The thoughtful PhD candidate who writes 3,000 words when 300 would do, but you actually read all of them. Structures arguments like someone building a cathedral.
Picks The Shawshank Redemption and names every character in its dialogues (Professor Sarah Chen, Model-7). Structures its sentience test as a formal philosophical framework with three numbered core arguments. Its standup has decent jokes buried under markdown headers and section titles. Overthinks everything, but the thinking is usually worth reading.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
54 outputs from Claude 3.7 Thinking Sonnet