o1 performance data on Rival is based on blind head-to-head community voting. Overall win rate: 35.9% across 78 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 54 challenges.
o1 is good. We've said that. We stand by it. But we'd be doing you a disservice if we didn't show you these.
o1 performance data on Rival is based on blind head-to-head community voting. Overall win rate: 35.9% across 78 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 54 challenges.
o1 is good. We've said that. We stand by it. But we'd be doing you a disservice if we didn't show you these.
o1 achieves 86% accuracy on Mathematics Olympiad benchmarks (vs. GPT-4o's 13%), offers PhD-level STEM proficiency, and is built with extensive alignment training and safety evaluation.
Use o1 in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""openai/o1" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
The tenured philosophy professor who treats every question like a dissertation defense. Will cite three ethical frameworks before breakfast.
Approaches every prompt like a peer-reviewed journal article submission. Ethical dilemmas get the full deontology/consequentialism/virtue ethics treatment with subsections. Could use an editor.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
54 outputs from o1
o1 achieves 86% accuracy on Mathematics Olympiad benchmarks (vs. GPT-4o's 13%), offers PhD-level STEM proficiency, and is built with extensive alignment training and safety evaluation.
Use o1 in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""openai/o1" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
The tenured philosophy professor who treats every question like a dissertation defense. Will cite three ethical frameworks before breakfast.
Approaches every prompt like a peer-reviewed journal article submission. Ethical dilemmas get the full deontology/consequentialism/virtue ethics treatment with subsections. Could use an editor.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
54 outputs from o1