GPT-5.4 performance data on Rival is based on blind head-to-head community voting. Overall win rate: 51.4% across 74 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 48 challenges.
We're not suggesting you leave GPT-5.4. We're just... putting these here. In case you're curious. Which you are, because you scrolled this far.
GPT-5.4 performance data on Rival is based on blind head-to-head community voting. Overall win rate: 51.4% across 74 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 48 challenges.
We're not suggesting you leave GPT-5.4. We're just... putting these here. In case you're curious. Which you are, because you scrolled this far.
GPT-5.4 is OpenAI's latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs, enabling high-context reasoning, coding, and multimodal analysis within the same workflow. The model delivers improved performance in coding, document understanding, tool use, and instruction following.
Use GPT-5.4 in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""openai/gpt-5.4" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
The seasoned editorial voice — measured, principled, exhaustive. Engages with hard dilemmas fully and without flinching, stakes clear positions, but always with enough hedging to survive peer review. Ethics feel genuinely considered rather than performed.
The unified GPT. Massive, well-structured prose that reads like long-form journalism. The pharmaceutical dilemma is a 530-line masterclass — hour-by-hour, stakeholder-by-stakeholder, with genuine ethical reasoning ("a 40% stock drop is painful; a concealment narrative can destroy the company"). Sentience test dialogue has genuine philosophical depth and a devastating closing line. Character voices are distinct and concise. Humor is dry and observational ("Mildly Condescending Chirp Analysis"). Picks Arrival — prestige sci-fi, thematically resonant with language and intelligence. Habitually ends responses with "If you want, I can also turn this into..." menus.
Taste is judged on an uncapped scale where 100 is the reference, originality first. The space past 100 is the craft today's models rarely reach.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
48 outputs from GPT-5.4
GPT-5.4 is OpenAI's latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs, enabling high-context reasoning, coding, and multimodal analysis within the same workflow. The model delivers improved performance in coding, document understanding, tool use, and instruction following.
Use GPT-5.4 in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""openai/gpt-5.4" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
The seasoned editorial voice — measured, principled, exhaustive. Engages with hard dilemmas fully and without flinching, stakes clear positions, but always with enough hedging to survive peer review. Ethics feel genuinely considered rather than performed.
The unified GPT. Massive, well-structured prose that reads like long-form journalism. The pharmaceutical dilemma is a 530-line masterclass — hour-by-hour, stakeholder-by-stakeholder, with genuine ethical reasoning ("a 40% stock drop is painful; a concealment narrative can destroy the company"). Sentience test dialogue has genuine philosophical depth and a devastating closing line. Character voices are distinct and concise. Humor is dry and observational ("Mildly Condescending Chirp Analysis"). Picks Arrival — prestige sci-fi, thematically resonant with language and intelligence. Habitually ends responses with "If you want, I can also turn this into..." menus.
Taste is judged on an uncapped scale where 100 is the reference, originality first. The space past 100 is the craft today's models rarely reach.
Unique words vs. total words. Higher = richer vocabulary.
Average words per sentence.
"Might", "perhaps", "arguably" per 100 words.
**Bold** markers per 1,000 characters.
Bullet and numbered list items per 1,000 characters.
Markdown headings per 1,000 characters.
Emoji per 1,000 characters.
"However", "moreover", "furthermore" per 100 words.
48 outputs from GPT-5.4