GPT-4o Audio performance data on Rival is based on blind head-to-head community voting. Overall win rate: 100.0% across 4 duels. All vote data is part of Rival's open dataset of 21,000+ human preference judgments across 200+ AI models. Model responses are curated from 6 challenges.
GPT-4o with audio input support. Detects nuances within audio recordings and adds depth to generated user experiences. Processes both text and audio prompts.
Use GPT-4o Audio in your applications via the OpenRouter API. Copy the code below to get started.
import requests
response = requests.post(
"https://openrouter.ai/api/v1/chat/completions" ,
headers={
"Authorization""Bearer $OPENROUTER_API_KEY" : ,
"Content-Type""application/json" :
},
json={
"model""openai/gpt-4o-audio-preview" : ,
"messages""role""user""content""Hello!" : [{: , : }]
}
)
print(response.json())Replace $OPENROUTER_API_KEY with your API key from openrouter.ai/keys
6 outputs from GPT-4o Audio
Character Voice Dialogue
Narrator Storytelling
Podcast Introduction
Emotional Monologue
News Anchor Bulletin
Multilingual Greeting
Try GPT-4o Audio
GPT-4o Audio is good. We've said that. We stand by it. But we'd be doing you a disservice if we didn't show you these.