GPT-4o Audio by OpenAI — Pricing, Benchmarks &amp; Real Outputs

Alternatives to GPT-4o Audio

GPT-4o Audio's competitors have been quietly putting in work.

GPT-4o Audio has opinions about your brand too.Ask 10 models what they tell people about you. Verbatim receipts.Run the mirror

Updated Feb 14, 2026

Share

Loading...

Compare GPT-4o Audio

Alternatives to GPT-4o Audio

GPT-4o Audio's competitors have been quietly putting in work.

GPT-4o Audio has opinions about your brand too.Ask 10 models what they tell people about you. Verbatim receipts.Run the mirror

Updated Feb 14, 2026

Share

GPT-4o Audio

GPT Audio:

GPT-4o Audio

GPT-4o with audio input support. Detects nuances within audio recordings and adds depth to generated user experiences. Processes both text and audio prompts.

Provider

Release Date

2025-03-20

Size

LARGE

Pricing

In: $2.5/1M

Out: $10/1M

Get API accessProvider and language code samples

Provider

fromimport openai  OpenAI

client = OpenAI(
"https://openrouter.ai/api/v1"    base_url=,
"$OPENROUTER_API_KEY"    api_key=,
)

response = client.chat.completions.create(
"openai/gpt-4o-audio-preview"    model=,
"role""user""content""Hello!"    messages=[{: , : }],
)
print(response.choices[0].message.content)

Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.

Sponsored

Model Responses

6 outputs from GPT-4o Audio

Related Models

GPT Audio

OpenAI's first generally available audio model. Features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Supports both text and audio input/output.

GPT Audio Mini

Cost-efficient version of GPT Audio. Features an upgraded decoder for more natural sounding voices and better voice consistency at a fraction of the cost.

OpenAI: GPT-5.6 Luna

GPT-5.6 Luna is a fast, cost-efficient model in OpenAI's GPT-5.6 series, suited for high-volume, latency-sensitive tasks such as chat, classification, and lightweight agentic workflows, with capable reasoning for its price tier.

OpenAI: GPT-5.6 Terra

GPT-5.6 Terra is a balanced model in OpenAI's GPT-5.6 series, positioned between the flagship Sol tier and the cost-efficient Luna tier. It suits everyday coding, reasoning, and agentic tasks where capability and cost need to be balanced, at roughly half the cost of Sol.

OpenAI: GPT-5.6 Sol

GPT-5.6 Sol is the flagship model in OpenAI's GPT-5.6 series, suited for complex reasoning, coding, and agentic workflows. It is particularly strong at command-line and multi-step coding tasks and long-horizon problem solving.

OpenAI: GPT-5.6 Luna Pro

GPT-5.6 Luna Pro is the same underlying model as GPT-5.6 Luna, served with reasoning mode set to pro for higher-quality responses on complex tasks.

Keep exploring

GPT-4o Audio vs MiniMax M3

Real outputs compared side by side

Best AI for Creative Writing

Find the best AI for creative writing. Ranked across comedy, fiction, satire,...

Loading...

Compare GPT-4o Audio

Alternatives to GPT-4o Audio

GPT-4o Audio's competitors have been quietly putting in work.

GPT-4o Audio has opinions about your brand too.Ask 10 models what they tell people about you. Verbatim receipts.Run the mirror

Updated Feb 14, 2026

Share

GPT-4o Audio

GPT Audio:

GPT-4o Audio

GPT-4o with audio input support. Detects nuances within audio recordings and adds depth to generated user experiences. Processes both text and audio prompts.

Provider

Release Date

2025-03-20

Size

LARGE

Pricing

In: $2.5/1M

Out: $10/1M

Get API accessProvider and language code samples

Provider

fromimport openai  OpenAI

client = OpenAI(
"https://openrouter.ai/api/v1"    base_url=,
"$OPENROUTER_API_KEY"    api_key=,
)

response = client.chat.completions.create(
"openai/gpt-4o-audio-preview"    model=,
"role""user""content""Hello!"    messages=[{: , : }],
)
print(response.choices[0].message.content)

Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.

Sponsored

Model Responses

6 outputs from GPT-4o Audio

Related Models

GPT Audio

OpenAI's first generally available audio model. Features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Supports both text and audio input/output.

GPT Audio Mini

Cost-efficient version of GPT Audio. Features an upgraded decoder for more natural sounding voices and better voice consistency at a fraction of the cost.

OpenAI: GPT-5.6 Luna

GPT-5.6 Luna is a fast, cost-efficient model in OpenAI's GPT-5.6 series, suited for high-volume, latency-sensitive tasks such as chat, classification, and lightweight agentic workflows, with capable reasoning for its price tier.

OpenAI: GPT-5.6 Terra

GPT-5.6 Terra is a balanced model in OpenAI's GPT-5.6 series, positioned between the flagship Sol tier and the cost-efficient Luna tier. It suits everyday coding, reasoning, and agentic tasks where capability and cost need to be balanced, at roughly half the cost of Sol.

OpenAI: GPT-5.6 Sol

GPT-5.6 Sol is the flagship model in OpenAI's GPT-5.6 series, suited for complex reasoning, coding, and agentic workflows. It is particularly strong at command-line and multi-step coding tasks and long-horizon problem solving.

OpenAI: GPT-5.6 Luna Pro

GPT-5.6 Luna Pro is the same underlying model as GPT-5.6 Luna, served with reasoning mode set to pro for higher-quality responses on complex tasks.

Keep exploring

GPT-4o Audio vs MiniMax M3

Real outputs compared side by side

Best AI for Creative Writing

Find the best AI for creative writing. Ranked across comedy, fiction, satire,...

Loading...

Compare GPT-4o Audio

Alternatives to GPT-4o Audio

GPT-4o Audio's competitors have been quietly putting in work.

GPT-4o Audio has opinions about your brand too.Ask 10 models what they tell people about you. Verbatim receipts.Run the mirror

GPT-4o Audio

GPT Audio:

GPT-4o Audio

GPT-4o with audio input support. Detects nuances within audio recordings and adds depth to generated user experiences. Processes both text and audio prompts.

Provider

Release Date

2025-03-20

Size

LARGE

Pricing

In: $2.5/1M

Out: $10/1M

Get API accessProvider and language code samples

Provider

fromimport openai  OpenAI

client = OpenAI(
"https://openrouter.ai/api/v1"    base_url=,
"$OPENROUTER_API_KEY"    api_key=,
)

response = client.chat.completions.create(
"openai/gpt-4o-audio-preview"    model=,
"role""user""content""Hello!"    messages=[{: , : }],
)
print(response.choices[0].message.content)

Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.

Sponsored

Model Responses

6 outputs from GPT-4o Audio

Related Models

GPT Audio

OpenAI's first generally available audio model. Features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Supports both text and audio input/output.

GPT Audio Mini

Cost-efficient version of GPT Audio. Features an upgraded decoder for more natural sounding voices and better voice consistency at a fraction of the cost.

OpenAI: GPT-5.6 Luna

GPT-5.6 Luna is a fast, cost-efficient model in OpenAI's GPT-5.6 series, suited for high-volume, latency-sensitive tasks such as chat, classification, and lightweight agentic workflows, with capable reasoning for its price tier.

OpenAI: GPT-5.6 Terra

GPT-5.6 Terra is a balanced model in OpenAI's GPT-5.6 series, positioned between the flagship Sol tier and the cost-efficient Luna tier. It suits everyday coding, reasoning, and agentic tasks where capability and cost need to be balanced, at roughly half the cost of Sol.

OpenAI: GPT-5.6 Sol

GPT-5.6 Sol is the flagship model in OpenAI's GPT-5.6 series, suited for complex reasoning, coding, and agentic workflows. It is particularly strong at command-line and multi-step coding tasks and long-horizon problem solving.

OpenAI: GPT-5.6 Luna Pro

GPT-5.6 Luna Pro is the same underlying model as GPT-5.6 Luna, served with reasoning mode set to pro for higher-quality responses on complex tasks.

Keep exploring

GPT-4o Audio vs MiniMax M3

Real outputs compared side by side

Best AI for Creative Writing

Find the best AI for creative writing. Ranked across comedy, fiction, satire,...

GPT-4o Audio

GPT Audio:

GPT-4o Audio

GPT-4o with audio input support. Detects nuances within audio recordings and adds depth to generated user experiences. Processes both text and audio prompts.

Provider

Release Date

2025-03-20

Size

LARGE

Pricing

In: $2.5/1M

Out: $10/1M

Get API accessProvider and language code samples

Provider

fromimport openai  OpenAI

client = OpenAI(
"https://openrouter.ai/api/v1"    base_url=,
"$OPENROUTER_API_KEY"    api_key=,
)

response = client.chat.completions.create(
"openai/gpt-4o-audio-preview"    model=,
"role""user""content""Hello!"    messages=[{: , : }],
)
print(response.choices[0].message.content)

Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.

Sponsored

Model Responses

6 outputs from GPT-4o Audio

Related Models

GPT Audio

OpenAI's first generally available audio model. Features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Supports both text and audio input/output.

GPT Audio Mini

Cost-efficient version of GPT Audio. Features an upgraded decoder for more natural sounding voices and better voice consistency at a fraction of the cost.

OpenAI: GPT-5.6 Luna

GPT-5.6 Luna is a fast, cost-efficient model in OpenAI's GPT-5.6 series, suited for high-volume, latency-sensitive tasks such as chat, classification, and lightweight agentic workflows, with capable reasoning for its price tier.

OpenAI: GPT-5.6 Terra

GPT-5.6 Terra is a balanced model in OpenAI's GPT-5.6 series, positioned between the flagship Sol tier and the cost-efficient Luna tier. It suits everyday coding, reasoning, and agentic tasks where capability and cost need to be balanced, at roughly half the cost of Sol.

OpenAI: GPT-5.6 Sol

GPT-5.6 Sol is the flagship model in OpenAI's GPT-5.6 series, suited for complex reasoning, coding, and agentic workflows. It is particularly strong at command-line and multi-step coding tasks and long-horizon problem solving.

OpenAI: GPT-5.6 Luna Pro

GPT-5.6 Luna Pro is the same underlying model as GPT-5.6 Luna, served with reasoning mode set to pro for higher-quality responses on complex tasks.

Keep exploring

GPT-4o Audio vs MiniMax M3

Real outputs compared side by side