← All models

openai

OpenAI: GPT Audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

128,000 context
Modalities:text, audio->text, audio
Released:1/19/2026

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

Weekly tokens

15.0M

Tokens generated this week (network-wide)

Usage by period

Today1.5M tokens
This week23.0M tokens
This month121.6M tokens
Trending23.0M tokens