Skip to main content

GPT-4o Long Output

Alpha program

OpenAI is offering an experimental version of GPT-4o with a maximum of 64K output tokens per request. We hope this experiment helps you explore new use cases that are unlocked by longer completions. 

Alpha participants can access GPT-4o long output by using the gpt-4o-64k-output-alpha model name.

Pricing

Long completions are more costly from an inference perspective, so the per-token pricing of this model is increased to match the costs.

Model
Input usage
Output usage
gpt-4o-64k-output-alpha
$6.00 / 1M tokens
$18.00 / 1M tokens