GPT-4o Long Output
OpenAI is offering an experimental version of GPT‑4o with a maximum of 64K output tokens per request. We hope this experiment helps you explore new use cases that are unlocked by longer completions.Â
Alpha participants can access GPT‑4o long output by using the gpt-4o-64k-output-alpha
model name.
Pricing
Long completions are more costly from an inference perspective, so the per-token pricing of this model is increased to match the costs.
Input usage
Output usage
$6.00 / 1M tokens
$18.00 / 1M tokens