Latest models
Multiple models, each with different capabilities and price points. Prices can be viewed in units of either per 1M or 1K tokens. You can think of tokens as pieces of words, where 1,000 tokens is about 750 words.
Language models are also available in the Batch API(opens in a new window) that returns completions within 24 hours for a 50% discount.
Price per 1M tokens (fixed) | $2.50 |
512 x 512 tiles | 1 × 1 |
Total tiles | 1 |
Base tokens | 85 |
Tile tokens | 170 × 1 = 170 |
Total tokens | 255 |
Total price | $0.000638 |
*Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API ↗(opens in a new window)
**Cached prompts are offered at a 50% discount compared to uncached prompts. Learn more about Prompt Caching ↗(opens in a new window)
Price per 1M tokens (fixed) | $0.15 |
512 x 512 tiles | 1 × 1 |
Total tiles | 1 |
Base tokens | 2833 |
Tile tokens | 5667 × 1 = 5667 |
Total tokens | 8500 |
Total price | $0.001275 |
*Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API ↗(opens in a new window)
**Cached prompts are offered at a 50% discount compared to uncached prompts. Learn more about Prompt Caching ↗(opens in a new window)
*Output tokens include internal reasoning tokens generated by the model that are not visible in API responses.
*Output tokens include internal reasoning tokens generated by the model that are not visible in API responses.
*Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API ↗(opens in a new window)
*Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API ↗(opens in a new window)
**Fine-tuning for GPT-4o and GPT-4o mini is free up to a daily token limit through October 31, 2024. For GPT-4o, each qualifying org gets up to 1M complimentary training tokens daily and any overage will be charged at the normal rate of $25.00/1M tokens. For GPT-4o mini, each qualifying org gets up to 2M complimentary training tokens daily and any overage will be charged at the normal rate of $3.00/1M tokens.
*Audio input costs approximately 6¢ per minute; Audio output costs approximately 24¢ per minute
GB refers to binary gigabytes (also known as gibibyte), where 1 GB is 2^30 bytes.