API Platform

Pricing

Simple and flexible. Only pay for what you use.

Latest models

Multiple models are available, each with different capabilities and price points. Prices can be viewed per 1M or per 1K tokens. You can think of tokens as pieces of words, where 1,000 tokens is about 750 words.

Language models are also available in the Batch API, which returns completions within 24 hours at a 50% discount.
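As a rough illustration of the arithmetic, the sketch below turns a per-1M-token rate into a request cost and applies the 50% Batch API discount. The function names are hypothetical, the token counts are made up, and the $2.50 and $10.00 rates are the GPT-4o input and output prices listed on this page.

```python
BATCH_DISCOUNT = 0.50  # Batch API discount stated on this page


def token_cost(tokens: int, price_per_1m: float, batch: bool = False) -> float:
    """Cost in USD for `tokens` at a per-1M-token rate, optionally batch-discounted."""
    rate = price_per_1m * (1 - BATCH_DISCOUNT) if batch else price_per_1m
    return tokens / 1_000_000 * rate


def per_1k(price_per_1m: float) -> float:
    """Convert a per-1M-token price into the equivalent per-1K-token price."""
    return price_per_1m / 1000


# Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
standard = token_cost(200_000, 2.50) + token_cost(50_000, 10.00)
batched = token_cost(200_000, 2.50, batch=True) + token_cost(50_000, 10.00, batch=True)
print(f"standard: ${standard:.2f}, batch: ${batched:.2f}")
```

The same helper works for any model on the page by swapping in its listed rates.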

GPT-4o

GPT-4o is our most advanced multimodal model that’s faster and cheaper than GPT-4 Turbo with stronger vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.

Input: $2.50 / 1M tokens (Batch API*: $1.25 / 1M tokens)
Cached** input: $1.25 / 1M tokens
Output: $10.00 / 1M tokens (Batch API*: $5.00 / 1M tokens)

Input: $2.50 / 1M tokens (Batch API*: $1.25 / 1M tokens)
Cached** input: $1.25 / 1M tokens
Output: $10.00 / 1M tokens (Batch API*: $5.00 / 1M tokens)

Input: $2.50 / 1M tokens (Batch API*: $1.25 / 1M tokens)
Cached** input: $1.25 / 1M tokens
Output: $10.00 / 1M tokens (Batch API*: $5.00 / 1M tokens)

Text
Input: $2.50 / 1M tokens
Output: $10.00 / 1M tokens
Audio
Input: $40.00 / 1M tokens
Output: $80.00 / 1M tokens

Text
Input: $2.50 / 1M tokens
Output: $10.00 / 1M tokens
Audio
Input: $40.00 / 1M tokens
Output: $80.00 / 1M tokens

Text
Input: $2.50 / 1M tokens
Output: $10.00 / 1M tokens
Audio
Input: $100.00 / 1M tokens
Output: $200.00 / 1M tokens

Input: $5.00 / 1M tokens (Batch API*: $2.50 / 1M tokens)
Output: $15.00 / 1M tokens (Batch API*: $7.50 / 1M tokens)

Vision pricing calculator
Price per 1M tokens (fixed): $2.50
512 x 512 tiles: 1 x 1
Total tiles: 1
Base tokens: 85
Tile tokens: 170 x 1 = 170
Total tokens: 255
Total price: $0.000638
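
The calculator's arithmetic can be sketched as follows: total tokens are the base tokens plus the per-tile tokens times the number of 512 x 512 tiles, and the price is that total at the per-1M-token rate. The function name is hypothetical, and the sketch takes the tile count as given rather than deriving it from image dimensions, since the page does not describe that step.

```python
def vision_cost(tiles: int,
                base_tokens: int = 85,
                tokens_per_tile: int = 170,
                price_per_1m: float = 2.50) -> tuple[int, float]:
    """Return (total tokens, price in USD) for an image split into `tiles` tiles.

    Defaults are the GPT-4o figures shown in the calculator above.
    """
    total_tokens = base_tokens + tokens_per_tile * tiles
    price = total_tokens / 1_000_000 * price_per_1m
    return total_tokens, price


tokens, price = vision_cost(tiles=1)
print(tokens, f"${price:.7f}")
```

With one tile this gives 255 tokens, matching the calculator's total.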

*Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API.

**Cached prompts are offered at a 50% discount compared to uncached prompts. Learn more about Prompt Caching.
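
To make the caching discount concrete, here is a minimal sketch that splits a prompt into cached and uncached input tokens and prices each part. The function name and the token counts are hypothetical; the $2.50 rate and the 50% discount are the GPT-4o figures above.

```python
def input_cost(total_input_tokens: int,
               cached_tokens: int,
               price_per_1m: float = 2.50,
               cached_discount: float = 0.50) -> float:
    """Input cost in USD when part of the prompt is served from the cache."""
    if not 0 <= cached_tokens <= total_input_tokens:
        raise ValueError("cached_tokens must be between 0 and total_input_tokens")
    uncached_tokens = total_input_tokens - cached_tokens
    cached_rate = price_per_1m * (1 - cached_discount)
    return (uncached_tokens * price_per_1m + cached_tokens * cached_rate) / 1_000_000


# Hypothetical prompt: 10,000 input tokens, 8,000 of them cached.
cost = input_cost(10_000, 8_000)
print(f"${cost:.4f}")
```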

GPT-4o mini

GPT-4o mini is our most cost-efficient small model that’s smarter and cheaper than GPT-3.5 Turbo, and has vision capabilities.

Input: $0.150 / 1M tokens (Batch API*: $0.075 / 1M tokens)
Cached** input: $0.075 / 1M tokens
Output: $0.600 / 1M tokens (Batch API*: $0.300 / 1M tokens)

Input: $0.150 / 1M tokens (Batch API*: $0.075 / 1M tokens)
Cached** input: $0.075 / 1M tokens
Output: $0.600 / 1M tokens (Batch API*: $0.300 / 1M tokens)

Text
Input: $0.150 / 1M tokens
Output: $0.600 / 1M tokens
Audio
Input: $10.000 / 1M tokens
Output: $20.000 / 1M tokens

Text
Input: $0.150 / 1M tokens
Output: $0.600 / 1M tokens
Audio
Input: $10.000 / 1M tokens
Output: $20.000 / 1M tokens

Vision pricing calculator
Price per 1M tokens (fixed): $0.15
512 x 512 tiles: 1 x 1
Total tiles: 1
Base tokens: 2833
Tile tokens: 5667 x 1 = 5667
Total tokens: 8500
Total price: $0.001275

*Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API.

**Cached prompts are offered at a 50% discount compared to uncached prompts. Learn more about Prompt Caching.

OpenAI o1

o1 is our frontier reasoning model that supports tools, Structured Outputs, and vision. The model has 200K context and an October 2023 knowledge cutoff.

Input: $15.00 / 1M tokens (Batch API***: $7.50 / 1M tokens)
Cached* input: $7.50 / 1M tokens
Output**: $60.00 / 1M tokens (Batch API***: $30.00 / 1M tokens)

Input: $15.00 / 1M tokens (Batch API***: $7.50 / 1M tokens)
Cached* input: $7.50 / 1M tokens
Output**: $60.00 / 1M tokens (Batch API***: $30.00 / 1M tokens)

Input: $15.00 / 1M tokens (Batch API***: $7.50 / 1M tokens)
Cached* input: $7.50 / 1M tokens
Output**: $60.00 / 1M tokens (Batch API***: $30.00 / 1M tokens)

Input: $15.00 / 1M tokens (Batch API***: $7.50 / 1M tokens)
Cached* input: $7.50 / 1M tokens
Output**: $60.00 / 1M tokens (Batch API***: $30.00 / 1M tokens)

Vision pricing calculator
Price per 1M tokens (fixed): $15.00
512 x 512 tiles: 1 x 1
Total tiles: 1
Base tokens: 75
Tile tokens: 150 x 1 = 150
Total tokens: 225
Total price: $0.003375

*Cached prompts are offered at a 50% discount compared to uncached prompts. Learn more about Prompt Caching.

**Output tokens include internal reasoning tokens generated by the model that are not visible in API responses.

***Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API.
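
Because reasoning tokens are billed as output even though they never appear in the response, the billed output can be much larger than the visible text. A minimal sketch, assuming the $60.00 / 1M output rate listed above for o1; the function name and the token counts are hypothetical.

```python
def reasoning_output_cost(visible_tokens: int,
                          reasoning_tokens: int,
                          price_per_1m: float = 60.00) -> float:
    """Output cost in USD when hidden reasoning tokens are billed as output."""
    billed_tokens = visible_tokens + reasoning_tokens
    return billed_tokens / 1_000_000 * price_per_1m


# Hypothetical response: 500 visible tokens produced after 3,500 reasoning tokens.
cost = reasoning_output_cost(visible_tokens=500, reasoning_tokens=3_500)
visible_only = reasoning_output_cost(visible_tokens=500, reasoning_tokens=0)
print(f"billed: ${cost:.2f}, visible tokens alone: ${visible_only:.2f}")
```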

OpenAI o3-mini

o3-mini is our cost-efficient reasoning model that’s optimized for coding, math, and science, and supports tools and Structured Outputs.

Input: $1.10 / 1M tokens (Batch API***: $0.55 / 1M tokens)
Cached* input: $0.55 / 1M tokens
Output**: $4.40 / 1M tokens (Batch API***: $2.20 / 1M tokens)

Input: $1.10 / 1M tokens (Batch API***: $0.55 / 1M tokens)
Cached* input: $0.55 / 1M tokens
Output**: $4.40 / 1M tokens (Batch API***: $2.20 / 1M tokens)

*Cached prompts are offered at a 50% discount compared to uncached prompts. Learn more about Prompt Caching.

**Output tokens include internal reasoning tokens generated by the model that are not visible in API responses.

***Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API.

Embedding models

Build advanced search, clustering, topic modeling, and classification functionality with our embeddings offering.

$0.020 / 1M tokens (Batch API*: $0.010 / 1M tokens)
$0.130 / 1M tokens (Batch API*: $0.065 / 1M tokens)
$0.100 / 1M tokens (Batch API*: $0.050 / 1M tokens)

*Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API.

Fine-tuning models

Create your own custom models by fine-tuning our base models with your training data. Once you fine-tune a model, you’ll be billed only for the tokens you use in requests to that model.

Input: $3.750 / 1M tokens (Batch API*: $1.875 / 1M tokens)
Cached** input: $1.875 / 1M tokens
Output: $15.000 / 1M tokens (Batch API*: $7.500 / 1M tokens)
Training: $25.000 / 1M tokens

Input: $0.300 / 1M tokens (Batch API*: $0.150 / 1M tokens)
Cached** input: $0.150 / 1M tokens
Output: $1.200 / 1M tokens (Batch API*: $0.600 / 1M tokens)
Training: $3.000 / 1M tokens

Input: $3.000 / 1M tokens (Batch API*: $1.500 / 1M tokens)
Output: $6.000 / 1M tokens (Batch API*: $3.000 / 1M tokens)
Training: $8.000 / 1M tokens

Input: $12.000 / 1M tokens (Batch API*: $6.000 / 1M tokens)
Output: $12.000 / 1M tokens (Batch API*: $6.000 / 1M tokens)
Training: $6.000 / 1M tokens

Input: $1.600 / 1M tokens (Batch API*: $0.800 / 1M tokens)
Output: $1.600 / 1M tokens (Batch API*: $0.800 / 1M tokens)
Training: $0.400 / 1M tokens

Vision pricing calculator
Price per 1M tokens (fixed): $25.00
512 x 512 tiles: 1 x 1
Total tiles: 1
Base tokens: 85
Tile tokens: 170 x 1 = 170
Total tokens: 255
Total price: $0.006375

*Batch API pricing requires requests to be submitted as a batch. Responses will be returned within 24 hours for a 50% discount. Learn more about Batch API.

**Cached prompts are offered at a 50% discount compared to uncached prompts. Learn more about Prompt Caching.

Realtime API

The Realtime API lets developers build low-latency, multimodal experiences, including speech-to-speech capabilities. Text and audio processed by the Realtime API are priced separately.
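
Since text and audio are priced separately, a session's cost is the sum of the two modalities. The sketch below assumes the first set of rates listed in this section ($5.00 / $20.00 for text, $40.00 / $80.00 for audio) and ignores cached input; the class and function names, and the token counts, are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Per-1M-token prices in USD for one modality."""
    input_per_1m: float
    output_per_1m: float


TEXT = Rates(input_per_1m=5.00, output_per_1m=20.00)
AUDIO = Rates(input_per_1m=40.00, output_per_1m=80.00)


def modality_cost(rates: Rates, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one modality's input and output tokens."""
    return (input_tokens * rates.input_per_1m
            + output_tokens * rates.output_per_1m) / 1_000_000


def session_cost(text_in: int, text_out: int, audio_in: int, audio_out: int) -> float:
    """Total cost in USD: text and audio are priced separately, then summed."""
    return modality_cost(TEXT, text_in, text_out) + modality_cost(AUDIO, audio_in, audio_out)


# Hypothetical session: a little text, mostly audio.
total = session_cost(text_in=1_000, text_out=500, audio_in=10_000, audio_out=5_000)
print(f"${total:.3f}")
```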

Text
Input: $5.00 / 1M tokens
Cached* input: $2.50 / 1M tokens
Output: $20.00 / 1M tokens
Audio
Input: $40.00 / 1M tokens
Cached* input: $2.50 / 1M tokens
Output: $80.00 / 1M tokens

Text
Input: $5.00 / 1M tokens
Cached* input: $2.50 / 1M tokens
Output: $20.00 / 1M tokens
Audio
Input: $40.00 / 1M tokens
Cached* input: $2.50 / 1M tokens
Output: $80.00 / 1M tokens

Text
Input: $5.00 / 1M tokens
Cached* input: $2.50 / 1M tokens
Output: $20.00 / 1M tokens
Audio*
Input: $100.00 / 1M tokens
Cached* input: $20.00 / 1M tokens
Output: $200.00 / 1M tokens

Text
Input: $0.60 / 1M tokens
Cached* input: $0.30 / 1M tokens
Output: $2.40 / 1M tokens
Audio
Input: $10.00 / 1M tokens
Cached* input: $0.30 / 1M tokens
Output: $20.00 / 1M tokens

Text
Input: $0.60 / 1M tokens
Cached* input: $0.30 / 1M tokens
Output: $2.40 / 1M tokens
Audio
Input: $10.00 / 1M tokens
Cached* input: $0.30 / 1M tokens
Output: $20.00 / 1M tokens

*Cached prompts are offered at a discount compared to uncached prompts. Learn more about Prompt Caching.

Assistants API

The Assistants API and its tools make it easy for developers to build AI assistants in their applications. The tokens used for the Assistants API are billed at the chosen language model's per-token input and output rates.

Additionally, we charge the following fees for tool usage:

Input: $0.03 / session
Input: $0.10 / GB of vector storage per day (1 GB free)

GB refers to binary gigabytes (also known as gibibytes), where 1 GB is 2^30 bytes.
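
The tool fees can be estimated with a short sketch. It assumes, beyond what the page states, that fractional gigabytes are billed pro rata and that the free 1 GB is subtracted from the amount stored on each day; the function names and example figures are hypothetical.

```python
BYTES_PER_GB = 2 ** 30  # binary gigabyte, as defined above


def vector_storage_cost(stored_bytes: int,
                        days: int,
                        price_per_gb_day: float = 0.10,
                        free_gb: float = 1.0) -> float:
    """Storage fee in USD for holding `stored_bytes` for `days` days."""
    stored_gb = stored_bytes / BYTES_PER_GB
    billable_gb = max(0.0, stored_gb - free_gb)  # assumption: free tier applies daily
    return billable_gb * price_per_gb_day * days


def session_fees(sessions: int, price_per_session: float = 0.03) -> float:
    """Per-session tool fee in USD."""
    return sessions * price_per_session


# Hypothetical month: 3 GB stored for 30 days and 100 tool sessions.
storage = vector_storage_cost(3 * BYTES_PER_GB, days=30)
sessions = session_fees(100)
print(f"storage: ${storage:.2f}, sessions: ${sessions:.2f}")
```

These fees are in addition to the model's own per-token charges.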

Image models

Build DALL·E directly into your apps to generate and edit novel images and art. DALL·E 3 is the highest quality model and DALL·E 2 is optimized for lower cost.

Standard, 1024×1024: $0.040 / image
Standard, 1024×1792 or 1792×1024: $0.080 / image

HD, 1024×1024: $0.080 / image
HD, 1024×1792 or 1792×1024: $0.120 / image

1024×1024: $0.020 / image
512×512: $0.018 / image
256×256: $0.016 / image

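
Per-image pricing lends itself to a simple lookup. The sketch below covers only the Standard and HD rows listed above, since the remaining rows carry no quality label; the table and function names are hypothetical.

```python
# Price per image in USD, keyed by (quality, resolution), from the rows above.
IMAGE_PRICES = {
    ("standard", "1024x1024"): 0.040,
    ("standard", "1024x1792"): 0.080,
    ("standard", "1792x1024"): 0.080,
    ("hd", "1024x1024"): 0.080,
    ("hd", "1024x1792"): 0.120,
    ("hd", "1792x1024"): 0.120,
}


def image_cost(quality: str, resolution: str, count: int = 1) -> float:
    """Cost in USD for generating `count` images at the given quality and size."""
    key = (quality.lower(), resolution.lower().replace("×", "x"))
    if key not in IMAGE_PRICES:
        raise KeyError(f"no listed price for {quality} at {resolution}")
    return IMAGE_PRICES[key] * count


# Hypothetical job: ten HD portrait images.
total = image_cost("HD", "1024×1792", count=10)
print(f"${total:.2f}")
```
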
Audio models

Whisper can transcribe speech into text and translate many languages into English.



Text-to-speech (TTS) can convert text into spoken audio.

Usage: $0.006 / minute (rounded to the nearest second)
Usage: $15.000 / 1M characters
Usage: $30.000 / 1M characters

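
The two billing units, minutes of audio and characters of text, can be sketched as below. The function names are hypothetical, the per-minute rate is assumed to apply to transcription and the per-character rate to text-to-speech, and Python's built-in `round` (which rounds exact halves to the nearest even integer) stands in for "rounded to the nearest second".

```python
def transcription_cost(duration_seconds: float, price_per_minute: float = 0.006) -> float:
    """Cost in USD for audio billed per minute, rounded to the nearest second."""
    billed_seconds = round(duration_seconds)
    return billed_seconds / 60 * price_per_minute


def speech_cost(characters: int, price_per_1m_chars: float = 15.00) -> float:
    """Cost in USD for text-to-speech billed per 1M input characters."""
    return characters / 1_000_000 * price_per_1m_chars


# Hypothetical usage: a 90.4-second clip and a 2,000-character script.
print(f"${transcription_cost(90.4):.4f}")
print(f"${speech_cost(2_000):.4f}")
print(f"${speech_cost(2_000, price_per_1m_chars=30.00):.4f}")
```
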
Other models

While we continuously improve our latest models, here is a list of other models that we support.

Input: $1.10 / 1M tokens; Cached* input: $0.55 / 1M tokens; Output: $4.40 / 1M tokens
Input: $5.00 / 1M tokens; Output: $15.00 / 1M tokens
Input: $10.00 / 1M tokens; Output: $30.00 / 1M tokens
Input: $10.00 / 1M tokens; Output: $30.00 / 1M tokens
Input: $30.00 / 1M tokens; Output: $60.00 / 1M tokens
Input: $60.00 / 1M tokens; Output: $120.00 / 1M tokens
Input: $10.00 / 1M tokens; Output: $30.00 / 1M tokens
Input: $10.00 / 1M tokens; Output: $30.00 / 1M tokens
Input: $10.00 / 1M tokens; Output: $30.00 / 1M tokens
Input: $0.50 / 1M tokens; Output: $1.50 / 1M tokens
Input: $1.50 / 1M tokens; Output: $2.00 / 1M tokens
Input: $1.00 / 1M tokens; Output: $2.00 / 1M tokens
Input: $1.50 / 1M tokens; Output: $2.00 / 1M tokens
Input: $3.00 / 1M tokens; Output: $4.00 / 1M tokens
Input: $1.50 / 1M tokens; Output: $2.00 / 1M tokens
Input: $2.00 / 1M tokens; Output: $2.00 / 1M tokens
Input: $0.40 / 1M tokens; Output: $0.40 / 1M tokens

FAQ

You can think of tokens as pieces of words used for natural language processing. For English text, 1 token is approximately 4 characters or 0.75 words. As a point of reference, the collected works of Shakespeare are about 900,000 words or 1.2M tokens.

To learn more about how tokens work and estimate your usage…

  • Experiment with our interactive Tokenizer tool.

  • Log in to your account and enter text into the Playground. The counter in the footer will display how many tokens are in your text.
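
For a quick estimate without either tool, the rules of thumb above (about 4 characters or 0.75 words per token for English text) can be sketched as follows. These are rough approximations rather than a real tokenizer, and the function names are hypothetical.

```python
CHARS_PER_TOKEN = 4      # rule of thumb for English text
WORDS_PER_TOKEN = 0.75   # rule of thumb for English text


def tokens_from_chars(char_count: int) -> int:
    """Estimate tokens from a character count."""
    return round(char_count / CHARS_PER_TOKEN)


def tokens_from_words(word_count: int) -> int:
    """Estimate tokens from a word count."""
    return round(word_count / WORDS_PER_TOKEN)


def estimate_tokens(text: str) -> tuple[int, int]:
    """Return (estimate from characters, estimate from whitespace-separated words)."""
    return tokens_from_chars(len(text)), tokens_from_words(len(text.split()))


# The Shakespeare reference point: about 900,000 words.
print(tokens_from_words(900_000))
print(estimate_tokens("You can think of tokens as pieces of words."))
```

The two estimates usually differ somewhat; an actual tokenizer count is the figure that billing is based on.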
