Pricing

Simple and flexible. Only pay for what you use.

Language models

Multiple models, each with different capabilities and price points. Prices can be viewed in units of either per 1M or 1K tokens. You can think of tokens as pieces of words, where 1,000 tokens is about 750 words. This paragraph is 35 tokens.

Language models are also available through the Batch API, which returns completions within 24 hours at a 50% discount.
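
As a rough illustration of how per-token pricing and the Batch API discount combine, the sketch below estimates the cost of a workload. The function name `estimate_cost` and its `batch` flag are hypothetical, the rates are the gpt-4o prices listed on this page, and the sketch assumes the 50% discount applies equally to input and output tokens.

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices on this page are quoted per 1M tokens
BATCH_DISCOUNT = Decimal("0.50")      # Batch API discount stated on this page


def estimate_cost(input_tokens, output_tokens, input_price, output_price, batch=False):
    """Return the estimated cost in USD as a Decimal.

    input_price and output_price are USD per 1M tokens.
    Assumption: the Batch discount applies to input and output alike.
    """
    cost = (Decimal(input_tokens) * Decimal(input_price)
            + Decimal(output_tokens) * Decimal(output_price)) / TOKENS_PER_UNIT
    if batch:
        cost *= (1 - BATCH_DISCOUNT)
    return cost


# gpt-4o rates from this page: $5.00 input, $15.00 output per 1M tokens.
standard = estimate_cost(200_000, 50_000, "5.00", "15.00")
batched = estimate_cost(200_000, 50_000, "5.00", "15.00", batch=True)
print(standard, batched)
```

Under these assumptions, 200,000 input tokens and 50,000 output tokens come to $1.75 at standard rates and $0.875 through the Batch API.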

GPT-4o

GPT-4o is our most advanced multimodal model that’s faster and cheaper than GPT-4 Turbo with stronger vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.

| Model | Input | Output |
| --- | --- | --- |
| gpt-4o | $5.00 / 1M tokens | $15.00 / 1M tokens |
| gpt-4o-2024-05-13 | $5.00 / 1M tokens | $15.00 / 1M tokens |

Vision pricing calculator (example: a single 512 × 512 tile)

Price per 1M tokens (fixed): $5.00
512 × 512 tiles: 1 × 1
Total tiles: 1
Base tokens: 85
Tile tokens: 170 × 1 = 170
Total tokens: 255
Total price: $0.001275

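
The calculator figures above follow a simple formula: a base of 85 tokens plus 170 tokens per 512 × 512 tile, multiplied by the input price. A minimal sketch of that arithmetic is below. The function names are hypothetical, and the caller is assumed to supply the tile counts directly, since this page does not describe how image dimensions map to tiles.

```python
from decimal import Decimal

BASE_TOKENS = 85        # flat per-image token count shown in the calculator
TOKENS_PER_TILE = 170   # tokens per 512 x 512 tile shown in the calculator


def vision_tokens(tiles_wide, tiles_high):
    """Total tokens for an image covered by tiles_wide x tiles_high tiles."""
    return BASE_TOKENS + TOKENS_PER_TILE * tiles_wide * tiles_high


def vision_price(tiles_wide, tiles_high, price_per_million):
    """Price in USD for one image at the given input rate (USD per 1M tokens)."""
    tokens = vision_tokens(tiles_wide, tiles_high)
    return Decimal(tokens) * Decimal(price_per_million) / Decimal(1_000_000)


print(vision_tokens(1, 1))         # 255
print(vision_price(1, 1, "5.00"))  # 0.001275
```

The same formula at the $10.00 rate gives $0.00255 for a single tile.
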
GPT-4 Turbo

GPT-4 Turbo is offered at 128K context with an April 2023 knowledge cutoff and basic support for vision.


| Model | Input | Output |
| --- | --- | --- |
| gpt-4-turbo | $10.00 / 1M tokens | $30.00 / 1M tokens |
| gpt-4-turbo-2024-04-09 | $10.00 / 1M tokens | $30.00 / 1M tokens |

Vision pricing calculator (example: a single 512 × 512 tile)

Price per 1M tokens (fixed): $10.00
512 × 512 tiles: 1 × 1
Total tiles: 1
Base tokens: 85
Tile tokens: 170 × 1 = 170
Total tokens: 255
Total price: $0.00255

GPT-4

Our original GPT-4 model was built with broad general knowledge and domain expertise.

| Model | Input | Output |
| --- | --- | --- |
| gpt-4 | $30.00 / 1M tokens | $60.00 / 1M tokens |
| gpt-4-32k | $60.00 / 1M tokens | $120.00 / 1M tokens |

GPT-3.5 Turbo

GPT-3.5 Turbo is our fast and inexpensive model for simpler tasks.

gpt-3.5-turbo-0125 is the flagship model of this family. It supports a 16K context window and is optimized for dialog.

gpt-3.5-turbo-instruct is an Instruct model and only supports a 4K context window.

| Model | Input | Output |
| --- | --- | --- |
| gpt-3.5-turbo-0125 | $0.50 / 1M tokens | $1.50 / 1M tokens |
| gpt-3.5-turbo-instruct | $1.50 / 1M tokens | $2.00 / 1M tokens |

Assistants API

The Assistants API and its tools make it easy for developers to build AI assistants in their applications. Tokens used by the Assistants API are billed at the chosen language model's per-token input and output rates. Additionally, we charge the following fees for tool usage:

| Tool | Price |
| --- | --- |
| Code interpreter | $0.03 / session |
| File Search | $0.10 / GB of vector storage per day (1 GB free) |

GB refers to binary gigabytes (also known as gibibytes), where 1 GB is 2^30 bytes.
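
To make the tool fees concrete, here is a minimal sketch of the arithmetic using the prices and the 2^30-byte definition given above. The function names are hypothetical, and the sketch assumes that storage above the free 1 GB is billed proportionally, which this page does not spell out.

```python
from decimal import Decimal

BYTES_PER_GB = 2 ** 30                      # binary gigabyte, as defined on this page
PRICE_PER_GB_DAY = Decimal("0.10")          # File Search vector storage
FREE_GB = Decimal(1)                        # the first 1 GB is free
CODE_INTERPRETER_SESSION = Decimal("0.03")  # Code interpreter, per session


def file_search_daily_cost(stored_bytes):
    """Daily vector-storage cost in USD.

    Assumption: usage above the free 1 GB is billed proportionally.
    """
    stored_gb = Decimal(stored_bytes) / Decimal(BYTES_PER_GB)
    billable_gb = max(stored_gb - FREE_GB, Decimal(0))
    return billable_gb * PRICE_PER_GB_DAY


def tool_fees(stored_bytes, days, sessions):
    """Tool fees only; model tokens are billed separately at per-token rates."""
    return (file_search_daily_cost(stored_bytes) * days
            + CODE_INTERPRETER_SESSION * sessions)


print(tool_fees(3 * 2 ** 30, days=30, sessions=10))
```

With 3 GB stored for 30 days and 10 Code interpreter sessions, the sketch yields $6.00 for storage plus $0.30 for sessions.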

Fine-tuning models

Create your own custom models by fine-tuning our base models with your training data. Once you fine-tune a model, you’ll be billed only for the tokens you use in requests to that model.

| Model | Training | Input usage | Output usage |
| --- | --- | --- | --- |
| gpt-3.5-turbo | $8.00 / 1M tokens | $3.00 / 1M tokens | $6.00 / 1M tokens |
| davinci-002 | $6.00 / 1M tokens | $12.00 / 1M tokens | $12.00 / 1M tokens |
| babbage-002 | $0.40 / 1M tokens | $1.60 / 1M tokens | $1.60 / 1M tokens |

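
The fine-tuning rates can be combined into a single estimate, sketched below. The function name `fine_tuning_cost` is hypothetical. The number of training tokens is taken as an input, because this page does not state how it is derived from a dataset.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the fine-tuning table on this page.
FINE_TUNING_PRICES = {
    "gpt-3.5-turbo": {"training": "8.00", "input": "3.00", "output": "6.00"},
    "davinci-002": {"training": "6.00", "input": "12.00", "output": "12.00"},
    "babbage-002": {"training": "0.40", "input": "1.60", "output": "1.60"},
}


def fine_tuning_cost(model, training_tokens, input_tokens=0, output_tokens=0):
    """Estimated USD cost of training plus later usage of a fine-tuned model."""
    prices = FINE_TUNING_PRICES[model]
    total = (Decimal(training_tokens) * Decimal(prices["training"])
             + Decimal(input_tokens) * Decimal(prices["input"])
             + Decimal(output_tokens) * Decimal(prices["output"]))
    return total / Decimal(1_000_000)


# 2M training tokens, then 500K input and 100K output tokens of usage.
print(fine_tuning_cost("gpt-3.5-turbo", 2_000_000, 500_000, 100_000))
```

For this example the estimate is $16.00 for training, $1.50 for input, and $0.60 for output.
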
Embedding models

Build advanced search, clustering, topic modeling, and classification functionality with our embeddings offering.

| Model | Usage |
| --- | --- |
| text-embedding-3-small | $0.02 / 1M tokens |
| text-embedding-3-large | $0.13 / 1M tokens |
| ada v2 | $0.10 / 1M tokens |

Base models

GPT base models are not optimized for instruction-following and are less capable, but they can be effective when fine-tuned for narrow tasks.

| Model | Usage |
| --- | --- |
| davinci-002 | $2.00 / 1M tokens |
| babbage-002 | $0.40 / 1M tokens |

Other models

Image models

Build DALL·E directly into your apps to generate and edit novel images and art. DALL·E 3 is the highest quality model and DALL·E 2 is optimized for lower cost.

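| Model | Quality | Resolution | Price |
| --- | --- | --- | --- |
| DALL·E 3 | Standard | 1024×1024 | $0.040 / image |
| DALL·E 3 | Standard | 1024×1792, 1792×1024 | $0.080 / image |
| DALL·E 3 | HD | 1024×1024 | $0.080 / image |
| DALL·E 3 | HD | 1024×1792, 1792×1024 | $0.120 / image |
| DALL·E 2 | | 1024×1024 | $0.020 / image |
| DALL·E 2 | | 512×512 | $0.018 / image |
| DALL·E 2 | | 256×256 | $0.016 / image |
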
Audio models

Whisper can transcribe speech into text and translate many languages into English.

Text-to-speech (TTS) can convert text into spoken audio.

| Model | Usage |
| --- | --- |
| Whisper | $0.006 / minute (rounded to the nearest second) |
| TTS | $15.00 / 1M characters |
| TTS HD | $30.00 / 1M characters |

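
The audio rates use different units (minutes for Whisper, characters for TTS), so a short sketch of each calculation may help. The function names are hypothetical. Two assumptions are made that this page does not confirm: half-seconds round up when rounding to the nearest second, and characters are counted with Python's `len`, which counts Unicode code points.

```python
from decimal import Decimal, ROUND_HALF_UP

WHISPER_PER_MINUTE = Decimal("0.006")
TTS_PER_MILLION_CHARS = {"TTS": Decimal("15.00"), "TTS HD": Decimal("30.00")}


def whisper_cost(duration_seconds):
    """Transcription cost in USD, with duration rounded to the nearest second.

    Assumption: ties (x.5 seconds) round up.
    """
    seconds = Decimal(str(duration_seconds)).quantize(
        Decimal(1), rounding=ROUND_HALF_UP
    )
    return seconds * WHISPER_PER_MINUTE / Decimal(60)


def tts_cost(text, model="TTS"):
    """Speech synthesis cost in USD, billed per character of input text.

    Assumption: len(text) matches how characters are counted for billing.
    """
    return Decimal(len(text)) * TTS_PER_MILLION_CHARS[model] / Decimal(1_000_000)


print(whisper_cost(600))            # ten minutes of audio
print(tts_cost("a" * 1000, "TTS"))  # 1,000 characters
```

Ten minutes of audio comes to $0.06 with Whisper, and 1,000 characters come to $0.015 with TTS or $0.03 with TTS HD.
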
Older models

We continue to improve our models and periodically retire older, less used models.

| Model | Input | Output |
| --- | --- | --- |
| gpt-4-0125-preview | $10.00 / 1M tokens | $30.00 / 1M tokens |
| gpt-4-1106-preview | $10.00 / 1M tokens | $30.00 / 1M tokens |
| gpt-4-vision-preview | $10.00 / 1M tokens | $30.00 / 1M tokens |
| gpt-3.5-turbo-1106 | $1.00 / 1M tokens | $2.00 / 1M tokens |
| gpt-3.5-turbo-0613 | $1.50 / 1M tokens | $2.00 / 1M tokens |
| gpt-3.5-turbo-16k-0613 | $3.00 / 1M tokens | $4.00 / 1M tokens |
| gpt-3.5-turbo-0301 | $1.50 / 1M tokens | $2.00 / 1M tokens |

Simple and flexible

Pay as you go

To keep things simple and flexible, pay only for the resources you use.

Choose your model

Use the right model for the job. We offer a spectrum of capabilities and price points.

FAQ

You can think of tokens as pieces of words used for natural language processing. For English text, 1 token is approximately 4 characters or 0.75 words. As a point of reference, the collected works of Shakespeare are about 900,000 words or 1.2M tokens.

To learn more about how tokens work and estimate your usage…

  • Experiment with our interactive Tokenizer tool.

  • Log in to your account and enter text into the Playground. The counter in the footer will display how many tokens are in your text.
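
For a quick offline estimate, the rules of thumb above (1 token is about 4 characters or 0.75 English words) can be applied directly. This is only an approximation with hypothetical function names; it is not a tokenizer, and real counts vary with the text.

```python
def tokens_from_words(word_count):
    """Estimate tokens from an English word count (1 token is about 0.75 words)."""
    return round(word_count / 0.75)


def tokens_from_characters(char_count):
    """Estimate tokens from a character count (1 token is about 4 characters)."""
    return round(char_count / 4)


print(tokens_from_words(900_000))  # 1200000, the Shakespeare figure above
print(tokens_from_words(750))      # 1000
```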
