OpenAI API Pricing: Every Model, Every Cost
The OpenAI API offers a range of models at different price points, from the budget GPT-4o-mini at $0.15 per million input tokens to the frontier O3-Pro at $150.00 per million tokens for both input and output. This page covers every available model with input, output, cached, and batch pricing so you can choose the right model for your workload and budget.
GPT-5 Family
| Model | Input / 1M | Output / 1M | Cached Input | Batch (In/Out) | Notes |
|---|---|---|---|---|---|
| GPT-5 | $1.25 | $10.00 | $0.625 | $0.625 / $5.00 | Flagship model |
| GPT-5-mini | $0.25 | $2.00 | $0.125 | $0.125 / $1.00 | Budget GPT-5 |
| GPT-5.2 Instant | $0.50 | $3.00 | $0.25 | $0.25 / $1.50 | Fast variant (Go plan model) |
Reasoning Models
| Model | Input / 1M | Output / 1M | Cached Input | Batch (In/Out) | Notes |
|---|---|---|---|---|---|
| O3 | $2.00 | $8.00 | $1.00 | $1.00 / $4.00 | Advanced reasoning |
| O3-mini | $0.50 | $2.00 | $0.25 | $0.25 / $1.00 | Budget reasoning |
| O3-Pro | $150.00 | $150.00 | N/A | N/A | Maximum capability |
GPT-4o Family (Legacy)
| Model | Input / 1M | Output / 1M | Cached Input | Batch (In/Out) | Notes |
|---|---|---|---|---|---|
| GPT-4o | $2.50 | $10.00 | $1.25 | $1.25 / $5.00 | Previous flagship |
| GPT-4o-mini | $0.15 | $0.60 | $0.075 | $0.075 / $0.30 | Cheapest model |
Embedding Models
| Model | Input / 1M | Output / 1M | Cached Input | Batch (In/Out) | Notes |
|---|---|---|---|---|---|
| text-embedding-3-large | $0.13 | - | - | $0.065 | Best quality embeddings |
| text-embedding-3-small | $0.02 | - | - | $0.01 | Budget embeddings |
All prices in USD per million tokens unless noted. Source: developers.openai.com/api/docs/pricing.
How Tokens Work
What Is a Token?
Tokens are the fundamental units that language models process. In English, one token roughly equals 4 characters or about 0.75 words. A typical sentence contains 15-20 tokens. A full page of text (approximately 500 words) contains about 670 tokens. Common words like "the", "is", and "a" are single tokens, while longer or rarer words may be split into multiple tokens. Punctuation marks are usually separate tokens.
Quick reference:
- 1 sentence ~ 15-20 tokens
- 1 paragraph (100 words) ~ 130 tokens
- 1 page (500 words) ~ 670 tokens
- 1 book (80,000 words) ~ 107K tokens
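The ratios above can be turned into a rough estimator. This is only a heuristic based on the ~0.75 words-per-token figure quoted here; for exact counts you would use a real tokenizer such as OpenAI's tiktoken library, which this sketch deliberately avoids to stay dependency-free:

```python
def estimate_tokens(text: str) -> int:
    """Rough token estimate using the ~0.75 words-per-token ratio above.

    A heuristic only; exact counts require a real tokenizer
    (e.g. OpenAI's tiktoken library).
    """
    words = len(text.split())
    return round(words / 0.75)

# A 500-word page should land near the ~670-token figure quoted above.
page = " ".join(["word"] * 500)
print(estimate_tokens(page))  # 667
```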
Input vs Output Pricing
You pay separately for input tokens (what you send to the model, including your prompt and any context) and output tokens (what the model generates in response). Output tokens are usually more expensive, typically 2-8x the input cost, because generating new text requires more computation than reading existing text.
This pricing structure means you can optimise costs by sending concise prompts (reducing input tokens) and requesting concise responses with the max_tokens parameter (reducing output tokens). For GPT-5, the output-to-input price ratio is 8x ($10.00 vs $1.25), making output optimisation particularly impactful.
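The split-rate arithmetic is simple enough to sketch. A minimal cost function, using the GPT-5 rates from the table above (prices in USD per million tokens):

```python
# Prices in USD per million tokens, from the GPT-5 table above.
GPT5_INPUT = 1.25
GPT5_OUTPUT = 10.00

def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = GPT5_INPUT,
                 output_price: float = GPT5_OUTPUT) -> float:
    """Cost of a single request in USD."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# 2,000 prompt tokens and 500 completion tokens on GPT-5:
print(request_cost(2_000, 500))  # 0.0075
```

Note that the 500 output tokens cost twice as much as the 2,000 input tokens here, which is the 8x price ratio at work.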
Cost Reduction Features
Batch API (50% Discount)
The Batch API allows you to submit large sets of requests for asynchronous processing within a 24-hour window. In exchange for this flexibility on timing, OpenAI charges exactly half the standard rate for both input and output tokens. The results are identical to real-time API calls: same models, same quality, same format.
Best for: content generation pipelines, data classification, email processing, document summarisation, and any workload where you do not need instant results. Not suitable for chatbots, real-time applications, or interactive user experiences.
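A batch job starts from a JSONL file with one request object per line. The sketch below builds those lines with the standard library, following the request shape OpenAI's Batch API documentation describes (custom_id, method, url, body); the file upload and batch creation steps via the openai SDK are omitted, and the model name is just an example:

```python
import json

def build_batch_lines(prompts, model="gpt-5-mini"):
    """Build the JSONL body for a Batch API input file.

    One JSON object per line; format per OpenAI's Batch API docs.
    Uploading the file and creating the batch (openai SDK) are omitted.
    """
    lines = []
    for i, prompt in enumerate(prompts):
        lines.append(json.dumps({
            "custom_id": f"request-{i}",
            "method": "POST",
            "url": "/v1/chat/completions",
            "body": {
                "model": model,
                "messages": [{"role": "user", "content": prompt}],
            },
        }))
    return "\n".join(lines)

jsonl = build_batch_lines(["Summarise this document.", "Classify this email."])
```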
Prompt Caching (50% Input Discount)
When you send the same prompt prefix repeatedly (common in chatbots and RAG applications), OpenAI caches the processed tokens and charges half the standard input rate for cached tokens. This happens automatically; you do not need to opt in. The cache is maintained per model and typically expires after a few minutes of inactivity.
Best for: chatbots with system prompts, RAG applications with consistent context, any application where the beginning of each prompt is the same. Can save 30-50% on input costs for qualifying workloads.
Rate Limit Tiers
| Tier | Qualification | RPM (GPT-5) | TPM (GPT-5) |
|---|---|---|---|
| Free | $0 spent | 3 | 40,000 |
| Tier 1 | $5+ spent | 500 | 200,000 |
| Tier 2 | $50+ spent, 7+ days | 5,000 | 2,000,000 |
| Tier 3 | $100+ spent, 7+ days | 5,000 | 10,000,000 |
| Tier 4 | $250+ spent, 14+ days | 10,000 | 50,000,000 |
| Tier 5 | $1,000+ spent, 30+ days | 10,000 | 150,000,000 |
RPM = Requests per minute. TPM = Tokens per minute. Limits vary by model.
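Staying under an RPM cap client-side can be done with a sliding window. This is a minimal sketch with injected timestamps (not the openai SDK's built-in retry logic, and it ignores the TPM dimension):

```python
from collections import deque

class SlidingWindowLimiter:
    """Minimal client-side RPM throttle: tracks request timestamps in a
    60-second window and reports how long to wait before sending."""

    def __init__(self, rpm: int, window: float = 60.0):
        self.rpm = rpm
        self.window = window
        self.sent = deque()

    def acquire(self, now: float) -> float:
        """Seconds to wait at time `now`; 0.0 means send immediately."""
        # Drop timestamps that have aged out of the window.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()
        if len(self.sent) < self.rpm:
            self.sent.append(now)
            return 0.0
        # Window full: wait until the oldest request ages out.
        return self.sent[0] + self.window - now

# Free tier allows 3 RPM: the fourth request in quick succession must wait.
limiter = SlidingWindowLimiter(rpm=3)
print([limiter.acquire(t) for t in (0.0, 1.0, 2.0, 3.0)])  # [0.0, 0.0, 0.0, 57.0]
```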
Real-World Cost Examples
Customer Support Chatbot
Model: GPT-5-mini | 1,000 conversations/day
Each conversation averages 500 input tokens (system prompt + user message) and 300 output tokens. Using GPT-5-mini keeps costs under $1/day for moderate volume.
Daily cost: $0.73 | Monthly cost: $21.90
Content Generation Pipeline
Model: GPT-5 (Batch) | 100 articles/day
Generating 100 blog articles daily with detailed prompts. Using Batch API cuts costs 50%. Output-heavy workload means output tokens dominate the cost.
Daily cost: $10.12 | Monthly cost: $303.75
RAG Application
Model: GPT-5-mini | 5,000 queries/day
Retrieval-augmented generation with cached system prompt and context. High input volume offset by prompt caching (50% input discount).
Daily cost: $2.63 | Monthly cost: $78.75
Code Review Tool
Model: GPT-5 | 200 reviews/day
Reviewing code diffs with detailed analysis. Using the full GPT-5 model for quality. Each review averages 5K input tokens (code + prompt) and 2.5K output tokens.
Daily cost: $6.25 | Monthly cost: $187.50
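The code review scenario is the one with fully stated token counts, so its figures can be checked directly. A quick sanity check, assuming a 30-day month:

```python
# Reproducing the Code Review Tool figures: GPT-5 at $1.25 in / $10.00 out
# per 1M tokens, 200 reviews/day, 5K input + 2.5K output tokens each.
# (30-day month assumed.)
IN_PRICE, OUT_PRICE = 1.25, 10.00

def daily_cost(requests: int, in_tokens: int, out_tokens: int) -> float:
    per_request = (in_tokens * IN_PRICE + out_tokens * OUT_PRICE) / 1_000_000
    return requests * per_request

daily = daily_cost(200, 5_000, 2_500)
print(f"${daily:.2f}/day, ${daily * 30:.2f}/month")  # $6.25/day, $187.50/month
```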
When to Use the API vs a Subscription
Choose the API When
- You need programmatic access to integrate into your application
- You process large volumes of data automatically
- You need fine-grained control over model parameters
- Your monthly usage would exceed the Plus message limits
- You want to use Batch API for 50% cost savings
- You need to use multiple models for different tasks
Choose a Subscription When
- You primarily interact with ChatGPT through the web interface
- You need Deep Research, image generation, or Sora
- You want a predictable monthly cost with no surprises
- Your usage is moderate (under 80 messages per 3 hours)
- You need team management and admin features
- You prefer a no-code approach without API integration