OpenAI API Pricing

openai.com ↗
📊The OpenAI API pricing page has been updated for clarity and detail.
Main ContentOpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work. Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents. Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks. Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. Data residency and Regional Processing ⁠ (opens in a new window) ⁠ endpoints are charged an additional 10% for all models released after 3/5/26. Explore detailed pricing (opens in a new window) Multimodal models Power applications across text, image, and audio with models built for real-time interaction and rich media generation. GPT-realtime-1.5 Our most capable model for realtime voice interactions. Price Audio: $32.00 for inputs $0.40 for cached inputs $64.00 for outputs Text: $4.00 for inputs $0.40 for cached inputs $16.00 for outputs Image: $5.00 for inputs $0.50 for cached inputs GPT-image-1.5 State-of-the-art image generation model. Price Image: $8.00 for inputs $2.00 for cached inputs $32.00 for outputs Text: $5.00 for inputs $1.25 for cached inputs $10.00 for outputs Tools Extend model capabilities with built-in tools for retrieval, execution, and external data access. Web search Retrieve up-to-date information from the web to ground model responses. Price $10.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container Service tiers Balance performance, predictable costs, and availability based on your needs. Batch API Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Learn more (opens in a new window) Priority processing Offers reliable, high-speed performance with the flexibility to pay-as-you-go. Learn more (opens in a new window) Flex processing Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks. Learn more (opens in a new window) Enterprise offerings Contact our sales team to learn more about Data residency ⁠ (opens in a new window) , Scale Tier ⁠ and Reserved Capacity ⁠ designed for cutting-edge customers running larger workloads. Contact sales FAQ Start creating with OpenAI’s powerful models. Get started (opens in a new window) Contact sales

Alert me when:

0 watching

Export & Integrate

Add .rss or .json to any alert URL

Recent Changes

Main Content
OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work. Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents. Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks. Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. Data residency and Regional Processing ⁠ (opens in a new window) ⁠ endpoints are charged an additional 10% for all models released after 3/5/26. Explore detailed pricing (opens in a new window) Multimodal models Power applications across text, image, audio, and video with models built for real-time interaction and rich media generation. GPT-realtime-1.5 Our most capable model for realtime voice interactions. Price Audio: $32.00 for inputs $0.40 for cached inputs $64.00 for outputs Text: $4.00 for inputs $0.40 for cached inputs $16.00 for outputs Image: $5.00 for inputs $0.50 for cached inputs GPT-image-1.5 State-of-the-art image generation model. Price Image: $8.00 for inputs $2.00 for cached inputs $32.00 for outputs Text: $5.00 for inputs $1.25 for cached inputs $10.00 for outputs Sora-2 Our latest video generation model. Price Price per second: $0.10 For media with dimensions: Size: 720p Portrait: 720 x 1280 Landscape: 1280 x 720 Tools Extend model capabilities with built-in tools for retrieval, execution, and external data access. Web search Retrieve up-to-date information from the web to ground model responses. Price $10.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container Service tiers Balance performance, predictable costs, and availability based on your needs. Batch API Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Learn more (opens in a new window) Priority processing Offers reliable, high-speed performance with the flexibility to pay-as-you-go. Learn more (opens in a new window) Flex processing Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks. Learn more (opens in a new window) Enterprise offerings Contact our sales team to learn more about Data residency ⁠ (opens in a new window) , Scale Tier ⁠ and Reserved Capacity ⁠ designed for cutting-edge customers running larger workloads. Contact sales FAQ Start creating with OpenAI’s powerful models. Get started (opens in a new window) Contact salesOpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work. Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents. Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks. Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. Data residency and Regional Processing ⁠ (opens in a new window) ⁠ endpoints are charged an additional 10% for all models released after 3/5/26. Explore detailed pricing (opens in a new window) Multimodal models Power applications across text, image, and audio with models built for real-time interaction and rich media generation. GPT-realtime-1.5 Our most capable model for realtime voice interactions. Price Audio: $32.00 for inputs $0.40 for cached inputs $64.00 for outputs Text: $4.00 for inputs $0.40 for cached inputs $16.00 for outputs Image: $5.00 for inputs $0.50 for cached inputs GPT-image-1.5 State-of-the-art image generation model. Price Image: $8.00 for inputs $2.00 for cached inputs $32.00 for outputs Text: $5.00 for inputs $1.25 for cached inputs $10.00 for outputs Tools Extend model capabilities with built-in tools for retrieval, execution, and external data access. Web search Retrieve up-to-date information from the web to ground model responses. Price $10.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container Service tiers Balance performance, predictable costs, and availability based on your needs. Batch API Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Learn more (opens in a new window) Priority processing Offers reliable, high-speed performance with the flexibility to pay-as-you-go. Learn more (opens in a new window) Flex processing Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks. Learn more (opens in a new window) Enterprise offerings Contact our sales team to learn more about Data residency ⁠ (opens in a new window) , Scale Tier ⁠ and Reserved Capacity ⁠ designed for cutting-edge customers running larger workloads. Contact sales FAQ Start creating with OpenAI’s powerful models. Get started (opens in a new window) Contact sales
18h ago
Main Content
OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work. Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents. Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks. Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. Data residency and Regional Processing ⁠ (opens in a new window) ⁠ endpoints are charged an additional 10% for all models released after 3/5/26. Explore detailed pricing (opens in a new window) Multimodal models Power applications across text, image, audio, and video with models built for real-time interaction and rich media generation. GPT-realtime-1.5 Our most capable model for realtime voice interactions. Price Audio: $32.00 for inputs $0.40 for cached inputs $64.00 for outputs Text: $4.00 for inputs $0.40 for cached inputs $16.00 for outputs Image: $5.00 for inputs $0.50 for cached inputs GPT-image-1.5 State-of-the-art image generation model. Price Image: $8.00 for inputs $2.00 for cached inputs $32.00 for outputs Text: $5.00 for inputs $1.25 for cached inputs $10.00 for outputs Sora-2 Our latest video generation model. Price Price per second: $0.10 For media with dimensions: Size: 720p Portrait: 720 x 1280 Landscape: 1280 x 720 Tools Extend model capabilities with built-in tools for retrieval, execution, and external data access. Web search Retrieve up-to-date information from the web to ground model responses. Price $25.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container Service tiers Balance performance, predictable costs, and availability based on your needs. Batch API Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Learn more (opens in a new window) Priority processing Offers reliable, high-speed performance with the flexibility to pay-as-you-go. Learn more (opens in a new window) Flex processing Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks. Learn more (opens in a new window) Enterprise offerings Contact our sales team to learn more about Data residency ⁠ (opens in a new window) , Scale Tier ⁠ and Reserved Capacity ⁠ designed for cutting-edge customers running larger workloads. Contact sales FAQ Start creating with OpenAI’s powerful models. Get started (opens in a new window) Contact salesOpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work. Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents. Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks. Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. Data residency and Regional Processing ⁠ (opens in a new window) ⁠ endpoints are charged an additional 10% for all models released after 3/5/26. Explore detailed pricing (opens in a new window) Multimodal models Power applications across text, image, audio, and video with models built for real-time interaction and rich media generation. GPT-realtime-1.5 Our most capable model for realtime voice interactions. Price Audio: $32.00 for inputs $0.40 for cached inputs $64.00 for outputs Text: $4.00 for inputs $0.40 for cached inputs $16.00 for outputs Image: $5.00 for inputs $0.50 for cached inputs GPT-image-1.5 State-of-the-art image generation model. Price Image: $8.00 for inputs $2.00 for cached inputs $32.00 for outputs Text: $5.00 for inputs $1.25 for cached inputs $10.00 for outputs Sora-2 Our latest video generation model. Price Price per second: $0.10 For media with dimensions: Size: 720p Portrait: 720 x 1280 Landscape: 1280 x 720 Tools Extend model capabilities with built-in tools for retrieval, execution, and external data access. Web search Retrieve up-to-date information from the web to ground model responses. Price $10.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container Service tiers Balance performance, predictable costs, and availability based on your needs. Batch API Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Learn more (opens in a new window) Priority processing Offers reliable, high-speed performance with the flexibility to pay-as-you-go. Learn more (opens in a new window) Flex processing Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks. Learn more (opens in a new window) Enterprise offerings Contact our sales team to learn more about Data residency ⁠ (opens in a new window) , Scale Tier ⁠ and Reserved Capacity ⁠ designed for cutting-edge customers running larger workloads. Contact sales FAQ Start creating with OpenAI’s powerful models. Get started (opens in a new window) Contact sales
2d ago
Main Content
OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Explore detailed pricing (opens in a new window) Pricing above reflects standard processing rates for context lengths under 270K. See the full pricing page here ⁠ (opens in a new window) . Data residency and Regional Processing ⁠ (opens in a new window) endpoints are charged an additional 10% for all GPT 5.4 models. To optimize cost and performance for different use cases, we also offer: Batch API⁠ ⁠ (opens in a new window) : Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Priority processing ⁠ ⁠ : offers reliable, high-speed performance with the flexibility to pay-as-you-go. Our APIs Realtime API Build low-latency, multimodal experiences including speech-to-speech. Text Text gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens Audio Audio gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens Image Image gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - Sora Video API Richly detailed, dynamic video generation and remixing with our latest generative model. Models Size Price per second sora-2 Portrait: 720 x 1280 Landscape: 1280 x 720 $0.10 sora-2-pro Portrait: 720 x 1280 Landscape: 1280 x 720 $0.30 sora-2-pro Portrait: 1024 x 1792 Landscape: 1792 x 1024 $0.50 sora-2-pro Portrait: 1080 x 1920 Landscape: 1920 x 1080 $0.70 Image Generation API Precise, high-fidelity image generation and editing with our latest multimodal model. Text Text GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - Image Image GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1 $10.00 / 1M input tokens $2.50 / 1M cached input tokens * $40.00 / 1M output tokens GPT-image-1 $10.00 / 1M input tokens $2.50 / 1M cached input tokens * $40.00 / 1M output tokens GPT-image-1-mini $2.50 / 1M input tokens $0.25 / 1M cached input tokens * $8.00 / 1M output tokens GPT-image-1-mini $2.50 / 1M input tokens $0.25 / 1M cached input tokens * $8.00 / 1M output tokens Prompts are billed similarly to other GPT models. Image outputs cost approximately $0.01 (low), $0.04 (medium), and $0.17 (high) for square images. *available via the Responses API *text output tokens include model reasoning tokens For detailed token usage by image quality and size, see the docs . Built-in tools Extend model capabilities with built-in tools in the API Platform. Tool Cos...OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work. Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents. Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks. Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. Data residency and Regional Processing ⁠ (opens in a new window) ⁠ endpoints are charged an additional 10% for all models released after 3/5/26. Explore detailed pricing (opens in a new window) Multimodal models Power applications across text, image, audio, and video with models built for real-time interaction and rich media generation. GPT-realtime-1.5 Our most capable model for realtime voice interactions. Price Audio: $32.00 for inputs $0.40 for cached inputs $64.00 for outputs Text: $4.00 for inputs $0.40 for cached inputs $16.00 for outputs Image: $5.00 for inputs $0.50 for cached inputs GPT-image-1.5 State-of-the-art image generation model. Price Image: $8.00 for inputs $2.00 for cached inputs $32.00 for outputs Text: $5.00 for inputs $1.25 for cached inputs $10.00 for outputs Sora-2 Our latest video generation model. Price Price per second: $0.10 For media with dimensions: Size: 720p Portrait: 720 x 1280 Landscape: 1280 x 720 Tools Extend model capabilities with built-in tools for retrieval, execution, and external data access. Web search Retrieve up-to-date information from the web to ground model responses. Price $25.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container Service tiers Balance performance, predictable costs, and availability based on your needs. Batch API Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Learn more (opens in a new window) Priority processing Offers reliable, high-speed performance with the flexibility to pay-as-you-go. Learn more (opens in a new window) Flex processing Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks. Learn more (opens in a new window) Enterprise offerings Contact our sales team to learn more about Data residency ⁠ (opens in a new window) , Scale Tier ⁠ and Reserved Capacity ⁠ designed for cutting-edge customers running larger workloads. Contact sales FAQ Start creating with OpenAI’s powerful models. Get started (opens in a new window) Contact sales
7d ago
Main Content
OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. See the full pricing page here ⁠ (opens in a new window) . Data residency and Regional Processing ⁠ (opens in a new window) endpoints are charged an additional 10% for all GPT 5.4 models. To optimize cost and performance for different use cases, we also offer: Batch API⁠ ⁠ (opens in a new window) : Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Priority processing ⁠ ⁠ : offers reliable, high-speed performance with the flexibility to pay-as-you-go. Fine-tuning our models Customize our models to get even higher performance for your specific use cases. GPT-4.1 Fine-tuning price Input: $3.00 / 1M tokens Cached input: $0.75 / 1M tokens Output: $12.00 / 1M tokens Training: $25.00 / 1M tokens GPT-4.1 mini Fine-tuning price Input: $0.80 / 1M tokens Cached input: $0.20 / 1M tokens Output: $3.20 / 1M tokens Training: $5.00 / 1M tokens GPT-4.1 nano Fine-tuning price Input: $0.20 / 1M tokens Cached input: $0.05 / 1M tokens Output: $0.80 / 1M tokens Training: $1.50 / 1M tokens o4-mini Reinforcement fine-tuning price Input: $4.00 / 1M tokens Cached input: $1.00 / 1M tokens Output: $16.00 / 1M tokens Training: $100.00 / training hour Explore detailed pricing (opens in a new window) Our APIs Realtime API Build low-latency, multimodal experiences including speech-to-speech. Text Text gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens Audio Audio gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens Image Image gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - Sora Video API Richly detailed, dynamic video generation and remixing with our latest generative model. Models Size Price per second sora-2 Portrait: 720 x 1280 Landscape: 1280 x 720 $0.10 sora-2-pro Portrait: 720 x 1280 Landscape: 1280 x 720 $0.30 sora-2-pro Portrait: 1024 x 1792 Landscape: 1792 x 1024 $0.50 sora-2-pro Portrait: 1080 x 1920 Landscape: 1920 x 1080 $0.70 Image Generation API Precise, high-fidelity image generation and editing with our latest multimodal model. Text Text GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - Image Image GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1 $10.00 / 1M input tokens $2.50 / 1M cached input tokens * $40.00 / 1M output token...OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Explore detailed pricing (opens in a new window) Pricing above reflects standard processing rates for context lengths under 270K. See the full pricing page here ⁠ (opens in a new window) . Data residency and Regional Processing ⁠ (opens in a new window) endpoints are charged an additional 10% for all GPT 5.4 models. To optimize cost and performance for different use cases, we also offer: Batch API⁠ ⁠ (opens in a new window) : Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Priority processing ⁠ ⁠ : offers reliable, high-speed performance with the flexibility to pay-as-you-go. Our APIs Realtime API Build low-latency, multimodal experiences including speech-to-speech. Text Text gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens Audio Audio gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens Image Image gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - Sora Video API Richly detailed, dynamic video generation and remixing with our latest generative model. Models Size Price per second sora-2 Portrait: 720 x 1280 Landscape: 1280 x 720 $0.10 sora-2-pro Portrait: 720 x 1280 Landscape: 1280 x 720 $0.30 sora-2-pro Portrait: 1024 x 1792 Landscape: 1792 x 1024 $0.50 sora-2-pro Portrait: 1080 x 1920 Landscape: 1920 x 1080 $0.70 Image Generation API Precise, high-fidelity image generation and editing with our latest multimodal model. Text Text GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - Image Image GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1 $10.00 / 1M input tokens $2.50 / 1M cached input tokens * $40.00 / 1M output tokens GPT-image-1 $10.00 / 1M input tokens $2.50 / 1M cached input tokens * $40.00 / 1M output tokens GPT-image-1-mini $2.50 / 1M input tokens $0.25 / 1M cached input tokens * $8.00 / 1M output tokens GPT-image-1-mini $2.50 / 1M input tokens $0.25 / 1M cached input tokens * $8.00 / 1M output tokens Prompts are billed similarly to other GPT models. Image outputs cost approximately $0.01 (low), $0.04 (medium), and $0.17 (high) for square images. *available via the Responses API *text output tokens include model reasoning tokens For detailed token usage by image quality and size, see the docs . Built-in tools Extend model capabilities with built-in tools in the API Platform. Tool Cos...
9d ago
Main Content
OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. See the full pricing page here ⁠ (opens in a new window) . Data residency and Regional Processing ⁠ (opens in a new window) endpoints are charged an additional 10% for all GPT 5.4 models. To optimize cost and performance for different use cases, we also offer: Batch API⁠ ⁠ (opens in a new window) : Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Priority processing ⁠ ⁠ : offers reliable, high-speed performance with the flexibility to pay-as-you-go. Fine-tuning our models Customize our models to get even higher performance for your specific use cases. GPT-4.1 Fine-tuning price Input: $3.00 / 1M tokens Cached input: $0.75 / 1M tokens Output: $12.00 / 1M tokens Training: $25.00 / 1M tokens GPT-4.1 mini Fine-tuning price Input: $0.80 / 1M tokens Cached input: $0.20 / 1M tokens Output: $3.20 / 1M tokens Training: $5.00 / 1M tokens GPT-4.1 nano Fine-tuning price Input: $0.20 / 1M tokens Cached input: $0.05 / 1M tokens Output: $0.80 / 1M tokens Training: $1.50 / 1M tokens o4-mini Reinforcement fine-tuning price Input: $4.00 / 1M tokens Cached input: $1.00 / 1M tokens Output: $16.00 / 1M tokens Training: $100.00 / training hour Explore detailed pricing (opens in a new window) Our APIs Realtime API Build low-latency, multimodal experiences including speech-to-speech. Text Text gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens Audio Audio gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens Image Image gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - Sora Video API Richly detailed, dynamic video generation and remixing with our latest generative model. Models Size Price per second sora-2 Portrait: 720 x 1280 Landscape: 1280 x 720 $0.10 sora-2-pro Portrait: 720 x 1280 Landscape: 1280 x 720 $0.30 sora-2-pro Portrait: 1024 x 1792 Landscape: 1792 x 1024 $0.50 sora-2-pro Portrait: 1080 x 1920 Landscape: 1920 x 1080 $0.70 Image Generation API Precise, high-fidelity image generation and editing with our latest multimodal model. Text Text GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - Image Image GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1 $10.00 / 1M input tokens $2.50 / 1M cached input tokens * $40.00 / 1M output tokens GPT-image-1 $10.00 / 1M input tokens $2.50 / 1M cached input tokens * $40.00 / 1M output tokens GPT-image-1-mini $2.50 / 1M input tokens $0.25 / 1M cached input t...OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. GPT-5.4 Our most capable model for professional work Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents Price Input: $0.750 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.500 / 1M tokens GPT-5.4 nano Our cheapest GPT-5.4-class model for simple high-volume tasks Price Input: $0.20 / 1M tokens Cached input: $0.02 / 1M tokens Output: $1.25 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. See the full pricing page here ⁠ (opens in a new window) . Data residency and Regional Processing ⁠ (opens in a new window) endpoints are charged an additional 10% for all GPT 5.4 models. To optimize cost and performance for different use cases, we also offer: Batch API⁠ ⁠ (opens in a new window) : Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Priority processing ⁠ ⁠ : offers reliable, high-speed performance with the flexibility to pay-as-you-go. Fine-tuning our models Customize our models to get even higher performance for your specific use cases. GPT-4.1 Fine-tuning price Input: $3.00 / 1M tokens Cached input: $0.75 / 1M tokens Output: $12.00 / 1M tokens Training: $25.00 / 1M tokens GPT-4.1 mini Fine-tuning price Input: $0.80 / 1M tokens Cached input: $0.20 / 1M tokens Output: $3.20 / 1M tokens Training: $5.00 / 1M tokens GPT-4.1 nano Fine-tuning price Input: $0.20 / 1M tokens Cached input: $0.05 / 1M tokens Output: $0.80 / 1M tokens Training: $1.50 / 1M tokens o4-mini Reinforcement fine-tuning price Input: $4.00 / 1M tokens Cached input: $1.00 / 1M tokens Output: $16.00 / 1M tokens Training: $100.00 / training hour Explore detailed pricing (opens in a new window) Our APIs Realtime API Build low-latency, multimodal experiences including speech-to-speech. Text Text gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-1.5 $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime $4.00 / 1M input tokens $0.40 / 1M cached input tokens $16.00 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens gpt-realtime-mini $0.60 / 1M input tokens $0.06 / 1M cached input tokens $2.40 / 1M output tokens Audio Audio gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-1.5 $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime $32.00 / 1M input tokens $0.40 / 1M cached input tokens $64.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens gpt-realtime-mini $10.00 / 1M input tokens $0.30 / 1M cached input tokens $20.00 / 1M output tokens Image Image gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-1.5 $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime $5.00 / 1M input tokens $0.50 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - gpt-realtime-mini $0.80 / 1M input tokens $0.08 / 1M cached input tokens - Sora Video API Richly detailed, dynamic video generation and remixing with our latest generative model. Models Size Price per second sora-2 Portrait: 720 x 1280 Landscape: 1280 x 720 $0.10 sora-2-pro Portrait: 720 x 1280 Landscape: 1280 x 720 $0.30 sora-2-pro Portrait: 1024 x 1792 Landscape: 1792 x 1024 $0.50 sora-2-pro Portrait: 1080 x 1920 Landscape: 1920 x 1080 $0.70 Image Generation API Precise, high-fidelity image generation and editing with our latest multimodal model. Text Text GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1.5 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * $10.00 / 1M output tokens * GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1 $5.00 / 1M input tokens $1.25 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - GPT-image-1-mini $2.00 / 1M input tokens $0.20 / 1M cached input tokens * - Image Image GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1.5 $8.00 / 1M input tokens $2.00 / 1M cached input tokens * $32.00 / 1M output tokens GPT-image-1 $10.00 / 1M input tokens $2.50 / 1M cached input tokens * $40.00 / 1M output token...
9d ago