OpenAI API Pricing

openai.com ↗
📊 The OpenAI API pricing has been updated from GPT-5.4 to GPT-5.5.
Flagship models

Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.

Processing modes: Standard, Batch (-50%), Data residency (+10%)

GPT-5.5
A new class of intelligence for coding and professional work.
  Input: $5.00 / 1M tokens
  Cached input: $0.50 / 1M tokens
  Output: $30.00 / 1M tokens

GPT-5.4
A more affordable model for coding and professional work.
  Input: $2.50 / 1M tokens
  Cached input: $0.25 / 1M tokens
  Output: $15.00 / 1M tokens

GPT-5.4 mini
Our strongest mini model yet for coding, computer use, and subagents.
  Input: $0.75 / 1M tokens
  Cached input: $0.075 / 1M tokens
  Output: $4.50 / 1M tokens

Pricing above reflects standard processing rates for context lengths under 270K.

Multimodal models

Power applications across text, image, and audio with models built for real-time interaction and rich media generation.

GPT-Realtime-2
Our most capable model for realtime voice interactions.
  Audio: $32.00 / 1M tokens for inputs, $0.40 / 1M tokens for cached inputs, $64.00 / 1M tokens for outputs
  Text: $4.00 / 1M tokens for inputs, $0.40 / 1M tokens for cached inputs, $24.00 / 1M tokens for outputs
  Image: $5.00 / 1M tokens for inputs, $0.50 / 1M tokens for cached inputs

GPT-Realtime-Translate
A new live translation model that translates speech in real time and keeps pace with the speaker.
  $0.034 per minute / $0.00057 per second

GPT-Realtime-Whisper
A new streaming speech-to-text that transcribes speech live as the speaker talks.
  $0.017 per minute / $0.00028 per second

GPT-Image-2
State-of-the-art image generation model.
  Image: $8.00 / 1M tokens for inputs, $2.00 / 1M tokens for cached inputs, $30.00 / 1M tokens for outputs
  Text: $5.00 / 1M tokens for inputs, $1.25 / 1M tokens for cached inputs

Tools

Extend model capabilities with built-in tools for retrieval, execution, and external data access.

Web search
Retrieve up-to-date information from the web to ground model responses.
  $10.00 / 1k calls. Search content tokens are free.

Containers
Run code and tools in secure, scalable environments alongside your models.
  Now: 1 GB for $0.03 / 64 GB for $1.92 per container
  Starting March 31, 2026: 1 GB for $0.03 / 64 GB for $1.92 per 20-minute session per container

Service tiers

Balance performance, predictable costs, and availability based on your needs.

Batch API
Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours.

Priority processing
Offers reliable, high-speed performance with the flexibility to pay-as-you-go.

Flex processing
Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks.

Enterprise offerings

Contact our sales team to learn more about Data residency, Scale Tier and Reserved Capacity designed for cutting-edge customers running larger workloads.
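As a rough illustration of how the per-token rates and processing modes listed above combine, here is a minimal sketch. The helper name `estimate_cost`, the dictionary keys (which are labels for this example, not API model identifiers), and the assumptions that a processing mode scales every rate by a single factor and that cached tokens are a subset of input tokens are all illustrative and not confirmed by the pricing page. The rates are the standard ones for context lengths under 270K.

```python
# Standard rates in USD per 1M tokens, copied from the flagship listing.
# The keys are hypothetical labels for this sketch, not API identifiers.
RATES = {
    "gpt-5.5": {"input": 5.00, "cached_input": 0.50, "output": 30.00},
    "gpt-5.4": {"input": 2.50, "cached_input": 0.25, "output": 15.00},
    "gpt-5.4-mini": {"input": 0.75, "cached_input": 0.075, "output": 4.50},
}

# Assumption: each processing mode scales every standard rate by one factor
# (Batch -50%, Data residency +10%).
MODE_FACTOR = {"standard": 1.0, "batch": 0.5, "data_residency": 1.1}


def estimate_cost(model, input_tokens, output_tokens, cached_tokens=0, mode="standard"):
    """Estimate the USD cost of one request (hypothetical helper)."""
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    rates = RATES[model]
    # Assumption: cached tokens are billed at the cached rate, the rest at the input rate.
    uncached_tokens = input_tokens - cached_tokens
    usd = (
        uncached_tokens * rates["input"]
        + cached_tokens * rates["cached_input"]
        + output_tokens * rates["output"]
    ) / 1_000_000
    return usd * MODE_FACTOR[mode]


if __name__ == "__main__":
    # 200K input tokens (150K of them cached) and 10K output tokens on GPT-5.4.
    print(estimate_cost("gpt-5.4", 200_000, 10_000, cached_tokens=150_000))
    # The same request under the Batch mode.
    print(estimate_cost("gpt-5.4", 200_000, 10_000, cached_tokens=150_000, mode="batch"))
```

Keeping the rates in a plain dictionary makes it easy to update the numbers when the page changes, which the change log shows happens often.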

Recent Changes

Main Content
Changed in Multimodal models:
  GPT-realtime-1.5 replaced by GPT-Realtime-2 (description unchanged: "Our most capable model for realtime voice interactions.").
    Text output: $16.00 / 1M tokens -> $24.00 / 1M tokens
    Audio, image, and other text rates unchanged.
  Added GPT-Realtime-Translate: "A new live translation model that translates speech in real time and keeps pace with the speaker."
    $0.034 per minute / $0.00057 per second
  Added GPT-Realtime-Whisper: "A new streaming speech-to-text that transcribes speech live as the speaker talks."
    $0.017 per minute / $0.00028 per second
  GPT-image-2 renamed to GPT-Image-2; prices unchanged.
Flagship models, Tools, Service tiers, and Enterprise offerings unchanged.
21d ago
Main Content
Changed in Flagship models:
  "GPT-5.5 (coming soon)" -> "GPT-5.5"; description and prices unchanged (Input: $5.00, Cached input: $0.50, Output: $30.00 / 1M tokens).
All other sections unchanged.
34d ago
Main Content
Changed in Flagship models:
  Added GPT-5.5 (coming soon): "A new class of intelligence for coding and professional work."
    Input: $5.00 / 1M tokens
    Cached input: $0.50 / 1M tokens
    Output: $30.00 / 1M tokens
  GPT-5.4 description: "Our most capable model for professional work." -> "A more affordable model for coding and professional work."; prices unchanged.
  Removed GPT-5.4 nano: "Our cheapest GPT-5.4-class model for simple high-volume tasks."
    Input: $0.20 / 1M tokens
    Cached input: $0.02 / 1M tokens
    Output: $1.25 / 1M tokens
Changed in Multimodal models:
  GPT-realtime-1.5 and GPT-image-2 prices now carry the unit "/ 1M tokens" (for example "$32.00 for inputs" -> "$32.00 / 1M tokens for inputs"); amounts unchanged.
Tools, Service tiers, and Enterprise offerings unchanged.
35d ago
Main Content
Changed in Multimodal models:
  GPT-image-2, Text: removed "$10.00 for outputs"; "$5.00 for inputs" and "$1.25 for cached inputs" unchanged.
All other sections unchanged.
37d ago
Main Content
Changed in Multimodal models:
  GPT-image-1.5 replaced by GPT-image-2 (description unchanged: "State-of-the-art image generation model.").
    Image outputs: $32.00 -> $30.00
    Image inputs ($8.00), image cached inputs ($2.00), and text rates ($5.00 inputs, $1.25 cached inputs, $10.00 outputs) unchanged.
All other sections unchanged.
37d ago
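
The before/after pairs under Recent Changes can also be compared mechanically. The following is a minimal sketch, assuming each snapshot has already been parsed into a flat dictionary of price entries. The helper `diff_prices` and the key names are hypothetical; the sample values are taken from the GPT-realtime-1.5 to GPT-Realtime-2 change recorded above.

```python
def diff_prices(old, new):
    """Return (added, removed, changed) between two price snapshots.

    Both arguments are flat dicts mapping an entry name to a price in USD.
    """
    added = {key: new[key] for key in new.keys() - old.keys()}
    removed = {key: old[key] for key in old.keys() - new.keys()}
    changed = {
        key: (old[key], new[key])
        for key in old.keys() & new.keys()
        if old[key] != new[key]
    }
    return added, removed, changed


# Hypothetical keys; values from the realtime text-output change and the
# newly listed per-minute translation price.
before = {"realtime.text.output": 16.00, "realtime.audio.output": 64.00}
after = {
    "realtime.text.output": 24.00,
    "realtime.audio.output": 64.00,
    "realtime-translate.per_minute": 0.034,
}

if __name__ == "__main__":
    added, removed, changed = diff_prices(before, after)
    print("added:", added)
    print("removed:", removed)
    print("changed:", changed)
```

Comparing key sets first separates newly listed or dropped entries from entries whose price moved, which mirrors the three kinds of change seen in the log.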