Main ContentOpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. Choose your processing mode Standard Batch -50% Data residency +10% GPT-5.5 A new class of intelligence for coding and professional work. Price Input: $5.00 / 1M tokens Cached input: $0.50 / 1M tokens Output: $30.00 / 1M tokens GPT-5.4 A more affordable model for coding and professional work. Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents. Price Input: $0.75 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.50 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. Learn more about Batch Processing (opens in a new window) and Data residency & Regional Processing (opens in a new window) Explore detailed pricing (opens in a new window) Multimodal models Power applications across text, image, and audio with models built for real-time interaction and rich media generation. GPT-realtime-1.5 Our most capable model for realtime voice interactions. Price Audio: $32.00 / 1M tokens for inputs $0.40 / 1M tokens for cached inputs $64.00 / 1M tokens for outputs Text: $4.00 / 1M tokens for inputs $0.40 / 1M tokens for cached inputs $16.00 / 1M tokens for outputs Image: $5.00 / 1M tokens for inputs $0.50 / 1M tokens for cached inputs GPT-image-2 State-of-the-art image generation model. Price Image: $8.00 / 1M tokens for inputs $2.00 / 1M tokens for cached inputs $30.00 / 1M tokens for outputs Text: $5.00 / 1M tokens for inputs $1.25 / 1M tokens for cached inputs Tools Extend model capabilities with built-in tools for retrieval, execution, and external data access. Web search Retrieve up-to-date information from the web to ground model responses. Price $10.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container Service tiers Balance performance, predictable costs, and availability based on your needs. Batch API Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Learn more (opens in a new window) Priority processing Offers reliable, high-speed performance with the flexibility to pay-as-you-go. Learn more (opens in a new window) Flex processing Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks. Learn more (opens in a new window) Enterprise offerings Contact our sales team to learn more about Data residency (opens in a new window) , Scale Tier and Reserved Capacity designed for cutting-edge customers running larger workloads. Contact sales FAQ Start creating with OpenAI’s powerful models. Get started (opens in a new window) Contact sales→OpenAI API Pricing Contact sales Flagship models Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems. Choose your processing mode Standard Batch -50% Data residency +10% GPT-5.5 A new class of intelligence for coding and professional work. Price Input: $5.00 / 1M tokens Cached input: $0.50 / 1M tokens Output: $30.00 / 1M tokens GPT-5.4 A more affordable model for coding and professional work. Price Input: $2.50 / 1M tokens Cached input: $0.25 / 1M tokens Output: $15.00 / 1M tokens GPT-5.4 mini Our strongest mini model yet for coding, computer use, and subagents. Price Input: $0.75 / 1M tokens Cached input: $0.075 / 1M tokens Output: $4.50 / 1M tokens Pricing above reflects standard processing rates for context lengths under 270K. Learn more about Batch Processing (opens in a new window) and Data residency & Regional Processing (opens in a new window) Explore detailed pricing (opens in a new window) Multimodal models Power applications across text, image, and audio with models built for real-time interaction and rich media generation. GPT-Realtime-2 Our most capable model for realtime voice interactions. Price Audio: $32.00 / 1M tokens for inputs $0.40 / 1M tokens for cached inputs $64.00 / 1M tokens for outputs Text: $4.00 / 1M tokens for inputs $0.40 / 1M tokens for cached inputs $24.00 / 1M tokens for outputs Image: $5.00 / 1M tokens for inputs $0.50 / 1M tokens for cached inputs GPT-Realtime-Translate A new live translation model that translates speech in real time and keeps pace with the speaker. Price $0.034 per minute / $0.00057 per second GPT-Realtime-Whisper A new streaming speech-to-text that transcribes speech live as the speaker talks. Price $0.017 per minute / $0.00028 per second GPT-Image-2 State-of-the-art image generation model. Price Image: $8.00 / 1M tokens for inputs $2.00 / 1M tokens for cached inputs $30.00 / 1M tokens for outputs Text: $5.00 / 1M tokens for inputs $1.25 / 1M tokens for cached inputs Tools Extend model capabilities with built-in tools for retrieval, execution, and external data access. Web search Retrieve up-to-date information from the web to ground model responses. Price $10.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container Service tiers Balance performance, predictable costs, and availability based on your needs. Batch API Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours. Learn more (opens in a new window) Priority processing Offers reliable, high-speed performance with the flexibility to pay-as-you-go. Learn more (opens in a new window) Flex processing Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks. Learn more (opens in a new window) Enterprise offerings Contact our sales team to learn more about Data residency (opens in a new window) , Scale Tier and Reserved Capacity designed for cutting-edge customers running larger workloads. Contact sales FAQ Start creating with OpenAI’s powerful models. Get started (opens in a new window) Contact sales