OpenAI Images (gpt-image)
OpenAI · Ranked #1 of 7 in Image Generation APIs
Foundation-model image API with best-in-class docs, a playground, and instruction-following generation/editing baked into the same platform as GPT.
Frontier image gen + editing inside the OpenAI platform

Overview
OpenAI Images (gpt-image) is OpenAI's hosted text-to-image and image-editing API, first released in April 2025 as gpt-image-1, a natively multimodal model built on the same GPT-4o-class image generation that went viral in ChatGPT. Unlike diffusion-based competitors, it is an autoregressive multimodal model that accepts both text and image inputs and produces image outputs, which gives it standout strengths in instruction-following, accurate in-image text rendering, and world-knowledge-aware composition. The product line has since iterated quickly: gpt-image-1-mini (a cheaper, faster variant), gpt-image-1.5 (December 2025, roughly 20% cheaper and a notable quality/text-rendering jump), and gpt-image-2 (the current flagship). All are reachable through the dedicated Images API and the conversational Responses API, the latter enabling multi-turn, high-fidelity iterative edits.
The target user is a developer or product team embedding generation/editing into an app rather than an end-user buying a creative seat. That positioning is validated by marquee launch partners already wiring it in, Adobe (Firefly/Express), Figma, Canva, Wix, Airtable, Instacart, and GoDaddy. Where it wins: best-in-class prompt adherence and legible text (it consistently ranks at or near the top of Artificial Analysis's Text-to-Image Arena), strong multi-image reference editing, transparent-background and format/compression controls, tunable moderation (auto/low), and C2PA provenance metadata baked into every output. Pricing is genuinely usage-based per token, working out to roughly $0.02 / $0.07 / $0.19 per low/medium/high-quality square image on the original gpt-image-1, with later models cheaper.
Where it loses: latency is the dominant complaint, high-quality generations routinely take 30–60 seconds and the API can time out around 180 seconds, which is painful for interactive UX, though streaming partial images and the mini/batch tiers soften this. It is pricier and slower than open-weight diffusion stacks you can self-host, the original model had a noticeable yellow tint, transparency isn't supported on the newest gpt-image-2, and content moderation can produce false positives. Reliability is generally strong (status-page components mostly 99.8–99.99%) but image generation has had recurring incident clusters in 2025–2026. Net: the default choice when you need reliable instruction-following and text-in-image and are willing to pay OpenAI per token and tolerate multi-second latency.
How this score is derived
The APIbenchmarks Index is a weighted sum of four dimensions, each scored on an absolute 0–100 reference scale. See the methodology for every mapping.
| Dimension | Score | Weight | Contribution |
|---|---|---|---|
| Documentation & DXThorough first-party docs at developers.openai.com cover the Images API, Responses API, every parameter, streaming, and a per-image cost calculator, with Python/JS/cURL examples throughout. | 94 | 30% | 28.2 |
| ReliabilityStatus-page components generally report 99.8–99.99% uptime, but image generation specifically has seen recurring elevated-error incidents through 2025–2026 (May/Jun/Aug 2025, Feb 2026). | 92 | 25% | 23.0 |
| Ecosystem & SDKsBacked by OpenAI's huge developer base and official SDKs, with major launch partners (Adobe, Figma, Canva, Wix, Airtable) and availability through Azure OpenAI / Azure AI Foundry. | 90 | 25% | 22.5 |
| AccessibilityStandard API-key access via OpenAI or Azure, simple REST endpoints and official SDKs, though API organization verification is required to use gpt-image and there is no free tier. | 84 | 20% | 16.8 |
| APIbenchmarks Index (ABI) | 90.5 | ||
Table 1. Derivation of the ABI for OpenAI Images (gpt-image). Contribution = score × weight; the index is their sum.
At a glance
- Vendor
- OpenAI
- Pricing model
- Per token (~$0.02-0.25/image)
- Free tier
- No
- Official SDKs
- 4 languages
Pricing
| gpt-image-1-mini | $2.50 in / $8.00 out per 1M tokens | Cheapest/fastest tier; cached input $0.25/1M. Batch: $1.25 in / $4.00 out. |
| gpt-image-1.5 | $8.00 in / $32.00 out per 1M tokens | Dec 2025 model; per-image square approx $0.009/$0.034/$0.133 for low/med/high. Batch: $4 in / $16 out. |
| gpt-image-2 (flagship) | $8.00 in / $30.00 out per 1M tokens | Current top model; per-image square approx $0.006/$0.053/$0.211 low/med/high. Batch: $4 in / $15 out. |
| gpt-image-1 (original) | ~$0.02 / $0.07 / $0.19 per image | Per-image low/medium/high quality (1024x1024); token-based ($5 text in, $10 image in, $40 image out per 1M). |
Key features
- •Text-to-image generation and natural-language image editing (inpainting, object add/remove, background expansion)
- •Multi-image reference input for composition and style transfer
- •Quality tiers: low / medium / high
- •Sizes: 1024x1024 (square), 1536x1024 (landscape), 1024x1536 (portrait)
- •Output formats: PNG, JPEG, WebP with output_compression control
- •Transparent background support (gpt-image-1/1.5; not gpt-image-2)
- •Streaming with partial_images for progressive previews
- •Tunable content moderation (auto/low)
- •C2PA cryptographically-signed provenance metadata
- •Available via Images API and conversational Responses API, and on Azure OpenAI / AI Foundry
Official SDKs
Strengths & trade-offs
- +Best-in-class prompt/instruction following and accurate in-image text rendering; ranks at/near the top of Artificial Analysis's Text-to-Image Arena
- +Native multimodal model accepts text + multiple reference images for editing, inpainting, style transfer and background expansion
- +Conversational Responses API enables multi-turn, high-fidelity iterative edits across turns
- +Per-token usage pricing with low/medium/high quality tiers and a cheaper mini variant plus 50%-off batch pricing
- +Built-in C2PA provenance metadata on every generated image and tunable moderation (auto/low)
- +Adopted by major platforms out of the gate (Adobe, Figma, Canva, Wix, Airtable) and available via Azure OpenAI
- –High latency: high-quality generations commonly take 30–60s and the API can time out around 180s, hurting interactive UX
- –More expensive and slower than self-hosted open-weight diffusion models for high-volume use
- –Original gpt-image-1 had a noticeable yellow/warm color tint
- –gpt-image-2 (newest flagship) does not support transparent backgrounds
- –Content moderation can flag legitimate prompts as false positives
- –Requires API organization verification to access, and image generation has had recurring error-rate incidents in 2025–2026
What developers say
Developers praise instruction-following, text rendering and editing quality, but latency and slow API response times are a consistent, prominent complaint.
“Each request takes 30–60 seconds, which is too slow for my needs.”
Key figures
| Text-to-Image Arena Elo (GPT Image 1 high) | 1131.95 | Artificial Analysis ↗ |
| Text-to-Image Arena Elo (GPT Image 1.5 high, leaderboard) | 1264 (4th overall) | Artificial Analysis ↗ |
| Per-image price, high quality square (gpt-image-1) | ~$0.19 | OpenAI announcement / docs ↗ |
| Per-image price, high quality square (gpt-image-2) | $0.211 | OpenAI image generation guide ↗ |
| Output token price (gpt-image-2) | $30.00 per 1M tokens | OpenAI API pricing ↗ |
| Typical generation latency (high quality) | 30–60 s per request | OpenAI Developer Community ↗ |
| Reported component uptime (Mar–Jun 2026) | 99.80%–99.99% | OpenAI status page ↗ |
Compare OpenAI Images (gpt-image) head to head
Sources
- https://openai.com/index/image-generation-api/
- https://developers.openai.com/api/docs/pricing
- https://developers.openai.com/api/docs/guides/image-generation
- https://artificialanalysis.ai/image/leaderboard/text-to-image
- https://artificialanalysis.ai/image/models/openai-gpt_gpt-image-1--high
- https://status.openai.com/
- https://community.openai.com/t/gpt-image-1-is-realy-slow/1310616
- https://cybernews.com/ai-tools/gpt-image-1-5-review/
- https://techcrunch.com/2025/04/23/openai-makes-its-upgraded-image-generator-available-to-developers/
Figures last verified 2026-06-27. Spotted an error? corrections@apibenchmarks.com
