Data story
AI prices did not fall. They fanned out.
The received wisdom is tidy: AI gets about ten times cheaper every year. It makes a great chart, a line sliding down and to the right, faster than Moore's Law. The line is real. It is also hiding the actual story.
We tracked the published API price of 86 models across 10 providers, back to the day GPT-4 launched in March 2023, with a first-party source on every single number. The data does not show a tidy collapse. It shows a fan opening. The cheap end fell through the floor. The expensive end climbed past anything that existed in 2023. And the gap between them blew open until "the price of AI" became a number that describes nothing.
Here is the figure to keep. Right now, a million tokens of model output costs between $0.10 and $600 depending on which model you call. That is a 6,000x spread, for the same unit, at the same moment. There is no single price of intelligence. There is a price list with four orders of magnitude on it.
| Model | Provider | Date | Price | Confidence | Source |
|---|---|---|---|---|---|
| jamba-large | AI21 | 2025-07-03 | $2 | verified | source |
| jamba-mini | AI21 | 2026-01-01 | $0.2 | inferred | source |
| qwen-flash | Qwen | 2025-07-28 | $0.05 | verified | source |
| qwen-max | Qwen | 2025-01-25 | $1.6 | verified | source |
| qwen-plus | Qwen | 2025-12-01 | $0.4 | verified | source |
| qwen-turbo | Qwen | 2025-04-28 | $0.05 | verified | source |
| qwen3-coder-plus | Qwen | 2025-07-22 | $1 | verified | source |
| qwen3-max | Qwen | 2026-01-23 | $1.2 | verified | source |
| qwen3.5-flash | Qwen | 2026-02-23 | $0.1 | verified | source |
| qwen3.5-plus | Qwen | 2026-02-15 | $0.4 | verified | source |
| nova-2-lite | Amazon Nova | 2025-12-02 | $0.3 | verified | source |
| nova-lite | Amazon Nova | 2024-12-03 | $0.06 | verified | source |
| nova-micro | Amazon Nova | 2024-12-03 | $0.035 | verified | source |
| nova-premier | Amazon Nova | 2025-04-30 | $2.5 | verified | source |
| nova-pro | Amazon Nova | 2024-12-03 | $0.8 | verified | source |
| claude-3-5-haiku-20241022 | Anthropic | 2024-11-04 | $1 | archived | source |
| claude-3-5-haiku-20241022 | Anthropic | 2024-12-03 | $0.8 | archived | source |
| claude-3-5-sonnet-20240620 | Anthropic | 2024-06-20 | $3 | archived | source |
| claude-3-7-sonnet-20250219 | Anthropic | 2025-02-24 | $3 | archived | source |
| claude-3-haiku-20240307 | Anthropic | 2024-03-04 | $0.25 | archived | source |
| claude-3-opus-20240229 | Anthropic | 2024-03-04 | $15 | archived | source |
| claude-3-sonnet-20240229 | Anthropic | 2024-03-04 | $3 | archived | source |
| claude-fable-5 | Anthropic | 2026-06-09 | $10 | verified | source |
| claude-haiku-4-5-20251001 | Anthropic | 2025-10-15 | $1 | verified | source |
| claude-opus-4-1-20250805 | Anthropic | 2025-08-05 | $15 | verified | source |
| claude-opus-4-20250514 | Anthropic | 2025-05-14 | $15 | verified | source |
| claude-opus-4-5-20251101 | Anthropic | 2025-11-01 | $5 | verified | source |
| claude-opus-4-6 | Anthropic | 2026-02-05 | $5 | verified | source |
| claude-opus-4-7 | Anthropic | 2026-04-16 | $5 | verified | source |
| claude-opus-4-8 | Anthropic | 2026-05-28 | $5 | verified | source |
| claude-sonnet-4-20250514 | Anthropic | 2025-05-14 | $3 | verified | source |
| claude-sonnet-4-5-20250929 | Anthropic | 2025-09-29 | $3 | verified | source |
| claude-sonnet-4-6 | Anthropic | 2026-02-17 | $3 | verified | source |
| command-a-plus | Cohere | 2026-05-20 | $2.5 | verified | source |
| command-r | Cohere | 2024-08-30 | $0.15 | verified | source |
| command-r-plus | Cohere | 2024-08-30 | $2.5 | verified | source |
| command-r7b | Cohere | 2024-12-13 | $0.0375 | verified | source |
| deepseek-v4-flash | DeepSeek | 2026-04-24 | $0.14 | verified | source |
| deepseek-v4-pro | DeepSeek | 2026-06-01 | $0.435 | inferred | source |
| gemini-1.5-flash | 2024-08-12 | $0.075 | archived | source | |
| gemini-1.5-pro | 2024-10-01 | $1.25 | archived | source | |
| gemini-2.0-flash | 2025-02-05 | $0.1 | archived | source | |
| gemini-2.5-flash | 2025-06-17 | $0.3 | archived | source | |
| gemini-2.5-flash-lite | 2025-06-17 | $0.1 | archived | source | |
| gemini-2.5-pro | 2025-06-17 | $1.25 | archived | source | |
| gemini-3-flash-preview | 2025-12-17 | $0.5 | verified | source | |
| gemini-3.1-flash-lite | 2026-05-07 | $0.25 | verified | source | |
| gemini-3.1-pro-preview | 2026-02-19 | $2 | verified | source | |
| gemini-3.5-flash | 2026-05-19 | $1.5 | verified | source | |
| codestral-25.08 | Mistral | 2025-07-30 | $0.3 | verified | source |
| devstral-2 | Mistral | 2025-12-09 | $0.4 | verified | source |
| devstral-small-2 | Mistral | 2025-12-09 | $0.1 | verified | source |
| magistral-medium | Mistral | 2025-06-10 | $2 | verified | source |
| magistral-small | Mistral | 2025-06-10 | $0.5 | verified | source |
| ministral-3-14b | Mistral | 2025-12-02 | $0.2 | verified | source |
| ministral-3-3b | Mistral | 2025-12-02 | $0.1 | verified | source |
| ministral-3-8b | Mistral | 2025-12-02 | $0.15 | verified | source |
| mistral-large-3 | Mistral | 2025-12-02 | $0.5 | verified | source |
| mistral-medium-3.5 | Mistral | 2026-04-29 | $1.5 | inferred | source |
| mistral-small-4 | Mistral | 2026-03-16 | $0.1 | verified | source |
| gpt-4 | OpenAI | 2023-03-14 | $30 | archived | source |
| gpt-4-turbo | OpenAI | 2023-11-06 | $10 | archived | source |
| gpt-4.1 | OpenAI | 2025-04-14 | $2 | verified | source |
| gpt-4o | OpenAI | 2024-05-13 | $5 | archived | source |
| gpt-4o | OpenAI | 2024-08-06 | $2.5 | archived | source |
| gpt-4o-mini | OpenAI | 2024-07-18 | $0.15 | archived | source |
| gpt-5 | OpenAI | 2025-08-07 | $1.25 | inferred | source |
| gpt-5.1 | OpenAI | 2025-11-13 | $1.25 | inferred | source |
| gpt-5.2 | OpenAI | 2026-01-15 | $1.75 | inferred | source |
| gpt-5.2-pro | OpenAI | 2026-01-15 | $21 | inferred | source |
| gpt-5.4 | OpenAI | 2026-03-05 | $2.5 | verified | source |
| gpt-5.4-mini | OpenAI | 2026-03-17 | $0.75 | verified | source |
| gpt-5.4-nano | OpenAI | 2026-03-17 | $0.2 | verified | source |
| gpt-5.4-pro | OpenAI | 2026-03-05 | $30 | verified | source |
| gpt-5.5 | OpenAI | 2026-04-24 | $5 | verified | source |
| gpt-5.5-pro | OpenAI | 2026-04-24 | $30 | verified | source |
| o1 | OpenAI | 2024-12-17 | $15 | verified | source |
| o1-mini | OpenAI | 2024-09-12 | $1.1 | verified | source |
| o1-pro | OpenAI | 2025-03-19 | $150 | verified | source |
| o3 | OpenAI | 2025-04-16 | $10 | archived | source |
| o3 | OpenAI | 2025-06-10 | $2 | verified | source |
| o3-deep-research | OpenAI | 2025-06-26 | $10 | verified | source |
| o3-mini | OpenAI | 2025-01-31 | $1.1 | verified | source |
| o3-pro | OpenAI | 2025-06-10 | $20 | verified | source |
| o4-mini | OpenAI | 2025-04-16 | $1.1 | verified | source |
| o4-mini-deep-research | OpenAI | 2025-06-26 | $2 | verified | source |
| grok-4.20-0309-reasoning | xAI | 2026-03-09 | $1.25 | verified | source |
| grok-4.3 | xAI | 2026-05-05 | $1.25 | verified | source |
| grok-build-0.1 | xAI | 2026-06-01 | $1 | inferred | source |
- Coverage
- 86 models, 10 providers
- Cheapest input fell
- 857x
- Input spread today
- 4,286x
- Output spread today
- 6,000x
The method, up front
One paragraph, because the whole thing rests on it. Each row in the dataset is one price observation: a provider, a model, input or output tokens, the dollar price per million tokens, the date it took effect, the date we last checked it against the official page, and a link to that page. We measure the standard published list price, not promotional, batch, or cached rates, because list price is the number you get quoted before you optimize. It is all open, CC BY 4.0, and one pull request away from being corrected if we got something wrong.
The floor fell through the floor
In March 2023, the cheapest way to get a million tokens of frontier output was GPT-4, at 60 dollars. There was no budget tier. That was the price of the good model and the price of the only model. Today the cheapest million input tokens in the dataset costs $0.035, and the cheapest output costs $0.10. The cheapest input price has dropped 857x since the GPT-4 launch.
Take a real workload: 10 million tokens in, 2 million out, a meaty day of coding-agent traffic. On GPT-4 at 2023 prices that job cost about 420 dollars. On the cheapest tier today it costs under a dollar. This is the part everyone already wrote about, so we will not dwell. Competition from open-weight models, distillation, cheaper hardware, and a wave of small models drove the bottom of the market toward zero. True, well covered, move on. Because the moment you look at the other end of the fan, the tidy story breaks.
The flagship tier did not get the memo
If prices fell 10x a year across the board, the flagship tier should have fallen with everything else. It mostly did not, and two of the biggest labs went in opposite directions.
| Flagship line | Price path | What happened |
|---|---|---|
| Anthropic Opus flagship tier | $15 -> $5 input / $75 -> $25 output | Cut 67%, once, at Opus 4.5 (Nov 2025). Held since. |
| OpenAI GPT-5 flagship line | $1.25 -> $5.00 input | Rose about 4x across GPT-5 to GPT-5.5 (Aug 2025 to Apr 2026). |
Anthropic cut its Opus flagship once, in a single step, after holding it flat for over a year. OpenAI's flagship line did the opposite: it got more expensive over its own lifetime, quietly quadrupling its input price in eight months while the headlines said AI was getting cheaper. The cheap tier is not safe from this either. Anthropic's budget Haiku is about four times more expensive than the cheapest Claude used to be. None of these are hikes you would find by checking today's pricing page. The old number is gone.
A ceiling appeared that did not exist in 2023
In 2023 the most expensive model you could call was GPT-4 at 60 dollars per million output tokens. That was the ceiling. Today the ceiling is OpenAI's o1-pro at $600 per million output tokens, ten times higher than the priciest option that existed three years ago. Premium reasoning tiers, models that think for a long time before answering, opened a whole new band of the price list above where the old frontier used to sit. So while the floor was falling hundreds of times over, the ceiling rose about 10x. The fan opened from both ends at once. That is why the spread is 6,000x and not, say, 50x.
Why this happened
Three forces, pulling apart. At the bottom, intelligence became a commodity and got priced like one: open weights set a floor near the cost of the GPUs, and everyone selling a small model has to price against "or you could just run it yourself." At the top, providers stopped competing on flagship sticker price and started competing on capability per token, so the best model has no pressure to get cheaper, and reasoning models gave providers a reason to charge more because they burn more compute per request. In the middle, the unit of value quietly changed: the price per token is falling, but the best models now spend far more tokens per task, thinking out loud before they answer. Cheaper per token, more tokens per job. Your bill is not tracking the chart that goes down and to the right.
Nobody kept the receipts
Here is the uncomfortable part. We could only assemble this by reconstructing it from first-party pages and archives, after the fact. Provider pricing pages have no changelog. When a price changes, the old number is overwritten and gone. When a model is retired, its pricing page often disappears with it. The single most-cited stat in this whole space, the famous "1,000x cheaper," was built by scraping the Wayback Machine, because the primary record had been deleted.
Think about what that means. The industry that bills you by the token does not keep a public, dated, sourced history of what it charged. If you want to know what a job actually cost the day you ran it last spring, the authoritative answer no longer exists on the vendor's own site. That is the gap this dataset exists to close. Every price has a date and a source. Going forward, a bot diffs each provider's official page daily and a human verifies every change before it lands, so the next quiet repricing gets caught the day it happens instead of being reconstructed from an archive two years later. It is the changelog the pricing pages should have shipped and never did.
What it unlocks
The point of keeping receipts is that you can answer questions the live pricing page cannot. You can reprice a historical bill at the rates that were actually in effect on the day, instead of extrapolating from today's number. You can audit a vendor's "we cut prices" announcement against the date the API page actually changed, and find the cuts that shipped with no announcement at all. You can plot the fan yourself, because the whole thing is a CSV. (Point-in-time repricing is also why we built it: it is the engine behind Goei, our cost dashboard.)
The tidy line was never wrong, exactly. It was just an average drawn across a market that stopped having a single price, and the average of cents and hundreds of dollars is a number that has never appeared on anyone's invoice. The receipts are messier than the line. They are also true, and you can check every one of them.
Get the data
Open and CC BY 4.0. Found a wrong price? Send a pull request with the source. That is the whole point.
- GitHub repository - the bitemporal source of truth, methodology, and issues
- Hugging Face dataset - load it with one line of Python
- The dataset endpoint - current prices, JSON endpoints, and how to cite it
- The interactive tool - filter the full history and read any point in time
Citable: DOI 10.5281/zenodo.20730241. Prices validated as of 2026-06-17. Release v2026.06.17-4ab5db2.