Data story

AI prices did not fall. They fanned out.

The received wisdom is tidy: AI gets about ten times cheaper every year. It makes a great chart, a line sliding down and to the right, faster than Moore's Law. The line is real. It is also hiding the actual story.

We tracked the published API price of 108 models across 11 providers, back to the day GPT-4 launched in March 2023, with a first-party source on every single number. The data does not show a tidy collapse. It shows a fan opening: prices fanned out instead of falling together. The cheap end fell through the floor. The expensive end climbed past anything that existed in 2023. And the gap between them blew open until "the price of AI" became a number that describes nothing.

Here is the figure to keep. Right now, a million tokens of model output costs between $0.10 and $600 depending on which model you call. That is a 6,000x spread, for the same unit, at the same moment. There is no single price of intelligence. There is a price list with four orders of magnitude on it.

Model	Provider	Date	Price	Confidence	Source
jamba-large	AI21	2025-07-03	$2	verified	source
jamba-mini	AI21	2026-01-01	$0.2	inferred	source
qwen-flash	Qwen	2025-07-28	$0.05	verified	source
qwen-max	Qwen	2025-01-25	$1.6	verified	source
qwen-plus	Qwen	2025-12-01	$0.4	verified	source
qwen-turbo	Qwen	2025-04-28	$0.05	verified	source
qwen3-coder-plus	Qwen	2025-07-22	$1	verified	source
qwen3-max	Qwen	2026-01-23	$1.2	verified	source
qwen3.5-flash	Qwen	2026-02-23	$0.1	verified	source
qwen3.5-plus	Qwen	2026-02-15	$0.4	verified	source
qwen3.6-flash	Qwen	2026-04-27	$0.25	verified	source
qwen3.6-plus	Qwen	2026-04-02	$0.5	verified	source
qwen3.7-max	Qwen	2026-05-21	$2.5	verified	source
qwen3.7-plus	Qwen	2026-06-03	$0.4	verified	source
nova-2-lite	Amazon Nova	2025-12-02	$0.3	verified	source
nova-2-pro	Amazon Nova	2026-06-22	$1.25	inferred	source
nova-2-pro	Amazon Nova	2026-06-29	$1.25	verified	source
nova-lite	Amazon Nova	2024-12-03	$0.06	verified	source
nova-micro	Amazon Nova	2024-12-03	$0.035	verified	source
nova-premier	Amazon Nova	2025-04-30	$2.5	verified	source
nova-pro	Amazon Nova	2024-12-03	$0.8	verified	source
claude-3-5-haiku-20241022	Anthropic	2024-11-04	$1	archived	source
claude-3-5-haiku-20241022	Anthropic	2024-12-03	$0.8	archived	source
claude-3-5-sonnet-20240620	Anthropic	2024-06-20	$3	archived	source
claude-3-7-sonnet-20250219	Anthropic	2025-02-24	$3	archived	source
claude-3-haiku-20240307	Anthropic	2024-03-04	$0.25	archived	source
claude-3-opus-20240229	Anthropic	2024-03-04	$15	archived	source
claude-3-sonnet-20240229	Anthropic	2024-03-04	$3	archived	source
claude-fable-5	Anthropic	2026-06-09	$10	verified	source
claude-haiku-4-5-20251001	Anthropic	2025-10-15	$1	verified	source
claude-mythos-5	Anthropic	2026-06-22	$10	inferred	source
claude-mythos-5	Anthropic	2026-06-29	$10	verified	source
claude-opus-4-1-20250805	Anthropic	2025-08-05	$15	verified	source
claude-opus-4-20250514	Anthropic	2025-05-14	$15	verified	source
claude-opus-4-5-20251101	Anthropic	2025-11-01	$5	verified	source
claude-opus-4-6	Anthropic	2026-02-05	$5	verified	source
claude-opus-4-7	Anthropic	2026-04-16	$5	verified	source
claude-opus-4-8	Anthropic	2026-05-28	$5	verified	source
claude-opus-5	Anthropic	2026-07-24	$5	verified	source
claude-sonnet-4-20250514	Anthropic	2025-05-14	$3	verified	source
claude-sonnet-4-5-20250929	Anthropic	2025-09-29	$3	verified	source
claude-sonnet-4-6	Anthropic	2026-02-17	$3	verified	source
claude-sonnet-5	Anthropic	2026-07-01	$2	verified	source
command-a-plus	Cohere	2026-05-20	$2.5	archived	source
command-r	Cohere	2024-08-30	$0.15	verified	source
command-r-plus	Cohere	2024-08-30	$2.5	archived	source
command-r7b	Cohere	2024-12-13	$0.0375	verified	source
deepseek-v4-flash	DeepSeek	2026-04-24	$0.14	verified	source
deepseek-v4-pro	DeepSeek	2026-06-01	$0.435	inferred	source
deepseek-v4-pro	DeepSeek	2026-06-22	$0.435	verified	source
gemini-1.5-flash	Google	2024-08-12	$0.075	archived	source
gemini-1.5-pro	Google	2024-10-01	$1.25	archived	source
gemini-2.0-flash	Google	2025-02-05	$0.1	archived	source
gemini-2.5-flash	Google	2025-06-17	$0.3	archived	source
gemini-2.5-flash-lite	Google	2025-06-17	$0.1	archived	source
gemini-2.5-pro	Google	2025-06-17	$1.25	archived	source
gemini-3-flash-preview	Google	2025-12-17	$0.5	verified	source
gemini-3.1-flash-lite	Google	2026-05-07	$0.25	verified	source
gemini-3.1-pro-preview	Google	2026-02-19	$2	verified	source
gemini-3.5-flash	Google	2026-05-19	$1.5	verified	source
gemini-3.5-flash-lite	Google	2026-07-21	$0.3	verified	source
gemini-3.6-flash	Google	2026-07-21	$1.5	verified	source
llama-3.1-405b	meta-llama	2026-06-22	$3.5	inferred	source
llama-3.1-8b	meta-llama	2026-06-22	$0.18	inferred	source
llama-3.3-70b	meta-llama	2026-06-22	$1.04	inferred	source
llama-4-maverick	meta-llama	2025-04-05	$0.27	inferred	source
llama-4-scout	meta-llama	2025-04-05	$0.18	inferred	source
codestral-25.08	Mistral	2025-07-30	$0.3	verified	source
devstral-2	Mistral	2025-12-09	$0.4	verified	source
devstral-small-2	Mistral	2025-12-09	$0.1	verified	source
magistral-medium	Mistral	2025-06-10	$2	verified	source
magistral-small	Mistral	2025-06-10	$0.5	verified	source
ministral-3-14b	Mistral	2025-12-02	$0.2	verified	source
ministral-3-3b	Mistral	2025-12-02	$0.1	verified	source
ministral-3-8b	Mistral	2025-12-02	$0.15	verified	source
mistral-large-3	Mistral	2025-12-02	$0.5	verified	source
mistral-medium-3.5	Mistral	2026-04-29	$1.5	inferred	source
mistral-medium-3.5	Mistral	2026-06-22	$1.5	verified	source
mistral-small-4	Mistral	2026-03-16	$0.1	verified	source
mistral-small-4	Mistral	2026-06-24	$0.15	verified	source
gpt-4	OpenAI	2023-03-14	$30	archived	source
gpt-4-turbo	OpenAI	2023-11-06	$10	archived	source
gpt-4.1	OpenAI	2025-04-14	$2	verified	source
gpt-4o	OpenAI	2024-05-13	$5	archived	source
gpt-4o	OpenAI	2024-08-06	$2.5	archived	source
gpt-4o-mini	OpenAI	2024-07-18	$0.15	archived	source
gpt-5	OpenAI	2025-08-07	$1.25	inferred	source
gpt-5	OpenAI	2026-06-22	$1.25	verified	source
gpt-5.1	OpenAI	2025-11-13	$1.25	inferred	source
gpt-5.1	OpenAI	2026-06-22	$1.25	verified	source
gpt-5.2	OpenAI	2026-01-15	$1.75	inferred	source
gpt-5.2	OpenAI	2026-06-22	$1.75	verified	source
gpt-5.2-pro	OpenAI	2026-01-15	$21	inferred	source
gpt-5.2-pro	OpenAI	2026-06-22	$21	verified	source
gpt-5.3-codex	OpenAI	2026-07-10	$1.75	verified	source
gpt-5.4	OpenAI	2026-03-05	$2.5	verified	source
gpt-5.4-mini	OpenAI	2026-03-17	$0.75	verified	source
gpt-5.4-nano	OpenAI	2026-03-17	$0.2	verified	source
gpt-5.4-pro	OpenAI	2026-03-05	$30	verified	source
gpt-5.5	OpenAI	2026-04-24	$5	verified	source
gpt-5.5-pro	OpenAI	2026-04-24	$30	verified	source
gpt-5.6-luna	OpenAI	2026-07-09	$0.2	inferred	source
gpt-5.6-sol	OpenAI	2026-07-09	$5	verified	source
gpt-5.6-terra	OpenAI	2026-07-09	$2	inferred	source
o1	OpenAI	2024-12-17	$15	verified	source
o1-mini	OpenAI	2024-09-12	$1.1	verified	source
o1-pro	OpenAI	2025-03-19	$150	verified	source
o3	OpenAI	2025-04-16	$10	archived	source
o3	OpenAI	2025-06-10	$2	verified	source
o3-deep-research	OpenAI	2025-06-26	$10	verified	source
o3-mini	OpenAI	2025-01-31	$1.1	verified	source
o3-pro	OpenAI	2025-06-10	$20	verified	source
o4-mini	OpenAI	2025-04-16	$1.1	verified	source
o4-mini-deep-research	OpenAI	2025-06-26	$2	verified	source
grok-4.20-0309-non-reasoning	xAI	2026-06-22	$1.25	inferred	source
grok-4.20-0309-non-reasoning	xAI	2026-06-29	$1.25	verified	source
grok-4.20-0309-reasoning	xAI	2026-03-09	$1.25	verified	source
grok-4.20-multi-agent-0309	xAI	2026-03-31	$1.25	verified	source
grok-4.3	xAI	2026-05-05	$1.25	verified	source
grok-4.5	xAI	2026-07-08	$2	verified	source
grok-build-0.1	xAI	2026-06-01	$1	inferred	source
grok-build-0.1	xAI	2026-06-22	$1	verified	source

Each marker is one model's input price on the date it took effect (log scale, USD per 1M tokens). The dashed line is the cheapest input price available over time. Hover or focus a marker for its first-party source. Output prices fan even wider; they run off the top of this axis at $600.

Coverage: 108 models, 11 providers
Cheapest input fell: 857x
Input spread today: 4,286x
Output spread today: 6,000x

The method, up front

One paragraph, because the whole thing rests on it. Each row in the dataset is one price observation: a provider, a model, input or output tokens, the dollar price per million tokens, the date it took effect, the date we last checked it against the official page, and a link to that page. We measure the standard published list price, not promotional, batch, or cached rates, because list price is the number you get quoted before you optimize. It is all open, CC BY 4.0, and one pull request away from being corrected if we got something wrong.

The floor fell through the floor

In March 2023, the cheapest way to get a million tokens of frontier output was GPT-4, at 60 dollars. There was no budget tier. That was the price of the good model and the price of the only model. Today the cheapest million input tokens in the dataset costs $0.035, and the cheapest output costs $0.10. The cheapest input price has dropped 857x since the GPT-4 launch.

Take a real workload: 10 million tokens in, 2 million out, a meaty day of coding-agent traffic. On GPT-4 at 2023 prices that job cost about 420 dollars. On the cheapest tier today it costs under a dollar. This is the part everyone already wrote about, so we will not dwell. Competition from open-weight models, distillation, cheaper hardware, and a wave of small models drove the bottom of the market toward zero. True, well covered, move on. Because the moment you look at the other end of the fan, the tidy story breaks.

The flagship tier did not get the memo

If prices fell 10x a year across the board, the flagship tier should have fallen with everything else. It mostly did not, and two of the biggest labs went in opposite directions.

Flagship line	Price path	What happened
Anthropic Opus flagship tier	$15 -> $5 input / $75 -> $25 output	Cut 67%, once, at Opus 4.5 (Nov 2025). Held since.
OpenAI GPT-5 flagship line	$1.25 -> $5.00 input	Rose about 4x across GPT-5 to GPT-5.5 (Aug 2025 to Apr 2026).

Anthropic cut its Opus flagship once, in a single step, after holding it flat for over a year. OpenAI's flagship line did the opposite: it got more expensive over its own lifetime, quietly quadrupling its input price in eight months while the headlines said AI was getting cheaper. The cheap tier is not safe from this either. Anthropic's budget Haiku is about four times more expensive than the cheapest Claude used to be. None of these are hikes you would find by checking today's pricing page. The old number is gone.

A ceiling appeared that did not exist in 2023

In 2023 the most expensive model you could call was GPT-4 at 60 dollars per million output tokens. That was the ceiling. Today the ceiling is OpenAI's o1-pro at $600 per million output tokens, ten times higher than the priciest option that existed three years ago. Premium reasoning tiers, models that think for a long time before answering, opened a whole new band of the price list above where the old frontier used to sit. So while the floor was falling hundreds of times over, the ceiling rose about 10x. The fan opened from both ends at once. That is why the spread is 6,000x and not, say, 50x.

Why this happened

Three forces, pulling apart. At the bottom, intelligence became a commodity and got priced like one: open weights set a floor near the cost of the GPUs, and everyone selling a small model has to price against "or you could just run it yourself." At the top, providers stopped competing on flagship sticker price and started competing on capability per token, so the best model has no pressure to get cheaper, and reasoning models gave providers a reason to charge more because they burn more compute per request. In the middle, the unit of value quietly changed: the price per token is falling, but the best models now spend far more tokens per task, thinking out loud before they answer. Cheaper per token, more tokens per job. Your bill is not tracking the chart that goes down and to the right.

Nobody kept the receipts

Here is the uncomfortable part. We could only assemble this by reconstructing it from first-party pages and archives, after the fact. Provider pricing pages have no changelog. When a price changes, the old number is overwritten and gone. When a model is retired, its pricing page often disappears with it. The single most-cited stat in this whole space, the famous "1,000x cheaper," was built by scraping the Wayback Machine, because the primary record had been deleted.

Think about what that means. The industry that bills you by the token does not keep a public, dated, sourced history of what it charged. If you want to know what a job actually cost the day you ran it last spring, the authoritative answer no longer exists on the vendor's own site. That is the gap this dataset exists to close. Every price has a date and a source. Going forward, a bot diffs each provider's official page daily and a human verifies every change before it lands, so the next quiet repricing gets caught the day it happens instead of being reconstructed from an archive two years later. It is the changelog the pricing pages should have shipped and never did.

What it unlocks

The point of keeping receipts is that you can answer questions the live pricing page cannot. You can reprice a historical bill at the rates that were actually in effect on the day, instead of extrapolating from today's number. You can audit a vendor's "we cut prices" announcement against the date the API page actually changed, and find the cuts that shipped with no announcement at all. You can plot the fan yourself, because the whole thing is a CSV. (Point-in-time repricing is also why we built it: it is the engine behind Goei, our cost dashboard.)

The tidy line was never wrong, exactly. It was just an average drawn across a market that stopped having a single price, and the average of cents and hundreds of dollars is a number that has never appeared on anyone's invoice. The receipts are messier than the line. They are also true, and you can check every one of them.

Get the data

Open and CC BY 4.0. Found a wrong price? Send a pull request with the source. That is the whole point.

GitHub repository - the bitemporal source of truth, methodology, and issues
Hugging Face dataset - load it with one line of Python
The dataset endpoint - current prices, JSON endpoints, and how to cite it
The interactive tool - filter the full history and read any point in time

Citable: DOI 10.5281/zenodo.20730240. Prices validated as of 2026-07-31. Release v2026.08.02-caea47c.