Data story

AI prices did not fall. They fanned out.

The received wisdom is tidy: AI gets about ten times cheaper every year. It makes a great chart, a line sliding down and to the right, faster than Moore's Law. The line is real. It is also hiding the actual story.

We tracked the published API price of 86 models across 10 providers, back to the day GPT-4 launched in March 2023, with a first-party source on every single number. The data does not show a tidy collapse. It shows a fan opening. The cheap end fell through the floor. The expensive end climbed past anything that existed in 2023. And the gap between them blew open until "the price of AI" became a number that describes nothing.

Here is the figure to keep. Right now, a million tokens of model output costs between $0.10 and $600 depending on which model you call. That is a 6,000x spread, for the same unit, at the same moment. There is no single price of intelligence. There is a price list with four orders of magnitude on it.

The AI price fan: input price per model, 2023 to 2026roninforge.org/ai-price-indexUSD per 1M tokens (log)$0.1$1$10$100202420252026cheapestAI21QwenAmazon NovaAnthropicCohereDeepSeekGoogleMistralOpenAIxAI
ModelProviderDatePriceConfidenceSource
jamba-largeAI212025-07-03$2verifiedsource
jamba-miniAI212026-01-01$0.2inferredsource
qwen-flashQwen2025-07-28$0.05verifiedsource
qwen-maxQwen2025-01-25$1.6verifiedsource
qwen-plusQwen2025-12-01$0.4verifiedsource
qwen-turboQwen2025-04-28$0.05verifiedsource
qwen3-coder-plusQwen2025-07-22$1verifiedsource
qwen3-maxQwen2026-01-23$1.2verifiedsource
qwen3.5-flashQwen2026-02-23$0.1verifiedsource
qwen3.5-plusQwen2026-02-15$0.4verifiedsource
nova-2-liteAmazon Nova2025-12-02$0.3verifiedsource
nova-liteAmazon Nova2024-12-03$0.06verifiedsource
nova-microAmazon Nova2024-12-03$0.035verifiedsource
nova-premierAmazon Nova2025-04-30$2.5verifiedsource
nova-proAmazon Nova2024-12-03$0.8verifiedsource
claude-3-5-haiku-20241022Anthropic2024-11-04$1archivedsource
claude-3-5-haiku-20241022Anthropic2024-12-03$0.8archivedsource
claude-3-5-sonnet-20240620Anthropic2024-06-20$3archivedsource
claude-3-7-sonnet-20250219Anthropic2025-02-24$3archivedsource
claude-3-haiku-20240307Anthropic2024-03-04$0.25archivedsource
claude-3-opus-20240229Anthropic2024-03-04$15archivedsource
claude-3-sonnet-20240229Anthropic2024-03-04$3archivedsource
claude-fable-5Anthropic2026-06-09$10verifiedsource
claude-haiku-4-5-20251001Anthropic2025-10-15$1verifiedsource
claude-opus-4-1-20250805Anthropic2025-08-05$15verifiedsource
claude-opus-4-20250514Anthropic2025-05-14$15verifiedsource
claude-opus-4-5-20251101Anthropic2025-11-01$5verifiedsource
claude-opus-4-6Anthropic2026-02-05$5verifiedsource
claude-opus-4-7Anthropic2026-04-16$5verifiedsource
claude-opus-4-8Anthropic2026-05-28$5verifiedsource
claude-sonnet-4-20250514Anthropic2025-05-14$3verifiedsource
claude-sonnet-4-5-20250929Anthropic2025-09-29$3verifiedsource
claude-sonnet-4-6Anthropic2026-02-17$3verifiedsource
command-a-plusCohere2026-05-20$2.5verifiedsource
command-rCohere2024-08-30$0.15verifiedsource
command-r-plusCohere2024-08-30$2.5verifiedsource
command-r7bCohere2024-12-13$0.0375verifiedsource
deepseek-v4-flashDeepSeek2026-04-24$0.14verifiedsource
deepseek-v4-proDeepSeek2026-06-01$0.435inferredsource
gemini-1.5-flashGoogle2024-08-12$0.075archivedsource
gemini-1.5-proGoogle2024-10-01$1.25archivedsource
gemini-2.0-flashGoogle2025-02-05$0.1archivedsource
gemini-2.5-flashGoogle2025-06-17$0.3archivedsource
gemini-2.5-flash-liteGoogle2025-06-17$0.1archivedsource
gemini-2.5-proGoogle2025-06-17$1.25archivedsource
gemini-3-flash-previewGoogle2025-12-17$0.5verifiedsource
gemini-3.1-flash-liteGoogle2026-05-07$0.25verifiedsource
gemini-3.1-pro-previewGoogle2026-02-19$2verifiedsource
gemini-3.5-flashGoogle2026-05-19$1.5verifiedsource
codestral-25.08Mistral2025-07-30$0.3verifiedsource
devstral-2Mistral2025-12-09$0.4verifiedsource
devstral-small-2Mistral2025-12-09$0.1verifiedsource
magistral-mediumMistral2025-06-10$2verifiedsource
magistral-smallMistral2025-06-10$0.5verifiedsource
ministral-3-14bMistral2025-12-02$0.2verifiedsource
ministral-3-3bMistral2025-12-02$0.1verifiedsource
ministral-3-8bMistral2025-12-02$0.15verifiedsource
mistral-large-3Mistral2025-12-02$0.5verifiedsource
mistral-medium-3.5Mistral2026-04-29$1.5inferredsource
mistral-small-4Mistral2026-03-16$0.1verifiedsource
gpt-4OpenAI2023-03-14$30archivedsource
gpt-4-turboOpenAI2023-11-06$10archivedsource
gpt-4.1OpenAI2025-04-14$2verifiedsource
gpt-4oOpenAI2024-05-13$5archivedsource
gpt-4oOpenAI2024-08-06$2.5archivedsource
gpt-4o-miniOpenAI2024-07-18$0.15archivedsource
gpt-5OpenAI2025-08-07$1.25inferredsource
gpt-5.1OpenAI2025-11-13$1.25inferredsource
gpt-5.2OpenAI2026-01-15$1.75inferredsource
gpt-5.2-proOpenAI2026-01-15$21inferredsource
gpt-5.4OpenAI2026-03-05$2.5verifiedsource
gpt-5.4-miniOpenAI2026-03-17$0.75verifiedsource
gpt-5.4-nanoOpenAI2026-03-17$0.2verifiedsource
gpt-5.4-proOpenAI2026-03-05$30verifiedsource
gpt-5.5OpenAI2026-04-24$5verifiedsource
gpt-5.5-proOpenAI2026-04-24$30verifiedsource
o1OpenAI2024-12-17$15verifiedsource
o1-miniOpenAI2024-09-12$1.1verifiedsource
o1-proOpenAI2025-03-19$150verifiedsource
o3OpenAI2025-04-16$10archivedsource
o3OpenAI2025-06-10$2verifiedsource
o3-deep-researchOpenAI2025-06-26$10verifiedsource
o3-miniOpenAI2025-01-31$1.1verifiedsource
o3-proOpenAI2025-06-10$20verifiedsource
o4-miniOpenAI2025-04-16$1.1verifiedsource
o4-mini-deep-researchOpenAI2025-06-26$2verifiedsource
grok-4.20-0309-reasoningxAI2026-03-09$1.25verifiedsource
grok-4.3xAI2026-05-05$1.25verifiedsource
grok-build-0.1xAI2026-06-01$1inferredsource
Each marker is one model's input price on the date it took effect (log scale, USD per 1M tokens). The dashed line is the cheapest input price available over time. Hover or focus a marker for its first-party source. Output prices fan even wider; they run off the top of this axis at $600.
Coverage
86 models, 10 providers
Cheapest input fell
857x
Input spread today
4,286x
Output spread today
6,000x

The method, up front

One paragraph, because the whole thing rests on it. Each row in the dataset is one price observation: a provider, a model, input or output tokens, the dollar price per million tokens, the date it took effect, the date we last checked it against the official page, and a link to that page. We measure the standard published list price, not promotional, batch, or cached rates, because list price is the number you get quoted before you optimize. It is all open, CC BY 4.0, and one pull request away from being corrected if we got something wrong.

The floor fell through the floor

In March 2023, the cheapest way to get a million tokens of frontier output was GPT-4, at 60 dollars. There was no budget tier. That was the price of the good model and the price of the only model. Today the cheapest million input tokens in the dataset costs $0.035, and the cheapest output costs $0.10. The cheapest input price has dropped 857x since the GPT-4 launch.

Take a real workload: 10 million tokens in, 2 million out, a meaty day of coding-agent traffic. On GPT-4 at 2023 prices that job cost about 420 dollars. On the cheapest tier today it costs under a dollar. This is the part everyone already wrote about, so we will not dwell. Competition from open-weight models, distillation, cheaper hardware, and a wave of small models drove the bottom of the market toward zero. True, well covered, move on. Because the moment you look at the other end of the fan, the tidy story breaks.

The flagship tier did not get the memo

If prices fell 10x a year across the board, the flagship tier should have fallen with everything else. It mostly did not, and two of the biggest labs went in opposite directions.

Flagship linePrice pathWhat happened
Anthropic Opus flagship tier$15 -> $5 input / $75 -> $25 outputCut 67%, once, at Opus 4.5 (Nov 2025). Held since.
OpenAI GPT-5 flagship line$1.25 -> $5.00 inputRose about 4x across GPT-5 to GPT-5.5 (Aug 2025 to Apr 2026).

Anthropic cut its Opus flagship once, in a single step, after holding it flat for over a year. OpenAI's flagship line did the opposite: it got more expensive over its own lifetime, quietly quadrupling its input price in eight months while the headlines said AI was getting cheaper. The cheap tier is not safe from this either. Anthropic's budget Haiku is about four times more expensive than the cheapest Claude used to be. None of these are hikes you would find by checking today's pricing page. The old number is gone.

A ceiling appeared that did not exist in 2023

In 2023 the most expensive model you could call was GPT-4 at 60 dollars per million output tokens. That was the ceiling. Today the ceiling is OpenAI's o1-pro at $600 per million output tokens, ten times higher than the priciest option that existed three years ago. Premium reasoning tiers, models that think for a long time before answering, opened a whole new band of the price list above where the old frontier used to sit. So while the floor was falling hundreds of times over, the ceiling rose about 10x. The fan opened from both ends at once. That is why the spread is 6,000x and not, say, 50x.

Why this happened

Three forces, pulling apart. At the bottom, intelligence became a commodity and got priced like one: open weights set a floor near the cost of the GPUs, and everyone selling a small model has to price against "or you could just run it yourself." At the top, providers stopped competing on flagship sticker price and started competing on capability per token, so the best model has no pressure to get cheaper, and reasoning models gave providers a reason to charge more because they burn more compute per request. In the middle, the unit of value quietly changed: the price per token is falling, but the best models now spend far more tokens per task, thinking out loud before they answer. Cheaper per token, more tokens per job. Your bill is not tracking the chart that goes down and to the right.

Nobody kept the receipts

Here is the uncomfortable part. We could only assemble this by reconstructing it from first-party pages and archives, after the fact. Provider pricing pages have no changelog. When a price changes, the old number is overwritten and gone. When a model is retired, its pricing page often disappears with it. The single most-cited stat in this whole space, the famous "1,000x cheaper," was built by scraping the Wayback Machine, because the primary record had been deleted.

Think about what that means. The industry that bills you by the token does not keep a public, dated, sourced history of what it charged. If you want to know what a job actually cost the day you ran it last spring, the authoritative answer no longer exists on the vendor's own site. That is the gap this dataset exists to close. Every price has a date and a source. Going forward, a bot diffs each provider's official page daily and a human verifies every change before it lands, so the next quiet repricing gets caught the day it happens instead of being reconstructed from an archive two years later. It is the changelog the pricing pages should have shipped and never did.

What it unlocks

The point of keeping receipts is that you can answer questions the live pricing page cannot. You can reprice a historical bill at the rates that were actually in effect on the day, instead of extrapolating from today's number. You can audit a vendor's "we cut prices" announcement against the date the API page actually changed, and find the cuts that shipped with no announcement at all. You can plot the fan yourself, because the whole thing is a CSV. (Point-in-time repricing is also why we built it: it is the engine behind Goei, our cost dashboard.)

The tidy line was never wrong, exactly. It was just an average drawn across a market that stopped having a single price, and the average of cents and hundreds of dollars is a number that has never appeared on anyone's invoice. The receipts are messier than the line. They are also true, and you can check every one of them.

Get the data

Open and CC BY 4.0. Found a wrong price? Send a pull request with the source. That is the whole point.

Citable: DOI 10.5281/zenodo.20730241. Prices validated as of 2026-06-17. Release v2026.06.17-4ab5db2.