RoninForge / LLM API pricing
Token prices for six providers, taken from their official pricing pages and nothing else. Standard tier, USD per 1 million tokens, with the caching and long-context caveats that quietly change your bill.
Every number was verified against the linked source on 2026-06-10. The spread is real: Mistral Small 4 costs $0.40 for 1M input plus 1M output tokens, while GPT-5.5 Pro costs $210.00 for the same volume.
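The "1M in + 1M out" column in the tables is the input rate plus the output rate. The sketch below shows that sum and how the same per-million rates scale to an arbitrary token count. The function names `combined_price` and `workload_cost` are hypothetical helpers for illustration, not part of any provider SDK. The sketch assumes flat standard-tier rates and ignores caching and tiered long-context pricing.

```python
PER_MILLION = 1_000_000


def combined_price(input_rate: float, output_rate: float) -> float:
    """USD price of 1M input tokens plus 1M output tokens."""
    return input_rate + output_rate


def workload_cost(input_tokens: int, output_tokens: int,
                  input_rate: float, output_rate: float) -> float:
    """USD cost of a workload, given rates in USD per 1M tokens.

    Assumes flat rates: no caching discount, no long-context tier.
    """
    return (input_tokens * input_rate + output_tokens * output_rate) / PER_MILLION


if __name__ == "__main__":
    # Rates copied from the tables (USD per 1M tokens).
    mistral_small_4 = combined_price(0.10, 0.30)
    gpt_5_5_pro = combined_price(30.00, 180.00)

    print(f"Mistral Small 4: ${mistral_small_4:.2f}")
    print(f"GPT-5.5 Pro:     ${gpt_5_5_pro:.2f}")
    print(f"Spread:          {gpt_5_5_pro / mistral_small_4:.0f}x")

    # A workload of 2M input and 500K output tokens at $3.00 / $15.00.
    print(f"Example workload: ${workload_cost(2_000_000, 500_000, 3.00, 15.00):.2f}")
```

Because output tokens cost several times more than input tokens for most models listed, the combined figure is a better guide for balanced workloads than for generation-heavy ones.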
| Model | Input /1M | Cached /1M | Output /1M | 1M in + 1M out | Context |
|---|---|---|---|---|---|
| Claude Fable 5 | $10.00 | $1.00 | $50.00 | $60.00 | 1M |
| Claude Opus 4.8 | $5.00 | $0.50 | $25.00 | $30.00 | 1M |
| Claude Sonnet 4.6 | $3.00 | $0.30 | $15.00 | $18.00 | 1M |
| Claude Haiku 4.5 | $1.00 | $0.10 | $5.00 | $6.00 | - |
| Model | Input /1M | Cached /1M | Output /1M | 1M in + 1M out | Context |
|---|---|---|---|---|---|
| GPT-5.5 Pro (tiered) | $30.00 | - | $180.00 | $210.00 | - |
| GPT-5.5 (tiered) | $5.00 | $0.50 | $30.00 | $35.00 | - |
| GPT-5.4 (tiered) | $2.50 | $0.25 | $15.00 | $17.50 | - |
| GPT-5.3-Codex | $1.75 | $0.175 | $14.00 | $15.75 | - |
| GPT-5.4 mini | $0.75 | $0.075 | $4.50 | $5.25 | - |
| GPT-5.4 nano | $0.20 | $0.02 | $1.25 | $1.45 | - |
| Model | Input /1M | Cached /1M | Output /1M | 1M in + 1M out | Context |
|---|---|---|---|---|---|
| Gemini 3.1 Pro (preview, tiered) | $2.00 | $0.20 | $12.00 | $14.00 | - |
| Gemini 3.5 Flash | $1.50 | $0.15 | $9.00 | $10.50 | - |
| Gemini 2.5 Pro (tiered) | $1.25 | $0.125 | $10.00 | $11.25 | - |
| Gemini 3 Flash (preview) | $0.50 | $0.05 | $3.00 | $3.50 | - |
| Gemini 3.1 Flash-Lite | $0.25 | $0.025 | $1.50 | $1.75 | - |
| Model | Input /1M | Cached /1M | Output /1M | 1M in + 1M out | Context |
|---|---|---|---|---|---|
| DeepSeek V4 Pro | $0.44 | $0.0036 | $0.87 | $1.31 | 1M |
| DeepSeek V4 Flash | $0.14 | $0.0028 | $0.28 | $0.42 | 1M |
| Model | Input /1M | Cached /1M | Output /1M | 1M in + 1M out | Context |
|---|---|---|---|---|---|
| Magistral Medium | $2.00 | - | $5.00 | $7.00 | - |
| Mistral Medium 3.5 | $1.50 | - | $7.50 | $9.00 | - |
| Mistral Large 3 | $0.50 | - | $1.50 | $2.00 | - |
| Codestral | $0.30 | - | $0.90 | $1.20 | - |
| Mistral Small 4 | $0.10 | - | $0.30 | $0.40 | - |
| Model | Input /1M | Cached /1M | Output /1M | 1M in + 1M out | Context |
|---|---|---|---|---|---|
| Grok 4.3 | $1.25 | $0.20 | $2.50 | $3.75 | 1M |
| Grok 4.20 (reasoning) | $1.25 | $0.20 | $2.50 | $3.75 | 1M |
| Grok Build 0.1 | $1.00 | $0.20 | $2.00 | $3.00 | 256K |
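The "Cached /1M" column matters most when a large share of input tokens is reused across requests. The sketch below estimates that effect under one loud assumption: cached input tokens are billed at the cached rate in place of the normal input rate, with no cache-write surcharge, storage fee, or long-context tier. Real provider billing may differ on each of those points. `cost_with_cache` and `cache_hit_ratio` are hypothetical names used for illustration only.

```python
PER_MILLION = 1_000_000


def cost_with_cache(input_tokens: int, output_tokens: int, cache_hit_ratio: float,
                    input_rate: float, cached_rate: float, output_rate: float) -> float:
    """USD cost when a fraction of input tokens is served from cache.

    Assumption: cached tokens are billed at cached_rate instead of
    input_rate, and nothing else about the bill changes.
    """
    if not 0.0 <= cache_hit_ratio <= 1.0:
        raise ValueError("cache_hit_ratio must be between 0 and 1")
    cached_tokens = input_tokens * cache_hit_ratio
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * input_rate
             + cached_tokens * cached_rate
             + output_tokens * output_rate)
    return total / PER_MILLION


if __name__ == "__main__":
    # Claude Sonnet 4.6 rates from the table: $3.00 input, $0.30 cached, $15.00 output.
    rates = dict(input_rate=3.00, cached_rate=0.30, output_rate=15.00)
    for ratio in (0.0, 0.5, 0.8):
        cost = cost_with_cache(10_000_000, 1_000_000, ratio, **rates)
        print(f"cache hit ratio {ratio:.0%}: ${cost:.2f}")
```

Under these assumptions, caching only reduces the input side of the bill, so its effect shrinks as the share of output tokens grows.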
GitHub Copilot now bills these same token rates through AI credits. Estimate your monthly burn against your plan allowance.
Hard daily spend caps and per-branch cost attribution for Claude Code. Reads local telemetry only: zero keys, zero prompts, zero latency added.