AI Model Pricing
Token costs, context windows, and capabilities for every major LLM
126 models tracked · 19 providers
152 models
Understanding AI Model Pricing
AI model pricing is measured in cost per million tokens. Input tokens are what you send to the model — your prompt, system instructions, and context. Output tokens are what the model generates in response. Most providers charge differently for each.
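As a rough illustration of per-million-token pricing, the sketch below estimates the cost of a single request. The function name `request_cost` and the token counts are hypothetical; the prices are the Claude Sonnet 4.5 figures listed in the table on this page ($3.00 input, $15.00 output per 1M tokens).

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Estimate the USD cost of one request from per-1M-token prices."""
    input_cost = input_tokens * input_price_per_m / 1_000_000
    output_cost = output_tokens * output_price_per_m / 1_000_000
    return input_cost + output_cost


# Hypothetical request: 10,000 prompt tokens and 2,000 generated tokens
# at $3.00 / $15.00 per 1M tokens (input / output).
cost = request_cost(10_000, 2_000, 3.00, 15.00)
print(f"${cost:.4f}")  # prints $0.0600
```

Because output is priced five times higher than input here, the 2,000 output tokens cost as much as the 10,000 input tokens.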
Context window determines how much text a model can process at once. Larger windows (1M+ tokens) let you work with entire codebases or long documents, but often come at higher per-token costs.
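Some models support prompt caching, which significantly reduces the cost of input tokens that repeat the same prefix across requests. Reasoning models like the OpenAI o-series and DeepSeek R1 also generate chain-of-thought tokens. These are usually billed as output tokens, and some models list a separate, higher output price for their thinking variant (compare the "(thinking)" rows in the table).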
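To see how prompt caching might change a bill, here is a minimal sketch. It assumes cached prefix tokens are charged at a fraction of the normal input price; the `cached_price_ratio` parameter and its value of 0.1 are illustrative assumptions, not a rate quoted by any provider on this page. The base prices are the GPT-5.1 figures from the table ($1.25 input, $10.00 output per 1M tokens).

```python
def cost_with_cache(cached_tokens: int, uncached_tokens: int, output_tokens: int,
                    input_price_per_m: float, output_price_per_m: float,
                    cached_price_ratio: float) -> float:
    """Estimate USD cost when part of the prompt is served from a cache.

    cached_price_ratio is a hypothetical multiplier (0..1) applied to the
    input price for cached tokens; check your provider's actual rate.
    """
    cached_cost = cached_tokens * input_price_per_m * cached_price_ratio
    uncached_cost = uncached_tokens * input_price_per_m
    output_cost = output_tokens * output_price_per_m
    return (cached_cost + uncached_cost + output_cost) / 1_000_000


# Hypothetical request: an 8,000-token repeated prefix, 2,000 new prompt
# tokens, and 1,000 output tokens.
with_cache = cost_with_cache(8_000, 2_000, 1_000, 1.25, 10.00, cached_price_ratio=0.1)
no_cache = cost_with_cache(0, 10_000, 1_000, 1.25, 10.00, cached_price_ratio=0.1)
print(f"with cache: ${with_cache:.4f}, without: ${no_cache:.4f}")
```

Under these assumed numbers the saving comes entirely from the input side; output tokens, including any chain-of-thought tokens billed as output, are unaffected by caching.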
Pricing sourced from LiteLLM & provider data. Updated daily.
| Model | Provider | Input Price (per 1M tokens) | Output Price (per 1M tokens) | Context Window | Release Date | User Rating |
|---|---|---|---|---|---|---|
| MiniMax M2-her | minimax | $0.30 | $1.20 | 66K tokens | 2026-01-27 | — |
| Kimi K2.5 | moonshotai | $0.60 | $3.00 | 262K tokens | 2026-01-27 | 5/5 (10 votes) |
| Solar Pro 3 | upstage | Free | Free | 128K tokens | 2026-01-27 | — |
| GLM 4.7 Flash | zhipu | $0.07 | $0.40 | 200K tokens | 2026-01-27 | — |
| MiniMax: MiniMax M2.1 | minimax | — | — | — | 2025-12-23 | 5/5 (45 votes) |
| Z.AI: GLM 4.7 | openrouter | — | — | — | 2025-12-22 | 5/5 (114 votes) |
| Gemini 3 Flash Preview | google | $0.50 | $3.00 | 1.0M tokens | 2025-12-17 | 5/5 (51 votes) |
| Mistral Small Creative | mistral | $0.10 | $0.30 | 33K tokens | 2025-12-16 | — |
| MiMo-V2-Flash | xiaomi | Free | Free | 262K tokens | 2025-12-14 | 2.5/5 (50 votes) |
| GPT-5.2 Chat | openai | $1.75 | $14.00 | 128K tokens | 2025-12-10 | 3.7/5 (25 votes) |
| GPT-5.2 | openai | $1.75 | $14.00 | 400K tokens | 2025-12-10 | 4.9/5 (61 votes) |
| GPT-5.2 Pro | openai | $21.00 | $168.00 | 400K tokens | 2025-12-10 | 3.4/5 (42 votes) |
| Mistral: Devstral 2 2512 | mistral | — | — | — | 2025-12-09 | 2.1/5 (81 votes) |
| GPT-5.1 Codex Max | openai | — | — | — | 2025-12-04 | 2.2/5 (45 votes) |
| Amazon Nova 2 Lite | amazon | Free | Free | 1M tokens | 2025-12-02 | — |
| DeepSeek V3.2 Speciale | deepseek | $0.28 | $0.42 | 131K tokens | 2025-12-01 | 3.3/5 (21 votes) |
| DeepSeek V3.2 | deepseek | $0.28 | $0.40 | 164K tokens | 2025-12-01 | 1.9/5 (48 votes) |
| Mistral Large 3 2512 | mistral | — | — | — | 2025-12-01 | 5/5 (19 votes) |
| TNG R1T Chimera | openrouter | — | — | — | 2025-11-27 | — |
| INTELLECT-3 | openrouter | — | — | — | 2025-11-27 | 3.5/5 (13 votes) |
| Claude Opus 4.5 | anthropic | — | — | — | 2025-11-24 | 3.3/5 (789 votes) |
| Bert-Nebulon Alpha | openrouter | — | — | — | 2025-11-24 | 3/5 (46 votes) |
| Grok 4.1 Fast | xai | — | — | — | 2025-11-21 | 2.9/5 (29 votes) |
| Gemini 3 Pro Preview | google | — | — | — | 2025-11-18 | 5/5 (585 votes) |
| Sherlock Dash Alpha | openrouter | — | — | — | 2025-11-15 | — |
| Sherlock Think Alpha | openrouter | — | — | — | 2025-11-15 | — |
| GPT-5.1 | openai | $1.25 | $10.00 | 400K tokens | 2025-11-13 | 3.9/5 (89 votes) |
| GPT-5.1 Chat | openai | $1.25 | $10.00 | 128K tokens | 2025-11-13 | 2.3/5 (18 votes) |
| GPT-5.1-Codex-Mini | openai | $1.50 | $6.00 | 400K tokens | 2025-11-13 | 3.1/5 (165 votes) |
| GPT-5.1-Codex | openai | $1.25 | $10.00 | 400K tokens | 2025-11-13 | 2.5/5 (112 votes) |
| Kimi Linear 48B A3B Instruct | moonshotai | $0.30 | $0.60 | 1.0M tokens | 2025-11-10 | 1.9/5 (22 votes) |
| Kimi K2 Thinking | moonshotai | $0.60 | $2.50 | 262K tokens | 2025-11-06 | 2.3/5 (109 votes) |
| Polaris Alpha | openrouter | Free | Free | 256K tokens | 2025-11-06 | 5/5 (45 votes) |
| Nova Premier 1.0 | amazon | $2.50 | $12.50 | 1M tokens | 2025-10-31 | 1.5/5 (24 votes) |
| Sonar Pro Search | perplexity | $3.00 | $15.00 | 200K tokens | 2025-10-30 | — |
| MiniMax M2 | minimax | Free | Free | 205K tokens | 2025-10-23 | 3.2/5 (242 votes) |
| Andromeda Alpha | openrouter | Free | Free | 128K tokens | 2025-10-21 | — |
| Claude Haiku 4.5 | anthropic | $1.00 | $5.00 | 200K tokens | 2025-10-15 | 4.9/5 (310 votes) |
| GPT-5 Pro | openai | $15.00 | $120.00 | 400K tokens | 2025-10-06 | 3.4/5 (81 votes) |
| Z.AI: GLM 4.6 | openrouter | $0.50 | $1.75 | 203K tokens | 2025-09-30 | 5/5 (432 votes) |
| DeepSeek V3.2 Exp | deepseek | $0.27 | $0.41 | 164K tokens | 2025-09-29 | 3/5 (76 votes) |
| Claude Sonnet 4.5 | anthropic | $3.00 | $15.00 | 200K tokens | 2025-09-29 | 4.3/5 (764 votes) |
| Google: Gemini 2.5 Flash Preview 09-2025 | google | $0.30 | $2.50 | 1.0M tokens | 2025-09-25 | — |
| Google: Gemini 2.5 Flash Lite Preview 09-2025 | google | $0.10 | $0.40 | 1.0M tokens | 2025-09-25 | — |
| GPT-5 Codex | openai | — | — | — | 2025-09-23 | 3.2/5 (496 votes) |
| Qwen3 Coder Plus | qwen | $1.00 | $5.00 | 128K tokens | 2025-09-17 | 2.5/5 (181 votes) |
| Qwen3 Coder Flash | qwen | $0.30 | $1.50 | 128K tokens | 2025-09-17 | — |
| Qwen3 Next 80B A3B Instruct | qwen | $0.15 | $1.50 | 66K tokens | 2025-09-11 | 3.8/5 (80 votes) |
| Qwen3 Next 80B A3B Thinking | qwen | $0.15 | $1.50 | 66K tokens | 2025-09-11 | 3/5 (68 votes) |
| Qwen Plus 0728 (thinking) | qwen | $0.40 | $4.00 | 1M tokens | 2025-09-08 | — |
| Qwen Plus 0728 | qwen | $0.40 | $1.20 | 1M tokens | 2025-09-08 | — |
| Sonoma Dusk Alpha | openrouter | Free | Free | 2M tokens | 2025-09-05 | 5/5 (31 votes) |
| Sonoma Sky Alpha | openrouter | Free | Free | 2M tokens | 2025-09-05 | 3.3/5 (77 votes) |
| Qwen: Qwen3 Max | qwen | $1.20 | $6.00 | 256K tokens | 2025-09-05 | 3.9/5 (182 votes) |
| NVIDIA Nemotron Nano 9B V2 | nvidia | $0.04 | $0.16 | 131K tokens | 2025-09-05 | 2.5/5 (19 votes) |
| MoonshotAI: Kimi K2 0905 | moonshotai | $0.60 | $2.50 | 262K tokens | 2025-09-04 | 3.7/5 (63 votes) |
| Qwen3 30B A3B Thinking 2507 | qwen | $0.07 | $0.28 | 262K tokens | 2025-08-29 | 4.2/5 (58 votes) |
| Grok Code Fast 1 | xai | $0.20 | $1.50 | 256K tokens | 2025-08-26 | 2.2/5 (1314 votes) |
| DeepSeek V3.1 | deepseek | $0.20 | $0.80 | 164K tokens | 2025-08-21 | 3.5/5 (244 votes) |
| Mistral Medium 3.1 | mistral | $0.40 | $2.00 | 131K tokens | 2025-08-13 | 4.5/5 (55 votes) |
| GPT-5 | openai | $1.25 | $10.00 | 400K tokens | 2025-08-07 | 3.8/5 (824 votes) |
| GPT-5 Mini | openai | <$0.01 | <$0.01 | — | 2025-08-07 | 4.4/5 (729 votes) |
| GPT-5 Nano | openai | — | — | — | 2025-08-07 | 2.6/5 (223 votes) |
| Claude Opus 4.1 | anthropic | $15.00 | $75.00 | 200K tokens | 2025-08-05 | 5/5 (338 votes) |
| GPT OSS 20B | openai | $0.05 | $0.20 | 131K tokens | 2025-08-05 | 3.3/5 (878 votes) |
| GPT OSS 120B | openai | $0.15 | $0.60 | 131K tokens | 2025-08-05 | 3.2/5 (596 votes) |
| Horizon Beta | openrouter | Free | Free | 256K tokens | 2025-08-01 | 4.6/5 (305 votes) |
| Horizon Alpha | openrouter | Free | Free | 256K tokens | 2025-07-30 | 3.7/5 (304 votes) |
| Qwen: Qwen3 30B A3B Instruct 2507 | qwen | $0.20 | $0.80 | 131K tokens | 2025-07-29 | 4/5 (157 votes) |
| Qwen: Qwen3 235B A22B Thinking 2507 | qwen | $0.30 | $3.00 | 131K tokens | 2025-07-25 | 4.5/5 (97 votes) |
| Z.AI: GLM 4.5 | zhipu | $0.60 | $2.20 | 128K tokens | 2025-07-25 | 5/5 (229 votes) |
| Z.AI: GLM 4.5 Air | zhipu | $0.20 | $1.10 | 128K tokens | 2025-07-25 | 5/5 (309 votes) |
| Z.AI: GLM 4 32B | zhipu | $0.10 | $0.10 | 128K tokens | 2025-07-24 | 3.8/5 (16 votes) |
| Qwen3 Coder | qwen | $1.00 | $2.00 | — | 2025-07-23 | 5/5 (477 votes) |
| Qwen: Qwen3 235B A22B 2507 | qwen | <$0.01 | <$0.01 | — | 2025-07-21 | 3.9/5 (85 votes) |
| OpenAI o3 | openai | $10.00 | $40.00 | — | 2025-04-16 | 2.4/5 (398 votes) |
| OpenAI o4-mini | openai | $1.10 | $4.40 | — | 2025-04-16 | 2.5/5 (60 votes) |
| OpenAI o4 Mini High | openai | $1.10 | $4.40 | 200K tokens | 2025-04-16 | 3.2/5 (118 votes) |
| GPT-4.1 | openai | $2.00 | $8.00 | 1.0M tokens | 2025-04-14 | 4.1/5 (475 votes) |
| GPT-4.1 Nano | openai | $0.10 | $0.40 | 1.0M tokens | 2025-04-14 | 3.1/5 (95 votes) |
| GPT-4.1 Mini | openai | $0.40 | $1.60 | 1.0M tokens | 2025-04-14 | 3.9/5 (56 votes) |
| Optimus Alpha | openrouter | — | — | 1M tokens | 2025-04-10 | 4.9/5 (92 votes) |
| Grok 3 Mini Beta | xai | $0.40 | $0.80 | 131K tokens | 2025-04-09 | 4/5 (56 votes) |
| Grok 3 Beta | xai | $2.00 | $4.00 | 131K tokens | 2025-04-09 | 3.6/5 (17 votes) |
| Llama 4 Maverick | meta | $1.50 | $2.50 | 1M tokens | 2025-04-05 | 2.2/5 (139 votes) |
| Llama 4 Scout | meta | $0.25 | $0.50 | 10M tokens | 2025-04-05 | 1.9/5 (145 votes) |
| Quasar Alpha | openrouter | — | — | 1M tokens | 2025-04-02 | 2.7/5 (109 votes) |
| ChatGPT-4o (March 2025) | openai | $5.00 | $15.00 | 128K tokens | 2025-03-27 | 2.3/5 (198 votes) |
| Gemini 2.5 Pro Experimental | google | $1.00 | $2.00 | 1M tokens | 2025-03-25 | 4.6/5 (1045 votes) |
| QwQ 32B | qwen | $0.50 | $1.50 | 40K tokens | 2025-03-05 | 3.6/5 (57 votes) |
| GPT-4.5 | openai | $75.00 | $150.00 | 128K tokens | 2025-02-27 | 2.2/5 (203 votes) |
| Claude 3.7 Thinking Sonnet | anthropic | $6.00 | $30.00 | 200K tokens | 2025-02-26 | 4.7/5 (486 votes) |
| Claude 3.7 Sonnet | anthropic | $3.00 | $15.00 | 200K tokens | 2025-02-25 | 4/5 (997 votes) |
| Grok 3 Thinking | xai | — | — | 128K tokens | 2025-02-19 | 2.2/5 (90 votes) |
| Grok 3 | xai | — | — | 128K tokens | 2025-02-18 | 3.4/5 (124 votes) |
| DeepSeek R1 | deepseek | $0.55 | $2.19 | 66K tokens | 2025-02-01 | 2.9/5 (344 votes) |
| Trinity Large Preview | arcee-ai | Free | Free | 131K tokens | 2025-01-27 | — |
| Gemini 2.0 Pro Experimental | google | — | — | 2M tokens | 2025-01-01 | 3.1/5 (19 votes) |
| o3 Mini | openai | $1.10 | $4.40 | 64K tokens | 2024-12-15 | 2.8/5 (53 votes) |
| Gemini 2.0 Flash Thinking | google | $0.25 | $0.50 | 500K tokens | 2024-12-11 | 4.8/5 (51 votes) |
| o1 | openai | $15.00 | $60.00 | 128K tokens | 2024-12-05 | 2.6/5 (67 votes) |
| Mistral Large 2 | mistral | $2.00 | $6.00 | 128K tokens | 2024-07-24 | 2.4/5 (23 votes) |
| Llama 3.1 70B (Instruct) | meta | $0.59 | $0.79 | 128K tokens | 2024-07-23 | 1.9/5 (43 votes) |
| Llama 3.1 405B | meta | $2.70 | $3.10 | 128K tokens | 2024-07-23 | — |
| GPT-4o mini | openai | $0.15 | $0.60 | 128K tokens | 2024-07-18 | 2.7/5 (224 votes) |
| Claude Sonnet 3.6 (2022-10-22) | anthropic | $3.00 | $15.00 | 200K tokens | 2024-06-01 | 2.3/5 (59 votes) |
| GPT-4o (Omni) | openai | $2.50 | $10.00 | 128K tokens | 2024-05-13 | 2.3/5 (198 votes) |
| Llama 3 70B | meta | $0.59 | $0.79 | 8K tokens | 2024-04-18 | — |
| DeepSeek V3 (March 2024) | deepseek | $0.14 | $0.28 | 128K tokens | 2024-03-24 | 4.2/5 (135 votes) |
| Claude 3 Haiku | anthropic | $0.25 | $1.25 | 200K tokens | 2024-03-04 | 2.8/5 (11 votes) |
| Claude 3 Opus | anthropic | $15.00 | $75.00 | 200K tokens | 2024-03-04 | 1.6/5 (28 votes) |
| Mistral Large | mistral | $2.00 | $6.00 | 32K tokens | 2024-02-26 | 3.3/5 (38 votes) |
| Gemini 1.5 Pro | google | $3.50 | $10.50 | 1M tokens | 2024-02-15 | 3/5 (22 votes) |
| Llama 4 Behemoth | meta | — | — | 128K tokens | Coming Soon (In Training) | — |
| xAI: Grok 4 Fast (free) | xai | Free | Free | 2M tokens | 2025-09-19 | 4.3/5 (45 votes) |
| Mistral Devstral Medium | mistral | $0.40 | $2.00 | — | 2025-07-11 | 2.6/5 (100 votes) |
| Mistral Devstral Small 1.1 | mistral | $0.10 | $0.30 | — | 2025-07-11 | 3.4/5 (116 votes) |
| Kimi K2 | moonshotai | $0.57 | $2.30 | — | 2025-07-11 | 4.5/5 (216 votes) |
| xAI: Grok 4 | xai | $3.00 | $15.00 | 256K tokens | 2025-07-09 | 2.5/5 (252 votes) |
| Google: Gemma 3n 2B | google | Free | Free | 8K tokens | 2025-07-09 | 1.9/5 (17 votes) |
| Cypher Alpha (free) | openrouter | — | — | — | 2025-07-01 | 1.5/5 (132 votes) |
| Inception: Mercury | inception | $10.00 | $10.00 | 32K tokens | 2025-06-26 | 1.9/5 (42 votes) |
| Gemini 2.5 Flash Lite Preview 06-17 | google | $0.10 | $0.40 | 1.0M tokens | 2025-06-17 | 5/5 (69 votes) |
| MiniMax M1 | minimax | $0.30 | $1.65 | 1M tokens | 2025-06-17 | 1.7/5 (58 votes) |
| Gemini 2.5 Pro Preview 06-05 | google | $1.25 | $10.00 | 1.0M tokens | 2025-06-05 | 5/5 (214 votes) |
| DeepSeek R1 0528 | deepseek | Free | Free | 164K tokens | 2025-05-28 | 3.2/5 (179 votes) |
| Claude Opus 4 | anthropic | $15.00 | $75.00 | 200K tokens | 2025-05-22 | 5/5 (550 votes) |
| Claude Sonnet 4 | anthropic | $3.00 | $15.00 | 200K tokens | 2025-05-22 | 4.1/5 (919 votes) |
| Gemini 2.5 Flash Preview 05-20 | google | $0.15 | $0.60 | 1.0M tokens | 2025-05-20 | 4.2/5 (20 votes) |
| Gemini 2.5 Flash Preview 05-20 (thinking) | google | $0.15 | $3.50 | 1.0M tokens | 2025-05-20 | 4.4/5 (28 votes) |
| Gemma 3n 4B | google | Free | Free | 33K tokens | 2025-05-20 | 4.1/5 (126 votes) |
| OpenAI Codex Mini | openai | $1.50 | $6.00 | 200K tokens | 2025-05-16 | 3.8/5 (77 votes) |
| Mistral Medium 3 | mistral | $0.40 | $2.00 | 131K tokens | 2025-05-07 | 3.6/5 (96 votes) |
| Gemini 2.5 Pro (I/O Edition) | google | $1.25 | $10.00 | — | 2025-05-06 | 4.3/5 (329 votes) |
| DeepSeek Prover V2 | deepseek | Free | Free | 164K tokens | 2025-04-30 | 1.9/5 (18 votes) |
| Qwen3 0.6B | qwen | Free | Free | 33K tokens | 2025-04-29 | 1.6/5 (122 votes) |
| Qwen3 30B A3B | qwen | Free | Free | 41K tokens | 2025-04-28 | 2.3/5 (301 votes) |
| Qwen3 235B A22B | qwen | — | — | 33K tokens | 2025-04-28 | 3/5 (529 votes) |
| Gemini 2.5 Flash Preview | google | $0.15 | $0.60 | 1.0M tokens | 2025-04-17 | 3/5 (210 votes) |
| Gemini 2.5 Flash Preview (thinking) | google | $0.17 | $3.50 | 1.0M tokens | 2025-04-17 | 3.3/5 (126 votes) |
| Gemma 3 12B | google | $0.03 | $0.03 | — | 2025-03-13 | 2.3/5 (359 votes) |
| Gemma 3 27B | google | $0.09 | $0.17 | — | 2025-03-12 | 2.6/5 (297 votes) |
| Mistral Nemo | mistral | $0.03 | $0.07 | 128K tokens | 2024-07-19 | 2.7/5 (77 votes) |
| Golden Gate Claude | anthropic | — | — | 200K tokens | 2024-05-23 | — |
| Claude 3 Sonnet | anthropic | $3.00 | $15.00 | — | 2024-03-05 | 2.7/5 (14 votes) |
| Gemini Pro 1.0 | google | $0.50 | $1.50 | 33K tokens | 2023-12-13 | — |
| PaLM 2 Chat | google | $1.00 | $2.00 | 8K tokens | 2023-07-20 | — |
| Claude 2 | anthropic | $8.00 | $24.00 | 100K tokens | 2023-06-11 | 3.9/5 (18 votes) |
| GPT-4 | openai | $30.00 | $60.00 | 8K tokens | 2023-05-29 | 1.1/5 (35 votes) |
| GPT-3.5 Turbo | openai | $0.50 | $1.50 | 16K tokens | 2022-03-15 | 2.2/5 (58 votes) |
| GPT-2 | openai | — | — | 1K tokens | 2019-11-05 | 1.5/5 (22 votes) |
| GPT-1 | openai | — | — | 1K tokens | 2018-06-11 | 1.8/5 (29 votes) |