Model Costs
Live token pricing across all major AI providers · Updated daily
Provider:
Tier:
| Provider | Model | Context | Input / 1M tokens ↑ | Output / 1M tokens | Tier | Notes |
|---|---|---|---|---|---|---|
| Groq | Llama 3.1 8B | 128K | $0.05 | $0.08 | budget | Extremely low-cost, high-speed option |
| Groq | Llama 4 Scout 17B | 128K | $0.11 | $0.34 | budget | Ultra-fast inference on Groq hardware |
| Mistral | Mistral Small 4 | 128K | $0.15 | $0.60 | budget | Very cost-effective multimodal reasoning model |
| Cohere | Command R | large | $0.15 | $0.60 | budget | Efficient model for balanced RAG and tool-use tasks |
| Gemini 3.1 Flash-Lite | large | $0.25 | $1.50 | budget | Ultra-efficient lite variant for high-volume use | |
| Mistral | Mistral Medium 3 | 131K | $0.40 | $2.00 | budget | Strong mid-tier model with good speed and caching support |
| Groq | Llama 3.3 70B | 128K | $0.59 | $0.79 | budget | Versatile high-performance open model |
| OpenAI | GPT-5.4 mini | 400K | $0.75 | $4.50 | budget | Efficient model for coding, computer use, and sub-agents |
| Together AI | Llama 3.3 70B | 128K | $0.88 | $0.88 | budget | Competitive serverless pricing for open models |
| Anthropic | Claude Haiku 4.5 | 1M | $1.00 | $5.00 | budget | Fastest and most cost-efficient Claude variant |
| xAI | Grok 4.3 | 1M | $1.25 | $2.50 | standard | New flagship with excellent tool calling and low hallucination rate |
| xAI | Grok 4.20 | 2M | $1.25 | $2.50 | standard | High-speed agentic model available in reasoning and non-reasoning variants |
| Gemini 3.5 Flash | hundreds of K | $1.50 | $9.00 | standard | Fast, cost-effective multimodal model | |
| Gemini 3.1 Pro | 1M+ | $2.00 | $12.00 | premium | High-performance model with tiered long-context pricing (≤200K: $2.00, >200K: $4.00 input) | |
| OpenAI | GPT-5.4 | 400K-1M | $2.50 | $15.00 | standard | Strong balance of performance and cost for coding and professional tasks |
| Cohere | Command R+ | large | $2.50 | $10.00 | standard | High-performance model optimized for RAG, agents, and multilingual use |
| Anthropic | Claude Sonnet 4.6 | 1M | $3.00 | $15.00 | standard | Balanced intelligence for complex tasks with full context at standard rates |
| OpenAI | GPT-5.5 | 1M | $5.00 | $30.00 | premium | Flagship multimodal reasoning model with advanced agentic capabilities |
| Anthropic | Claude Opus 4.7 | 1M | $5.00 | $25.00 | premium | Top-tier model with strong prompt caching (up to 90% savings) and extended thinking |
Prices in USD per 1M tokens · Updated daily · Always verify against official provider documentation