Token-level pricing across 14 LLM API providers, covering 24 models. Costs are listed per 1 million tokens and updated monthly from verified vendor pricing pages.
Cheapest Input
$0.05/1M
Llama (Meta)
Median Input
$1/1M
across all models
Cheapest Output
$0.1/1M
Llama (Meta)
Providers Tracked
14
24 model tiers
Prices in USD per 1 million tokens. Ranges shown when a provider offers multiple model tiers at different price points.
| Provider | Input / 1M | Output / 1M | Free Tier |
|---|---|---|---|
| Llama (Meta)Cheapest | $0.05 - $0.19 | $0.1 - $0.49 | Yes |
| Llama 3.1 | $0.05 - $0.19 | $0.1 - $0.49 | Yes |
| Replicate | $0.1 | $0.5 | Yes |
| Mistral AI | $0.1 - $2 | $0.3 - $6 | No |
| DeepSeek | $0.14 - $1.74 | $0.28 - $3.48 | Yes |
| DeepSeek V3 | $0.14 - $1.74 | $0.28 - $3.48 | Yes |
| Phi-3 | $0.14 | $0.56 | Yes |
| Google AI Studio | $0.15 - $1.25 | $0.6 - $10 | Yes |
| Cohere | $0.15 - $2.5 | $0.6 - $10 | Yes |
| OpenAI API | $0.75 - $2.5 | $4.5 - $15 | No |
| Anthropic API (Claude) | $1 - $5 | $5 - $25 | Yes |
| Mistral Large | $2 | $6 | Yes |
| GPT-4o | $2.5 | $10 | Yes |
| Command R+ | $2.5 | $10 | Yes |
These products are sold as monthly subscriptions to end users, not as API access. Token pricing does not apply.
| Product | Starting Price | Free Plan |
|---|---|---|
| Meta AI | Free | Yes |
| Qwen 2.5 | Free | Yes |
| Groq | Free | Yes |
| Mistral Small | $0.1/mo | No |
| Claude 3.7 Sonnet | $3/mo | No |
| Amazon Nova | $4.75/mo | No |
| Grok 2 | $8/mo | Yes |
| Hugging Face | $9/mo | Yes |
| Grok | $16/mo | Yes |
| Gemini Advanced | $19.99/mo | No |
| ChatGPT | $20/mo | Yes |
| Claude | $20/mo | Yes |
| Google Gemini | $20/mo | Yes |
| Claude 3.5 Sonnet | $20/mo | Yes |
| Gemini 1.5 Pro | $20/mo | Yes |
| OpenAI o1 | $20/mo | No |
Llama 3 and Phi-3 start at $0.05-$0.14 per 1M input tokens. GPT-4o and Claude Sonnet start at $2.50-$3.00. For non-critical workloads, the cost difference is difficult to justify.
Across all providers, output pricing consistently runs higher than input. Generation is computationally expensive. Prompt caching and shorter outputs have a significant impact on total inference cost.
ChatGPT Plus, Claude Pro, Gemini Advanced, and Grok all price their primary consumer tier at $19.99-$20/month. Premium tiers (o1 Pro, Claude Max) run $100-$200/month for heavy power users.
Most API providers offer a free tier with rate-limited access. This covers development, evaluation, and low-volume production workloads without any billing commitment.
Feature benchmarks, context windows, and full pricing breakdowns for every model.
Token pricing sourced from vendor API documentation and pricing pages, verified monthly. Full methodology.