by Mistral AI· 2 years ago
The original open-source MoE model that started the MoE trend. Fast and efficient.
Context Window
32K
Max Output
4K
TTFT
100ms
Speed
180 tok/s
Input Price
$0.24/M tokens
Output Price
$0.24/M tokens
Performance Profile
Budget-friendly at just $0.24/M input tokens
32K token context window for substantial input processing
Fully open source — self-host, fine-tune, and customize without restrictions
56B (12B active) parameter architecture for deep reasoning
vs similar-tier models
| Model | Input | Output | Context | Avg Score |
|---|---|---|---|---|
Mixtral 8x7BCurrent Mistral AI | $0.24 | $0.24 | 32K | 69.1 |
Claude Haiku 3.5 Anthropic | $0.80 | $4.00 | 200K | 77.0 |
GPT-4.1 Mini OpenAI | $0.40 | $1.60 | 1.0M | 76.5 |
Summarize an email
<$0.001~300 word email → short summary
400 in · 100 out
Analyze a 1,000-word article
<$0.001Blog post or news article → detailed analysis
1,333 in · 500 out
Chatbot conversation (10 turns)
$0.0014Full customer support interaction
4,000 in · 2,000 out
Summarize a 50-page report
$0.0095Legal contract or research paper → key points
37,500 in · 2,000 out
Review a 5,000-line codebase
$0.0067Full code review with suggestions
25,000 in · 3,000 out
Process a full novel
$0.030~90,000 words → detailed summary & analysis
120,000 in · 5,000 out
Email summaries
$4/mo
$0.12/day
Chat conversations
$43/mo
$1/day
Document analysis
$284/mo
$9/day
No ratings yet. Be the first to rate this model!
Sign in to rate this model and share your experience.
Sign in to leave a comment and join the discussion.
Mistral AI
Mistral's efficient model for everyday tasks. Fast and cost-effective.
Input
$0.10/M
Output
$0.30/M
Context
32K
Mistral AI
Mistral's first code-focused model with 32K context. Supports 80+ programming languages.
Input
$0.30/M
Output
$0.90/M
Context
32K
Mistral AI
Mistral's 12B open-source model co-developed with NVIDIA. Replaces Mistral 7B.
Input
$0.04/M
Output
$0.04/M
Context
128K
Anthropic
Anthropic's fastest and most affordable model. Great for high-volume, low-latency tasks.
Input
$0.80/M
Output
$4.00/M
Context
200K
OpenAI
A fast, affordable variant of GPT-4.1 for high-volume workloads.
Input
$0.40/M
Output
$1.60/M
Context
1.0M
OpenAI
OpenAI's fastest and cheapest model. Ideal for classification, autocomplete, and high-throughput tasks.
Input
$0.10/M
Output
$0.40/M
Context
1.0M