by NVIDIA
Input Price
Free/M tokens
Output Price
Free/M tokens
Performance Profile
Strong mid-tier performance balancing capability and cost
7B parameter architecture for deep reasoning
vs similar-tier models
| Model | Input | Output | Context | Avg Score |
|---|---|---|---|---|
PersonaPlex 7B v1Current NVIDIA | Free | Free | N/A | 0.0 |
o3-mini OpenAI | $1.10 | $4.40 | 200K | 86.3 |
DeepSeek-R1 DeepSeek | $0.55 | $2.19 | 128K | 87.0 |
Summarize an email
<$0.001~300 word email → short summary
400 in · 100 out
Analyze a 1,000-word article
<$0.001Blog post or news article → detailed analysis
1,333 in · 500 out
Chatbot conversation (10 turns)
<$0.001Full customer support interaction
4,000 in · 2,000 out
Summarize a 50-page report
<$0.001Legal contract or research paper → key points
37,500 in · 2,000 out
Review a 5,000-line codebase
<$0.001Full code review with suggestions
25,000 in · 3,000 out
Process a full novel
<$0.001~90,000 words → detailed summary & analysis
120,000 in · 5,000 out
Email summaries
$0.00/mo
$0.00/day
Chat conversations
$0.00/mo
$0.00/day
Document analysis
$0.00/mo
$0.00/day
No ratings yet. Be the first to rate this model!
Sign in to rate this model and share your experience.
Sign in to leave a comment and join the discussion.
NVIDIA
NVIDIA's optimized Llama 3.1 variant with custom reward model training.
Input
$0.18/M
Output
$0.18/M
Context
128K
NVIDIA
NVIDIA's large open-source model trained for synthetic data generation.
Input
$1.20/M
Output
$1.20/M
Context
4K
NVIDIA
Hybrid Mamba-Transformer MoE with 4x higher throughput than predecessor. Open weights and training data.
Input
$0.04/M
Output
$0.08/M
Context
1.0M
OpenAI
OpenAI's efficient reasoning model, optimized for speed while maintaining strong analytical capabilities.
Input
$1.10/M
Output
$4.40/M
Context
200K
DeepSeek
DeepSeek's reasoning model with transparent chain-of-thought. Open-source and highly competitive.
Input
$0.55/M
Output
$2.19/M
Context
128K
Anthropic
Anthropic's best balance of intelligence and speed. Excellent for production workloads.
Input
$3.00/M
Output
$15.00/M
Context
200K