Nemotron-4 340B

Name: Nemotron-4 340B
Price: 1.2 USD
Author: NVIDIA

frontier

by NVIDIA· 2 years ago

NVIDIA's large open-source model trained for synthetic data generation.

Context Window

Max Output

TTFT

500ms

Speed

45 tok/s

text Open Source

Input Price

$1.20/M tokens

Output Price

$1.20/M tokens

Performance Profile

Why Choose Nemotron-4 340B

Frontier-tier performance at $1.20/M input tokens

Fully open source — self-host, fine-tune, and customize without restrictions

340B parameter architecture for deep reasoning

Consistently scores 80%+ across major benchmarks

Strengths & Limitations

Strengths

+Top-tier benchmark scores across categories
+Excellent math performance
+Open source — can self-host and fine-tune

Limitations

−Limited context window
−Text only — no image or audio support

Benchmark Results

MMLU78.7

HumanEval73.0

HellaSwag90.0

GSM8K92.0

Quick Comparison

vs similar-tier models

Model	Input	Output	Context	Avg Score
Nemotron-4 340BCurrent NVIDIA	$1.20	$1.20	4K	83.4
GPT-4o OpenAI	$2.50	$10.00	128K	81.1
Kimi K2.5 Moonshot AI	$0.45	$2.20	256K	92.3

Full Comparison

Pricing Calculator

How pricing works A token is roughly ¾ of a word. A 1,000-word article is about 1,333 tokens. You pay separately for input (what you send) and output (what the model replies).

Summarize an email

<$0.001

~300 word email → short summary

400 in · 100 out

Analyze a 1,000-word article

$0.0022

Blog post or news article → detailed analysis

1,333 in · 500 out

Chatbot conversation (10 turns)

$0.0072

Full customer support interaction

4,000 in · 2,000 out

Summarize a 50-page report

$0.047

Legal contract or research paper → key points

37,500 in · 2,000 out

Review a 5,000-line codebase

$0.034

Full code review with suggestions

25,000 in · 3,000 out

Process a full novel

$0.150

~90,000 words → detailed summary & analysis

120,000 in · 5,000 out

At scale: 1,000 requests/day

Email summaries

$18/mo

$0.60/day

Chat conversations

$216/mo

$7/day

Document analysis

$1422/mo

$47/day

Technical Specifications

ProviderNVIDIA

ArchitectureTransformer

Parameters340B

Context Window4K tokens

Max Output4K tokens

Modalitiestext

Open SourceYes

Release DateJune 14, 2024

Community Ratings

No ratings yet. Be the first to rate this model!

Rate This Model

Comments

0 comments

No comments yet. Be the first to share your thoughts!

Similar Frontier Models

GPT-4o

OpenAI

frontier

OpenAI's most advanced multimodal model. Excels at text, vision, and audio tasks with fast response times.

textimageaudio

Input

$2.50/M

Output

$10.00/M

Context

128K

Kimi K2.5

Moonshot AI

frontier

Moonshot AI's frontier multimodal MoE model with 1T total parameters (32B active). Tops SWE-bench and AIME 2025 benchmarks.

textimagecode

Input

$0.45/M

Output

$2.20/M

Context

256K

Gemini 2.5 Pro

Google

frontier

Google's most capable thinking model with breakthrough performance on reasoning and coding.

textimageaudiovideocode

Input

$1.25/M

Output

$10.00/M

Context

1.0M

Compare Nemotron-4 340B with other models

See how it stacks up against the competition

Nemotron-4 340B

Why Choose Nemotron-4 340B

Strengths & Limitations

Strengths

Limitations

Benchmark Results

Quick Comparison

Quick Comparison

Pricing Calculator

At scale: 1,000 requests/day

Technical Specifications

Community Ratings

Rate This Model

Comments

More from NVIDIA

Nemotron 70B

PersonaPlex 7B v1

Nemotron 3 Nano

Similar Frontier Models

GPT-4o

Kimi K2.5

Gemini 2.5 Pro

Compare Nemotron-4 340B with other models