Name: Gemma 3 12B
Price: 0.05 USD per million tokens (input and output)
Author: Google

Why Choose Gemma 3 12B

Budget-friendly at just $0.05/M input tokens

128K token context window — handles lengthy documents with ease

Supports text + image — true multimodal capability

Open weights: self-host, fine-tune, and customize under Google's Gemma Terms of Use

Strengths & Limitations

Strengths

+ Solid benchmark performance (MMLU 79.0, GSM8K 84.5)
+ Large 128K-token context window for complex tasks
+ Very affordable pricing ($0.05/M tokens for both input and output)
+ Open weights: can self-host and fine-tune

Limitations

No significant limitations identified

Benchmark Results

MMLU: 79.0
HumanEval: 74.0
HellaSwag: 83.0
MATH: 55.0
GSM8K: 84.5

Quick Comparison

vs similar-tier models

Model	Provider	Input ($/M tokens)	Output ($/M tokens)	Context	Avg Score
Gemma 3 12B (current)	Google	$0.05	$0.05	128K	75.1
Claude Haiku 3.5	Anthropic	$0.80	$4.00	200K	77.0
Mistral Small	Mistral AI	$0.10	$0.30	32K	69.8
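
The 75.1 Avg Score listed for Gemma 3 12B matches the unweighted mean of the five benchmark results above. The page does not say how the column is computed, so the averaging method in this sketch is an assumption checked only against that one row, and the names `gemma_3_12b_scores` and `average_score` are illustrative.

```python
# Benchmark scores for Gemma 3 12B as listed under Benchmark Results.
gemma_3_12b_scores = {
    "MMLU": 79.0,
    "HumanEval": 74.0,
    "HellaSwag": 83.0,
    "MATH": 55.0,
    "GSM8K": 84.5,
}


def average_score(scores: dict[str, float]) -> float:
    """Unweighted mean rounded to one decimal (assumed method, not confirmed by the page)."""
    return round(sum(scores.values()) / len(scores), 1)


print(average_score(gemma_3_12b_scores))
```

The per-benchmark breakdown for the other two models is not given here, so their Avg Score values cannot be checked the same way.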


Pricing Calculator

How pricing works: A token is roughly ¾ of a word, so a 1,000-word article is about 1,333 tokens. You pay separately for input (what you send) and output (what the model replies).
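
The rule of thumb above can be sketched in a few lines of Python. This is a rough estimate only: the ¾ ratio comes from the page, real tokenizers vary by language and content, and the function names `estimate_tokens` and `request_cost` are hypothetical helpers, not part of any API.

```python
WORDS_PER_TOKEN = 0.75  # the page's rule of thumb: a token is roughly 3/4 of a word


def estimate_tokens(word_count: int) -> int:
    """Rough token estimate from a word count; actual tokenizer output will differ."""
    return round(word_count / WORDS_PER_TOKEN)


def request_cost(tokens_in: int, tokens_out: int,
                 price_in_per_m: float, price_out_per_m: float) -> float:
    """Cost in USD for one request, with input and output billed at separate per-million rates."""
    return (tokens_in * price_in_per_m + tokens_out * price_out_per_m) / 1_000_000


article_tokens = estimate_tokens(1_000)  # the 1,000-word article from the text
print(article_tokens)
print(request_cost(article_tokens, 200, 0.05, 0.05))
```

Keeping the two rates as separate parameters matters for models whose output price is higher than the input price, such as the others in the comparison table.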

Task	Description	Tokens	Est. cost
Describe a single image	Photo → detailed description	1,000 in · 200 out	<$0.001
Analyze a chart or diagram	Visual data → structured insights	2,000 in · 500 out	<$0.001
OCR a 10-page document	Scanned pages → structured text	15,000 in · 3,000 out	<$0.001
Batch process 100 images	Bulk image analysis pipeline	100,000 in · 20,000 out	$0.0060
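
As a check on the figures above, the following sketch recomputes each scenario from its token counts. It assumes the $0.05/M rate applies to both input and output, as the comparison table lists, and that the page shows "<$0.001" for any cost under a tenth of a cent; the `display` threshold and helper names are illustrative guesses at how the calculator formats its output.

```python
PRICE_PER_M = 0.05  # USD per million tokens, input and output alike (from the comparison table)

# (input tokens, output tokens) for each scenario listed on the page
SCENARIOS = {
    "Describe a single image": (1_000, 200),
    "Analyze a chart or diagram": (2_000, 500),
    "OCR a 10-page document": (15_000, 3_000),
    "Batch process 100 images": (100_000, 20_000),
}


def request_cost(tokens_in: int, tokens_out: int,
                 price_in: float = PRICE_PER_M, price_out: float = PRICE_PER_M) -> float:
    """Cost in USD for one request at per-million-token rates."""
    return (tokens_in * price_in + tokens_out * price_out) / 1_000_000


def display(cost: float) -> str:
    """Format a cost the way the page appears to: tiny amounts collapse to '<$0.001'."""
    return "<$0.001" if cost < 0.001 else f"${cost:.4f}"


for task, (tokens_in, tokens_out) in SCENARIOS.items():
    print(f"{task}: {display(request_cost(tokens_in, tokens_out))}")
```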

At scale: 1,000 requests/day

Workload	Per day	Per month
Image descriptions	$0.06	$2
Document OCR	$0.90	$27
Batch image analysis	$6	$180
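
The daily and monthly figures are consistent with multiplying each per-request cost by 1,000 requests and then by a 30-day month. The 30-day month is an assumption inferred from $0.90/day mapping to $27/mo, and the mapping of each workload to the token counts from the single-request scenarios is likewise inferred from the arithmetic rather than stated on the page.

```python
PRICE_PER_M = 0.05       # USD per million tokens, same rate for input and output per the page
REQUESTS_PER_DAY = 1_000
DAYS_PER_MONTH = 30      # assumption inferred from $0.90/day -> $27/mo

# Total tokens (input + output) per request, taken from the single-request scenarios.
WORKLOADS = {
    "Image descriptions": 1_000 + 200,
    "Document OCR": 15_000 + 3_000,
    "Batch image analysis": 100_000 + 20_000,  # each request is a 100-image batch
}


def projected_spend(tokens_per_request: int) -> tuple[float, float]:
    """Return (daily, monthly) spend in USD at the assumed request volume."""
    per_request = tokens_per_request * PRICE_PER_M / 1_000_000
    daily = per_request * REQUESTS_PER_DAY
    return daily, daily * DAYS_PER_MONTH


for name, tokens in WORKLOADS.items():
    daily, monthly = projected_spend(tokens)
    print(f"{name}: ${daily:.2f}/day, ${monthly:.0f}/mo")
```

Under these assumptions the image-description workload comes to about $1.80 per month, which the page rounds to $2.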

Technical Specifications

Provider: Google
Architecture: Transformer
Parameters: 12B
Context Window: 128K tokens
Max Output: 8K tokens
Modalities: text, image
Open Source: Yes
Release Date: March 12, 2025


More from Google

Gemini 2.5 Pro (Google, frontier tier)
Google's most capable thinking model with breakthrough performance on reasoning and coding.
Modalities: text, image, audio, video, code
Input: $1.25/M · Output: $10.00/M · Context: 1.0M

Gemini 2.0 Flash (Google, mid tier)
Google's fastest multimodal model with native tool use and advanced agentic capabilities.
Modalities: text, image, audio, video
Input: $0.10/M · Output: $0.40/M · Context: 1.0M

Gemini 2.5 Flash (Google, mid tier)
Google's fast and cost-efficient thinking model with strong reasoning capabilities.
Modalities: text, image, audio, video
Input: $0.15/M · Output: $0.60/M · Context: 1.0M

Similar Budget Models

Claude Haiku 3.5 (Anthropic, budget tier)
Anthropic's fastest and most affordable model. Great for high-volume, low-latency tasks.
Modalities: text, image
Input: $0.80/M · Output: $4.00/M · Context: 200K

Mistral Small (Mistral AI, budget tier)
Mistral's efficient model for everyday tasks. Fast and cost-effective.
Modalities: text
Input: $0.10/M · Output: $0.30/M · Context: 32K

GPT-4.1 Mini (OpenAI, budget tier)
A fast, affordable variant of GPT-4.1 for high-volume workloads.
Modalities: text, image
Input: $0.40/M · Output: $1.60/M · Context: 1.0M
