Compare AI Models, Make Better Decisions
Benchmarks, pricing, and capabilities for every major AI model — all in one place. Stop guessing which model fits your use case.
200+
AI Models Tracked
13+
Benchmarks Compared
49+
Providers Covered
156+
Open Source Models
Gemini 3.1 Pro
Google
Google's most capable model. 94.3% on GPQA Diamond, 80.6% on SWE-bench, 77.1% on ARC-AGI-2. #1 on 12 of 18 tracked benchmarks.
Input
$2.00/M
Output
$12.00/M
Context
1.0M
Claude Sonnet 4.6
Anthropic
Matches Opus 4.6 on most benchmarks at one-fifth the cost: 79.6% on SWE-bench, a 1M-token context window, computer use, and design capabilities.
Input
$3.00/M
Output
$15.00/M
Context
1.0M
DeepSeek V4
DeepSeek
DeepSeek's 1T-parameter, coding-focused model with 1M+ context. Three architectural innovations: Manifold-Constrained Hyper-Connections, Engram memory, and Sparse Attention.
Input
$0.10/M
Output
$0.40/M
Context
1.0M
Grok 4.20
xAI
xAI's four-agent parallel-collaboration system, with a rapid-learning architecture and medical-document analysis. Beta release.
Input
$3.00/M
Output
$15.00/M
Context
131K
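To make the rates above concrete, here is a minimal Python sketch that prices a single request against each featured model's listed input/output rates. The rates come from the cards above; the 2,000-token prompt and 500-token reply are illustrative assumptions, not measurements.

```python
# Price one request against each featured model's listed per-million-token rates.
# Rates come from the cards above; token counts are illustrative assumptions.
PRICES = {  # model: (input $/M tokens, output $/M tokens)
    "Gemini 3.1 Pro": (2.00, 12.00),
    "Claude Sonnet 4.6": (3.00, 15.00),
    "DeepSeek V4": (0.10, 0.40),
    "Grok 4.20": (3.00, 15.00),
}

def request_cost(input_tokens: int, output_tokens: int,
                 in_price: float, out_price: float) -> float:
    """Dollar cost of one request at per-million-token prices."""
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000

# Example: a 2,000-token prompt that draws a 500-token reply.
for model, (in_p, out_p) in PRICES.items():
    print(f"{model}: ${request_cost(2_000, 500, in_p, out_p):.4f}")
```

At those assumed sizes the spread is roughly 25x: about $0.0100 per request for Gemini 3.1 Pro versus $0.0004 for DeepSeek V4.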
Key Insights
Live rankings, pricing data, and performance metrics updated continuously
Tiny Aya
Cohere
Cohere's compact multilingual model supporting 70+ languages. Runs on consumer devices, including phones. Outperforms Gemma3-4B in 46 of 61 evaluated languages.
Input
$0.01/M
Output
$0.01/M
Context
32K
Qwen3.5 397B
Alibaba/Qwen
Alibaba's open-weight hybrid MoE model with 512 experts and 17B active parameters. Natively multimodal, with support for 201 languages. Top scores on GPQA and SWE-bench.
Input
$0.15/M
Output
$1.00/M
Context
256K
MiniMax M2.5
MiniMax
Achieves 80.2% on SWE-bench Verified, matching Opus 4.6 at 1/20th the cost. Ranks first on Multi-SWE-Bench at 51.3%.
Input
$0.25/M
Output
$0.75/M
Context
128K
Data Privacy
Keep data on-premise
Customizable
Fine-tune on your data
No Lock-in
Switch providers freely
Cost Control
Predictable self-host costs
Everything You Need to Choose
Comprehensive tools for evaluating, comparing, and selecting the right AI model
Benchmark Analysis
13+ benchmarks per model with interactive radar charts, category breakdowns, and head-to-head scoring.
Pricing & Cost Calculator
Real-time pricing comparison across every model. Built-in calculators for estimating daily and monthly costs (see the sketch after this section).
Full Model Profiles
Detailed pages for each model with specs, use cases, API examples, benchmark breakdowns, and related models.
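The cost calculators reduce to simple arithmetic over per-million-token rates. A minimal Python sketch of that estimate follows; the function name, the 10,000-requests-per-day workload, and the average token counts are assumptions for illustration, while the rates are taken from the Claude Sonnet 4.6 card above.

```python
def monthly_cost(requests_per_day: int, avg_input_tokens: int, avg_output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float,
                 days: int = 30) -> float:
    """Estimated monthly spend, given per-million-token prices."""
    daily = requests_per_day * (
        avg_input_tokens * input_price_per_m
        + avg_output_tokens * output_price_per_m
    ) / 1_000_000
    return daily * days

# Example: 10,000 requests/day, averaging 1,500 input + 400 output tokens each,
# at Claude Sonnet 4.6's listed $3.00/M input and $15.00/M output rates.
print(f"${monthly_cost(10_000, 1_500, 400, 3.00, 15.00):,.2f}/month")  # $3,150.00/month
```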
Comparing Token Costs: What Does AI Actually Cost to Use?
A practical breakdown of what tokens mean in real terms, from a single email to processing an entire codebase (a back-of-envelope version appears after these posts).
Gemini 3.1 Pro: Google Claims #1 on 12 of 18 Benchmarks
Google's Gemini 3.1 Pro achieves 94.3% on GPQA Diamond and 77.1% on ARC-AGI-2, more than doubling its predecessor's reasoning score.
SWE-bench Leaderboard: February 2026 Rankings
The latest SWE-bench Verified scores show Kimi K2.5 and Qwen3.5 tied near the top. Here is the full leaderboard breakdown.
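As a back-of-envelope taste of that token-cost breakdown: assuming a short email runs on the order of 500 tokens and a mid-sized codebase around 2M tokens (rough rules of thumb, not measured figures), DeepSeek V4's listed $0.10/M input rate prices them as follows.

```python
# Rough input-cost intuition at DeepSeek V4's listed $0.10/M input rate.
# Token counts are rule-of-thumb assumptions, not measurements.
INPUT_PRICE_PER_M = 0.10  # dollars per million input tokens

for label, tokens in [("one short email", 500), ("a mid-sized codebase", 2_000_000)]:
    print(f"{label}: ${tokens * INPUT_PRICE_PER_M / 1_000_000:.5f}")
```

Reading a single email costs a few thousandths of a cent; ingesting the whole codebase once costs about twenty cents at that rate.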
Stay Ahead of AI
Get weekly updates on new model releases, pricing changes, and benchmark results delivered to your inbox. Join thousands of developers and teams who rely on GPTCrunch.
No spam. Unsubscribe anytime.