GPTCrunch

DeepSeek

Explore all 13 AI models from DeepSeek. Compare benchmarks, pricing, and capabilities across the full model lineup.

deepseek.com

Total Models: 13
Avg Benchmark: 79.3
Open Source: 13
Modalities: 3 (text, code, image)

All Models

13 models

DeepSeek V4

Provider: DeepSeek
Tier: frontier

DeepSeek's 1T-parameter, coding-focused model with a 1M+ token context window. It introduces three architectural innovations: Manifold-Constrained Hyper-Connections, Engram memory, and Sparse Attention.

Modalities: text, code
Input: $0.10/M tokens
Output: $0.40/M tokens
Context: 1.0M tokens

DeepSeek-V3.2

Provider: DeepSeek
Tier: frontier

Unified reasoning and non-reasoning model that merges DeepSeek-V3 and R1 capabilities into a single architecture.

Modalities: text, code
Input: $0.28/M tokens
Output: $0.42/M tokens
Context: 128K tokens

DeepSeek-Math V2

Provider: DeepSeek
Tier: frontier

Math-specialized model that achieves gold-level scores in math competitions. Based on the V3.2 architecture.

Modalities: text
Input: $0.27/M tokens
Output: $1.10/M tokens
Context: 128K tokens

DeepSeek-V3.1

Provider: DeepSeek
Tier: frontier

Hybrid model combining the strengths of V3 and R1. Improves reasoning using RL techniques from R1.

Modalities: text
Input: $0.27/M tokens
Output: $1.10/M tokens
Context: 128K tokens

DeepSeek-R1-Distill-Llama-70B

Provider: DeepSeek
Tier: mid

R1's reasoning capability distilled into a Llama 3.3 70B architecture for efficient deployment.

Modalities: text
Input: $0.18/M tokens
Output: $0.18/M tokens
Context: 128K tokens

DeepSeek-R1

Provider: DeepSeek
Tier: mid

DeepSeek's reasoning model with transparent chain-of-thought. Open-source and highly competitive.

Modalities: text
Input: $0.55/M tokens
Output: $2.19/M tokens
Context: 128K tokens

DeepSeek-R1-Distill-Qwen-32B

Provider: DeepSeek
Tier: mid

R1 reasoning capabilities distilled into a compact Qwen-based 32B model.

Modalities: text
Input: $0.12/M tokens
Output: $0.18/M tokens
Context: 128K tokens

DeepSeek-R1-Distill-Qwen-7B

Provider: DeepSeek
Tier: budget

R1 reasoning distilled into a compact Qwen-based 7B model. Exceptional at math and programming.

Modalities: text
Input: $0.07/M tokens
Output: $0.14/M tokens
Context: 128K tokens

DeepSeek-R1-Distill-Llama-8B

Provider: DeepSeek
Tier: budget

R1 reasoning distilled into a Llama 3.1 8B architecture. Strong reasoning at minimal compute cost.

Modalities: text
Input: $0.07/M tokens
Output: $0.14/M tokens
Context: 128K tokens

DeepSeek-V3

Provider: DeepSeek
Tier: mid

DeepSeek's open-source MoE model rivaling frontier models at a fraction of the cost.

Modalities: text, code
Input: $0.27/M tokens
Output: $1.10/M tokens
Context: 128K tokens

DeepSeek-VL2

Provider: DeepSeek
Tier: mid

Vision-language model for image understanding, OCR, and visual reasoning tasks.

Modalities: text, image
Input: $0.14/M tokens
Output: $0.28/M tokens
Context: 128K tokens

DeepSeek-V2.5

Provider: DeepSeek
Tier: mid

Merges V2's general and coder capabilities into a unified model.

Modalities: text, code
Input: $0.14/M tokens
Output: $0.28/M tokens
Context: 128K tokens

DeepSeek-Coder-V2

Provider: DeepSeek
Tier: mid

DeepSeek's open-source code-focused MoE model. Competitive with GPT-4 Turbo on coding.

Modalities: code
Input: $0.14/M tokens
Output: $0.28/M tokens
Context: 128K tokens
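
The per-million-token prices listed above translate into a per-request cost with simple arithmetic: multiply each token count by its price and divide by one million. The following is a minimal sketch of that calculation, not an official calculator. The `PRICES` table and the `estimate_cost` helper are hypothetical names introduced for illustration, the three models are an arbitrary sample from the listing, and the sketch assumes flat pricing with no caching discounts or provider-specific surcharges.

```python
# Prices in USD per million tokens (input, output), copied from the listing above.
# Only a sample of models is included; the dict and helper names are illustrative.
PRICES = {
    "DeepSeek V4": (0.10, 0.40),
    "DeepSeek-V3.2": (0.28, 0.42),
    "DeepSeek-R1": (0.55, 2.19),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming flat per-token pricing."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example workload: 200K input tokens and 50K output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 200_000, 50_000):.4f}")
```

For that example workload the listed prices give roughly $0.04 for DeepSeek V4, $0.077 for DeepSeek-V3.2, and $0.2195 for DeepSeek-R1, which shows how much a model's output price drives cost on generation-heavy requests.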
