GPT-5.3-Codex

Name: GPT-5.3-Codex
Price: 2 USD
Author: OpenAI

frontier

by OpenAI· 5 months ago

OpenAI's most capable coding model combining Codex and GPT-5 training stacks. Agentic coding, research, and tool use with 77.3% on Terminal-Bench 2.0.

Context Window

400K

Max Output

128K

TTFT

600ms

Speed

55 tok/s

textcode

Input Price

$2.00/M tokens

Output Price

$16.00/M tokens

Performance Profile

Why Choose GPT-5.3-Codex

Frontier-tier performance at $2.00/M input tokens

400K token context window — handles lengthy documents with ease

Supports text + code — true multimodal capability

Consistently scores 80%+ across major benchmarks

Best Use Cases

Full-Stack Development

Build complete applications with agentic coding, research, and tool use capabilities.

Security Analysis

First model rated 'High' for cybersecurity — ideal for code security audits.

Complex Execution

77.3% on Terminal-Bench 2.0 — executes multi-step terminal workflows autonomously.

Strengths & Limitations

Strengths

+Top-tier benchmark scores across categories
+Excellent reasoning performance
+Excellent math performance
+Large context window for complex tasks

Limitations

−Closed source — API access only

Benchmark Results

GPQA82.0

HumanEval96.0

MATH94.0

SWE-bench72.0

GSM8K98.5

LiveCodeBench85.0

IFEval95.0

Quick Comparison

vs similar-tier models

Model	Input	Output	Context	Avg Score
GPT-5.3-CodexCurrent OpenAI	$2.00	$16.00	400K	88.9
Kimi K2.5 Moonshot AI	$0.45	$2.20	256K	92.3
Gemini 2.5 Pro Google	$1.25	$10.00	1.0M	88.4

Full Comparison

Pricing Calculator

How pricing works A token is roughly ¾ of a word. A 1,000-word article is about 1,333 tokens. You pay separately for input (what you send) and output (what the model replies).

Generate a function

$0.0058

Spec → implementation with tests

500 in · 300 out

Review a 2,000-line PR

$0.052

Full pull request code review

10,000 in · 2,000 out

Refactor a 5,000-line module

$0.130

Major refactoring with explanations

25,000 in · 5,000 out

Analyze a full codebase

$0.360

Architecture analysis + recommendations

100,000 in · 10,000 out

At scale: 1,000 requests/day

Code generation

$174/mo

$6/day

PR reviews

$1560/mo

$52/day

Codebase analysis

$6840/mo

$228/day

Technical Specifications

ProviderOpenAI

ArchitectureTransformer + Codex

Context Window400K tokens

Max Output128K tokens

Modalitiestext, code

Open SourceNo

Release DateFebruary 5, 2026

Community Ratings

No ratings yet. Be the first to rate this model!

Rate This Model

Comments

0 comments

No comments yet. Be the first to share your thoughts!

Similar Frontier Models

Kimi K2.5

Moonshot AI

frontier

Moonshot AI's frontier multimodal MoE model with 1T total parameters (32B active). Tops SWE-bench and AIME 2025 benchmarks.

textimagecode

Input

$0.45/M

Output

$2.20/M

Context

256K

Gemini 2.5 Pro

Google

frontier

Google's most capable thinking model with breakthrough performance on reasoning and coding.

textimageaudiovideocode

Input

$1.25/M

Output

$10.00/M

Context

1.0M

Claude Opus 4

Anthropic

frontier

Anthropic's most powerful model. Top-tier performance on coding, analysis, and complex reasoning tasks.

textimage

Input

$15.00/M

Output

$75.00/M

Context

200K

Compare GPT-5.3-Codex with other models

See how it stacks up against the competition

GPT-5.3-Codex

Why Choose GPT-5.3-Codex

Best Use Cases

Full-Stack Development

Security Analysis

Complex Execution

Strengths & Limitations

Strengths

Limitations

Benchmark Results

Quick Comparison

Quick Comparison

Pricing Calculator

At scale: 1,000 requests/day

Technical Specifications

Community Ratings

Rate This Model

Comments

More from OpenAI

GPT-4o

o1

o3-mini

Similar Frontier Models

Kimi K2.5

Gemini 2.5 Pro

Claude Opus 4

Compare GPT-5.3-Codex with other models