by OpenAI· 5 months ago
OpenAI's second-generation video synthesis model capable of producing cinematic-quality videos up to 60 seconds long with synchronized audio. Built on an advanced Diffusion Transformer (DiT) architecture, Sora 2 excels at complex scene composition, realistic physics simulation, and coherent multi-character narratives with natural dialogue and ambient sound.
Input Price
$5.00/M tokens
Output Price
$100.00/M tokens
Performance Profile
State-of-the-art video generation with cinematic quality and temporal consistency
Generate clips directly from text descriptions — no video editing skills required
vs similar-tier models
| Model | Input | Output | Context | Avg Score |
|---|---|---|---|---|
Sora 2Current OpenAI | $5.00 | $100.00 | N/A | 0.0 |
Kimi K2.5 Moonshot AI | $0.45 | $2.20 | 256K | 92.3 |
Gemini 2.5 Pro | $1.25 | $10.00 | 1.0M | 88.4 |
Generate a 5-second clip
$0.501Short animated clip from text prompt
200 in · 5,000 out
10-second social video
$1.00Instagram Reel or TikTok-style content
400 in · 10,000 out
Batch of 10 short clips
$5.01Multiple variations for A/B testing
2,000 in · 50,000 out
50 ad clips per campaign
$25.05Full video ad campaign production
10,000 in · 250,000 out
Short clips
$15030/mo
$501/day
Social videos
$30060/mo
$1002/day
Ad production
$75150/mo
$2505/day
No ratings yet. Be the first to rate this model!
Sign in to rate this model and share your experience.
Sign in to leave a comment and join the discussion.
OpenAI
OpenAI's most advanced multimodal model. Excels at text, vision, and audio tasks with fast response times.
Input
$2.50/M
Output
$10.00/M
Context
128K
OpenAI
OpenAI's reasoning model with chain-of-thought capabilities for complex problem solving.
Input
$15.00/M
Output
$60.00/M
Context
200K
OpenAI
OpenAI's efficient reasoning model, optimized for speed while maintaining strong analytical capabilities.
Input
$1.10/M
Output
$4.40/M
Context
200K
Moonshot AI
Moonshot AI's frontier multimodal MoE model with 1T total parameters (32B active). Tops SWE-bench and AIME 2025 benchmarks.
Input
$0.45/M
Output
$2.20/M
Context
256K
Google's most capable thinking model with breakthrough performance on reasoning and coding.
Input
$1.25/M
Output
$10.00/M
Context
1.0M
Anthropic
Anthropic's most powerful model. Top-tier performance on coding, analysis, and complex reasoning tasks.
Input
$15.00/M
Output
$75.00/M
Context
200K