by Stability AI· 1 years ago
Mid-size Stable Diffusion optimized for consumer GPUs and edge devices.
Input Price
$0.02/M tokens
Output Price
$0.02/M tokens
Performance Profile
High-quality image generation balancing visual fidelity and speed
Precise control over composition, style, and content through natural language prompts
Open weights — run locally, fine-tune on your own data, no API rate limits
2.5B parameter architecture for high-fidelity generation
vs similar-tier models
| Model | Input | Output | Context | Avg Score |
|---|---|---|---|---|
SD 3.5 MediumCurrent Stability AI | $0.02 | $0.02 | N/A | 0.0 |
o3-mini OpenAI | $1.10 | $4.40 | 200K | 86.3 |
DeepSeek-R1 DeepSeek | $0.55 | $2.19 | 128K | 87.0 |
Generate a single image
<$0.001Text prompt → 1024×1024 image
100 in · 1,000 out
Batch of 10 product images
<$0.001E-commerce product shots in different styles
1,000 in · 10,000 out
50 social media images
$0.0011A week's worth of branded social content
5,000 in · 50,000 out
100 marketing variations
$0.0022A/B test visuals for ad campaigns
10,000 in · 100,000 out
Product images
$0.66/mo
$0.02/day
Social media posts
$1/mo
$0.04/day
Marketing campaigns
$3/mo
$0.11/day
No ratings yet. Be the first to rate this model!
Sign in to rate this model and share your experience.
Sign in to leave a comment and join the discussion.
Stability AI
Stability AI's open-source language model with multilingual support.
Input
$0.04/M
Output
$0.04/M
Context
4K
Stability AI
Stability AI's specialized model for 4D novel-view video synthesis, generating temporally consistent multi-angle video from a single input clip or image. Stable Video 4D 2.0 enables creators to produce orbiting camera paths, bullet-time effects, and 3D-aware video transformations that maintain geometric and photometric coherence throughout the sequence.
Input
$2.00/M
Output
$40.00/M
Stability AI
Stability AI's largest open-source image generation model built on the Multimodal Diffusion Transformer (MMDiT) architecture. SD 3.5 Large delivers high-quality results across photorealistic and artistic styles with strong prompt adherence, accurate text rendering, and diverse composition capabilities, available under an open license for both research and commercial use.
Input
$0.50/M
Output
$6.50/M
OpenAI
OpenAI's efficient reasoning model, optimized for speed while maintaining strong analytical capabilities.
Input
$1.10/M
Output
$4.40/M
Context
200K
DeepSeek
DeepSeek's reasoning model with transparent chain-of-thought. Open-source and highly competitive.
Input
$0.55/M
Output
$2.19/M
Context
128K
Anthropic
Anthropic's best balance of intelligence and speed. Excellent for production workloads.
Input
$3.00/M
Output
$15.00/M
Context
200K