Explore 114+ AI models from 49 providers. Filter by capability, tier, and pricing to find the right model.
114 results
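The capability/tier/pricing filtering described above can be sketched over plain records; a minimal illustration, assuming a hypothetical `Model` dataclass with two records transcribed from entries in this listing (the field names are our own, not from any real API):

```python
from dataclasses import dataclass

@dataclass
class Model:
    provider: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens
    context: int         # context window, in tokens

# Two illustrative records transcribed from entries in this listing.
catalog = [
    Model("DeepSeek", 0.27, 1.10, 128_000),
    Model("Anthropic", 3.00, 15.00, 200_000),
]

# Keep only models under $1/M input with at least 128K context.
budget = [m for m in catalog if m.input_per_m <= 1.00 and m.context >= 128_000]
```

The same comprehension extends to any other column shown on the cards (output price, context size, provider).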
DeepSeek
DeepSeek's open-source MoE model rivaling frontier models at a fraction of the cost.
Input
$0.27/M
Output
$1.10/M
Context
128K
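The per-million rates on each card translate directly into request cost; a minimal sketch using the rates listed for this entry ($0.27/M input, $1.10/M output):

```python
def request_cost_usd(input_tokens: int, output_tokens: int,
                     in_per_m: float, out_per_m: float) -> float:
    """Cost of a single request given per-million-token rates."""
    return input_tokens / 1e6 * in_per_m + output_tokens / 1e6 * out_per_m

# 8K prompt tokens plus 2K completion tokens at the rates shown above.
cost = request_cost_usd(8_000, 2_000, 0.27, 1.10)  # ≈ $0.0044
```

Note that output tokens are typically several times more expensive than input tokens, so completion length often dominates the bill even when prompts are much longer.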
DeepSeek
DeepSeek's reasoning model with transparent chain-of-thought. Open-source and highly competitive.
Input
$0.55/M
Output
$2.19/M
Context
128K
Meta
Meta's latest open-source MoE model with 17B active parameters and industry-leading 10M token context.
Input
$0.15/M
Output
$0.60/M
Context
10.5M
Anthropic
Anthropic's best balance of intelligence and speed. Excellent for production workloads.
Input
$3.00/M
Output
$15.00/M
Context
200K
Google
Google's frontier-class model at Flash-level latency and cost. 90.4% on GPQA Diamond, 78% on SWE-bench, 1M context window.
Input
$0.50/M
Output
$3.00/M
Context
1.0M
Meta
Meta's open-source model matching GPT-4 class performance at 70B parameters.
Input
$0.18/M
Output
$0.18/M
Context
128K
Google
Google's fastest multimodal model with native tool use and advanced agentic capabilities.
Input
$0.10/M
Output
$0.40/M
Context
1.0M
Google
Google's fast and cost-efficient thinking model with strong reasoning capabilities.
Input
$0.15/M
Output
$0.60/M
Context
1.0M
OpenAI
OpenAI's efficient reasoning model, optimized for speed while maintaining strong analytical capabilities.
Input
$1.10/M
Output
$4.40/M
Context
200K
Meta
Meta's strong mid-range open-source model, predecessor to 3.3 with broad community support.
Input
$0.18/M
Output
$0.18/M
Context
128K
Meta
Meta's largest code-focused open-source model. Specialized for code generation and understanding.
Input
$0.18/M
Output
$0.18/M
Context
16K
Mistral AI
Mistral's first code-focused model with 32K context. Supports 80+ programming languages.
Input
$0.30/M
Output
$0.90/M
Context
32K
Mistral AI
Mistral's flagship multimodal model. Built on Mistral Large with vision capabilities.
Input
$2.00/M
Output
$6.00/M
Context
128K
Mistral AI
Mistral's large open-source MoE model with 176B total params. Strong coding and reasoning.
Input
$0.65/M
Output
$0.65/M
Context
66K
DeepSeek
DeepSeek's open-source code-focused MoE model. Competitive with GPT-4 Turbo on coding.
Input
$0.14/M
Output
$0.28/M
Context
128K
DeepSeek
DeepSeek's unified model merging the general and coder capabilities of V2.
Input
$0.14/M
Output
$0.28/M
Context
128K
DeepSeek
R1 reasoning capabilities distilled into a compact Qwen-based 32B model.
Input
$0.12/M
Output
$0.18/M
Context
128K
Cohere
Cohere's open-weight model optimized for RAG and tool use. Strong multilingual support.
Input
$0.15/M
Output
$0.60/M
Context
128K
Cohere
Cohere's open-source multilingual model covering 23 languages with strong performance.
Input
$0.50/M
Output
$1.50/M
Context
128K
Alibaba/Qwen
Alibaba's flagship open-source model. Competitive with Llama 3.1 405B at a fraction of the size.
Input
$0.30/M
Output
$0.30/M
Context
128K
Alibaba/Qwen
Strong mid-range open-source model from Alibaba with broad capabilities.
Input
$0.08/M
Output
$0.08/M
Context
128K
Alibaba/Qwen
Alibaba's open-source coding specialist. Matches GPT-4o on code benchmarks.
Input
$0.08/M
Output
$0.08/M
Context
128K
Alibaba/Qwen
Alibaba's open-source vision-language model with video understanding capabilities.
Input
$0.40/M
Output
$0.40/M
Context
32K
Alibaba/Qwen
Alibaba's open-source reasoning model with transparent chain-of-thought. Competitive with o1-mini.
Input
$0.10/M
Output
$0.30/M
Context
32K
Microsoft
Microsoft's open-source MoE model with 42B total params and only 6.6B active.
Input
$0.06/M
Output
$0.06/M
Context
128K
Microsoft
Microsoft's instruction-tuned MoE model based on Mixtral. Strong on complex reasoning tasks.
Input
$0.65/M
Output
$0.65/M
Context
66K
NVIDIA
NVIDIA's optimized Llama 3.1 variant with custom reward model training.
Input
$0.18/M
Output
$0.18/M
Context
128K
AI21 Labs
AI21's hybrid SSM-Transformer model built on the novel Mamba architecture, with 256K context.
Input
$2.00/M
Output
$8.00/M
Context
256K
TII
TII's largest open-source model. One of the first truly open 180B parameter models.
Input
$0.80/M
Output
$0.80/M
Context
2K
01.AI
01.AI's fast inference model with strong performance across benchmarks.
Input
$0.30/M
Output
$0.30/M
Context
16K
01.AI
01.AI's open-source 34B model with strong bilingual (English/Chinese) capabilities.
Input
$0.10/M
Output
$0.10/M
Context
4K
Shanghai AI Lab
Open-source model with 1M context from Shanghai AI Lab. Strong coding and math skills.
Input
$0.06/M
Output
$0.06/M
Context
1.0M
Shanghai AI Lab
Open-source vision-language model with strong image understanding capabilities.
Input
$0.08/M
Output
$0.08/M
Context
8K
Snowflake
Snowflake's open-source enterprise MoE model optimized for SQL and business tasks.
Input
$0.30/M
Output
$0.30/M
Context
4K
Databricks
Databricks' open-source MoE model with strong code and reasoning capabilities.
Input
$0.75/M
Output
$0.75/M
Context
32K
Zhipu AI
Zhipu AI's flagship model with strong Chinese and English bilingual capabilities.
Input
$1.00/M
Output
$3.00/M
Context
128K
OpenAI
OpenAI's cost-efficient reasoning model with multimodal input, strong math and coding performance at a fraction of o3 pricing.
Input
$1.10/M
Output
$4.40/M
Context
200K
Mistral AI
Mistral's mid-tier model offering 90% of Claude Sonnet quality at significantly lower cost.
Input
$0.40/M
Output
$2.00/M
Context
131K
Mistral AI
Mistral's specialized code model supporting 80+ languages with 256K context and fill-in-the-middle capability.
Input
$0.30/M
Output
$0.90/M
Context
256K
xAI
xAI's efficient reasoning model with fast inference and competitive performance at budget pricing.
Input
$0.30/M
Output
$0.50/M
Context
131K
Microsoft
Chain-of-thought reasoning variant of Phi-4, competitive with much larger models on math and logic tasks.
Input
$0.04/M
Output
$0.04/M
Context
32K
DeepSeek
R1's reasoning capability distilled into a Llama 3.1 70B architecture for efficient deployment.
Input
$0.18/M
Output
$0.18/M
Context
128K
Cohere
Cohere's 111B parameter model supporting 23 languages with enterprise tool use and 256K context.
Input
$2.50/M
Output
$10.00/M
Context
256K
Alibaba/Qwen
Alibaba's dense 32B model with dual thinking/non-thinking modes and strong reasoning performance.
Input
$0.08/M
Output
$0.20/M
Context
131K
Allen AI
Fully open model with all components public: data, code, weights, and checkpoints. Available in Instruct, Think, and RL Zero variants.
Input
$0.25/M
Output
$0.75/M
Context
128K
Zhipu AI
Vision-language MoE model with superior performance at lower inference cost.
Input
$0.15/M
Output
$0.30/M
Context
128K
Zhipu AI
Open-source video generation model creating 6-second clips at 720x480. Supports LoRA fine-tuning.
Input
$0.05/M
Output
$0.05/M
BAAI
State-of-the-art multimodal embedding model for visual search applications.
Input
$0.02/M
Output
$0.02/M
Context
8K
Pika Labs
Pika Labs' video generation model focused on ultra-realistic output with enhanced physics simulation. Pika 2.5 handles complex material interactions such as fluid dynamics, cloth draping, and particle effects with high fidelity. Its intuitive prompt interface and style controls make it accessible for creators seeking photorealistic short-form video content.
Input
$2.00/M
Output
$50.00/M
Luma AI
Luma AI's video generation model delivering native 1080p output at 4x faster inference speeds than previous versions, with optional 4K upscaling. Ray3.14 specializes in photorealistic 3D-aware video synthesis with strong spatial understanding, making it particularly effective for product visualization, architectural walkthroughs, and immersive content creation.
Input
$2.00/M
Output
$45.00/M
MiniMax
MiniMax's Hailuo 2.3 video model combines photorealistic rendering with versatile style support including anime, watercolor, and cinematic looks. It features advanced motion control, accurate lip-sync for dialogue scenes, and sophisticated lighting effects that adapt dynamically to scene content and camera movement.
Input
$2.00/M
Output
$50.00/M
Lightricks
Lightricks' open-source video generation model capable of producing native 4K video at 50 frames per second with clips up to 20 seconds in length. LTX-2 includes native audio synthesis and offers full model weights under a permissive license, making it a leading choice for researchers and developers building custom video generation pipelines.
Input
Free
Output
Free
Alibaba/Qwen
Alibaba's open-source video generation model that achieved the number one ranking on the VBench video quality benchmark upon release. With 14 billion parameters, Wan 2.1 demonstrates exceptional prompt adherence, temporal consistency, and visual quality across diverse content types, establishing a new baseline for open-weight video synthesis models.
Input
Free
Output
Free
Alibaba/Qwen
The successor to Wan 2.1, this open-source model introduces a Mixture-of-Experts flow-matching architecture with approximately 27 billion total parameters and 14 billion active during inference. Wan 2.2 delivers significantly improved motion quality, fine-grained detail, and extended generation lengths while maintaining the accessibility of fully open weights.
Input
Free
Output
Free
Tencent
Tencent's open-source video generation model with 8.3 billion parameters, featuring a novel Spatial-Temporal Self-Attention (SSTA) mechanism for improved temporal coherence. HunyuanVideo 1.5 supports diverse aspect ratios, variable frame rates, and extended clip durations, making it a versatile foundation model for the open-source video generation community.
Input
Free
Output
Free
Stability AI
Stability AI's specialized model for 4D novel-view video synthesis, generating temporally consistent multi-angle video from a single input clip or image. Stable Video 4D 2.0 enables creators to produce orbiting camera paths, bullet-time effects, and 3D-aware video transformations that maintain geometric and photometric coherence throughout the sequence.
Input
$2.00/M
Output
$40.00/M
Google
A multimodal extension of Google's Gemini 2.5 Flash model that adds native image generation and editing capabilities alongside text understanding. This model enables conversational image creation, iterative visual refinement, and combined text-image output within a single unified interface, making it particularly effective for design iteration and creative brainstorming workflows.
Input
$0.15/M
Output
$30.00/M
Black Forest Labs
The open-weights development version of FLUX.2 with the same 32 billion parameter architecture as the Pro variant, released for non-commercial research and experimentation. FLUX.2 Dev provides researchers full access to model weights for fine-tuning, distillation, and architectural exploration while delivering near-Pro-level quality for academic and personal projects.
Input
Free
Output
Free
Adobe
Adobe's fourth-generation Firefly image model offering improved quality, faster generation, and enhanced creative controls compared to its predecessors. Firefly Image 4 provides robust structure references, style transfer, and generative fill capabilities, all trained on Adobe's commercially licensed dataset to ensure IP safety for enterprise and professional use.
Input
$3.00/M
Output
$25.00/M
Stability AI
Stability AI's largest open-source image generation model built on the Multimodal Diffusion Transformer (MMDiT) architecture. SD 3.5 Large delivers high-quality results across photorealistic and artistic styles with strong prompt adherence, accurate text rendering, and diverse composition capabilities, available under an open license for both research and commercial use.
Input
$0.50/M
Output
$6.50/M
Ideogram
Ideogram's third-generation model combining exceptional photorealism with industry-leading text rendering accuracy within generated images. Ideogram 3.0 handles complex typography, logos, signs, and handwritten text with remarkable fidelity, making it the preferred choice for design professionals working on brand assets, marketing materials, and content requiring reliable in-image text.
Input
$2.00/M
Output
$20.00/M
NVIDIA
Input
Free
Output
Free
Recraft
Recraft's flagship image generation model that achieved the number one ranking on the HuggingFace text-to-image leaderboard, with native support for both raster and vector output formats. Recraft V3 excels at brand-consistent design, offering precise color palette control, style locking, and batch generation capabilities that make it uniquely suited for professional design systems.
Input
$2.00/M
Output
$20.00/M
Microsoft
Microsoft AI's first in-house image generation model, designed for integration across Microsoft's product ecosystem including Designer, Copilot, and Bing Image Creator. MAI-Image-1 focuses on safety, controllability, and consistent quality, with built-in content filtering and provenance metadata for responsible enterprise deployment.
Input
$2.00/M
Output
$15.00/M
01.AI
Mid-size Yi model with enhanced inference speed for extended prompts.
Input
$0.10/M
Output
$0.20/M
Context
128K
01.AI
Vision-language Yi model for image understanding and visual question answering.
Input
$0.30/M
Output
$0.60/M
Context
16K
BigCode
Mid-size code model matching CodeLlama 13B quality at half the parameters.
Input
$0.07/M
Output
$0.14/M
Context
16K
Stability AI
Mid-size Stable Diffusion optimized for consumer GPUs and edge devices.
Input
$0.02/M
Output
$0.02/M
Black Forest Labs
Fast open-source text-to-image model with 4-step generation. Apache 2.0 licensed.
Input
$0.02/M
Output
$0.02/M
Meta
Safety classification model for detecting unsafe content in LLM inputs and outputs.
Input
$0.05/M
Output
$0.05/M
Context
128K
LG AI Research
Korean sovereign AI model using MoE with hybrid attention for reduced computation.
Input
$0.25/M
Output
$0.75/M
Context
128K
Upstage
Agentic reasoning-focused model matching larger rivals. Strong multilingual capabilities.
Input
$0.20/M
Output
$0.60/M
Context
128K
BAAI
Most popular open embedding model. Multi-functionality, multi-linguality, multi-granularity in one model.
Input
$0.01/M
Output
$0.01/M
Context
8K
OpenAI
Gold standard speech recognition model supporting 99+ languages. 1.55B parameter encoder-decoder architecture.
Input
$0.0060/M
Output
$0.0060/M
Anthropic
High-intelligence Sonnet model with 1M token context window. Strong balance of performance and cost.
Input
$3.00/M
Output
$15.00/M
Context
1.0M
Google
Open vision-language model for image captioning, visual QA, and OCR tasks. Built on Gemma 2 backbone.
Input
$0.30/M
Output
$0.60/M
Context
8K
Google
Mid-size PaliGemma for efficient vision-language tasks. Strong OCR and document understanding.
Input
$0.15/M
Output
$0.30/M
Context
8K
Alibaba/Qwen
Dense model with hybrid thinking/non-thinking modes. Seamless switching between complex reasoning and general dialogue.
Input
$0.20/M
Output
$0.60/M
Context
128K
Alibaba/Qwen
Compact vision-language model excelling at video and image analysis. Top small multimodal model on Hugging Face.
Input
$0.10/M
Output
$0.30/M
Context
128K
Alibaba/Qwen
Audio-language model for speech recognition, audio understanding, and music analysis.
Input
$0.10/M
Output
$0.30/M
Context
128K
DeepSeek
Vision-language model for image understanding, OCR, and visual reasoning tasks.
Input
$0.14/M
Output
$0.28/M
Context
128K
Mistral AI
First Mistral reasoning model with 50% AIME-24 improvement via scalable RL. Reasoning in 8+ languages.
Input
$2.00/M
Output
$6.00/M
Context
128K
Mistral AI
Open-source reasoning model built on Small 3.1 with SFT and RL training. Efficient multilingual reasoning.
Input
$0.20/M
Output
$0.60/M
Context
128K
Mistral AI
Coding-specialized model outperforming Qwen 3 Coder Flash despite its smaller size.
Input
$0.20/M
Output
$0.60/M
Context
128K
Mistral AI
Code model using the Mamba SSM architecture for linear-time inference and theoretically unlimited context.
Input
$0.10/M
Output
$0.30/M
Context
256K
Cohere
Multilingual model covering 23 languages for global enterprise deployment.
Input
$0.05/M
Output
$0.15/M
Context
128K
Cohere
State-of-the-art text embedding model for semantic search and RAG applications.
Input
$0.10/M
Output
$0.10/M
Context
8K
TII
Outperforms all models under 13B on HuggingFace leaderboard. Trained on 14T tokens with innovative 1.58-bit quantized variant.
Input
$0.10/M
Output
$0.30/M
Context
32K
TII
Versatile 7B model with 30 checkpoint variants including base, instruct, and quantized.
Input
$0.07/M
Output
$0.21/M
Context
32K
Nomic AI
First MoE embedding model. Trained on 1.6B pairs across ~100 languages with top-2 expert routing.
Input
$0.01/M
Output
$0.01/M
Context
8K
Jina AI
Universal multimodal embedding handling text, images, and documents in 30+ languages.
Input
$0.02/M
Output
$0.02/M
Context
8K
Amazon
Fast, cost-effective reasoning model with built-in code interpreter and web grounding.
Input
$0.80/M
Output
$2.40/M
Context
1.0M
Amazon
Speech-to-speech model for natural real-time conversations. Supports 7 languages.
Input
$0.50/M
Output
$0.50/M
Amazon
Image generation model with fine-grained control over composition, style, and content.
Input
$0.04/M
Output
$0.04/M
Reka AI
One of the few 21B models supporting fully interleaved multimodal inputs. Handles videos up to 5 minutes.
Input
$0.80/M
Output
$2.40/M
Context
128K
Genmo
High-performance open text-to-video model excelling in text consistency.
Input
$0.05/M
Output
$0.05/M
IBM
Enterprise-grade model with strong instruction following for business applications.
Input
$0.10/M
Output
$0.20/M
Context
128K
IBM
Updated Granite with enhanced coding and tool-use capabilities for enterprise automation.
Input
$0.10/M
Output
$0.20/M
Context
128K
OpenBMB
Efficient vision-language model rivaling GPT-4V quality at a fraction of the size.
Input
$0.10/M
Output
$0.20/M
Context
128K
Naver
Korean sovereign AI with omnimodal capabilities. Specialized for Korean language and culture.
Input
$1.00/M
Output
$3.00/M
Context
128K
Mistral AI
Mid-size Mistral model bridging the gap between 8B edge models and large frontier offerings.
Input
$0.15/M
Output
$0.45/M
Context
128K
Alibaba/Qwen
Compact math-specialized model with chain-of-thought reasoning for mathematical problem solving.
Input
$0.07/M
Output
$0.14/M
Context
128K
Microsoft
Lightweight multimodal model with vision capabilities for on-device and edge visual understanding.
Input
$0.05/M
Output
$0.10/M
Context
128K
Amazon
Amazon's video generation model producing high-quality short clips for advertising and social media.
Input
$0.04/M
Output
$0.04/M
Google
Experimental Gemini model with extended chain-of-thought reasoning. Transparent thinking process with strong performance on math and science.
Input
$0.15/M
Output
$0.60/M
Context
1.0M
Cohere
Cohere's open multimodal model for visual understanding across 23 languages. Strong image captioning and visual QA.
Input
$0.25/M
Output
$0.50/M
Context
128K
Google
Google's previous-gen flagship model with the longest context window in production.
Input
$1.25/M
Output
$5.00/M
Context
2.1M
Mistral AI
Mistral's flagship model with strong multilingual and code generation capabilities.
Input
$2.00/M
Output
$6.00/M
Context
128K
Cohere
Cohere's enterprise-grade model optimized for RAG, tool use, and business workflows.
Input
$2.50/M
Output
$10.00/M
Context
128K
Alibaba/Qwen
Alibaba's efficient code-focused MoE model. 80B total params, 3B active, Apache 2.0 licensed.
Input
$0.12/M
Output
$0.75/M
Context
256K
Anthropic
Upgraded Claude 3.5 Sonnet with major coding and tool-use improvements, plus computer use capability.
Input
$3.00/M
Output
$15.00/M
Context
200K
Google
Google's open-source multimodal model. Strong performance for its size with vision capabilities.
Input
$0.10/M
Output
$0.10/M
Context
128K
Google
Google's previous-gen open-source model with strong general capabilities.
Input
$0.07/M
Output
$0.07/M
Context
8K
Meta
Meta's largest multimodal Llama model with image understanding capabilities.
Input
$0.35/M
Output
$0.40/M
Context
128K