All models

Save 91%

584 models across multiple vendors and groups

AI
Save 91%
OpenAI
gpt-5.4-mini
OpenAI

Input: $0.0675 / 1M Tokens

Output: $0.4050 / 1M Tokens

GPT-5.4mini combines the strengths of GPT-5.4 into a faster, more efficient model, specifically designed for high-load workloads.

Volume billingDialogueImage recognitionThinking+1
Save 91%
OpenAI
gpt-5.4-mini-2026-03-17
OpenAI

Input: $0.0675 / 1M Tokens

Output: $0.4050 / 1M Tokens

GPT-5.4mini combines the strengths of GPT-5.4 into a faster, more efficient model, specifically designed for high-load workloads.

Volume billingDialogueImage recognitionThinking+1
Save 91%
OpenAI
gpt-5.4-nano
OpenAI

Input: $0.0180 / 1M Tokens

Output: $0.1125 / 1M Tokens

GPT-5.4 Nano is the lightest and fastest version of GPT-5.4, designed specifically for tasks with extremely high demands on speed and cost efficiency.

Volume billingDialogueImage recognitionThinking+1
Save 91%
OpenAI
gpt-5.4-nano-2026-03-17
OpenAI

Input: $0.0180 / 1M Tokens

Output: $0.1125 / 1M Tokens

GPT-5.4 Nano is the lightest and fastest version of GPT-5.4, designed specifically for tasks with extremely high demands on speed and cost efficiency.

Volume billingDialogueImage recognitionThinking+1
Save 91%
OpenAI
gpt-5.4
OpenAI

Input: $0.2250 / 1M Tokens

Output: $1.3500 / 1M Tokens

GPT-5.4 is our state-of-the-art model for complex professional tasks.

Volume billingDialogueImage recognitionTool
Save 91%
OpenAI
gpt-5.4-pro
OpenAI

Input: $2.7000 / 1M Tokens

Output: $16.2000 / 1M Tokens

GPT-5.4pro leverages greater computational resources to think more deeply and deliver consistently better answers. It is accessible only via the Response API, enabling multi-turn model interactions before responding to API requests, as well as supporting other advanced API features in the future.

Volume billingDialogueImage recognitionTool
Save 91%
NanoBanana
gemini-3.1-flash-image-preview
Google

$0.0149 / 次

Nano Banana 2 offers high-quality image generation and conversational editing at an affordable price, with low latency.

Per-call billingPainting
Save 91%
OpenAI
gpt-5.3-chat-latest
OpenAI

Input: $0.1575 / 1M Tokens

Output: $1.2600 / 1M Tokens

GPT-5.3-chat-latest is a high-speed conversational model optimized by OpenAI, offering more direct and natural responses while reducing didacticism and refusal to answer. It is well-suited for everyday light tasks.

Volume billingDialogueImage recognitionTool
Save 91%
Claude
claude-sonnet-4-6
Anthropic

Input: $0.2700 / 1M Tokens

Output: $1.3500 / 1M Tokens

Claude Sonnet 4.6 delivers cutting-edge intelligence at scale, specifically designed for coding, agent applications, and enterprise workflows.

Volume billingDialogueImage recognitionTool
Save 89%
OpenAI
gpt-5.3-codex-spark
OpenAI Plus

Input: $0.1925 / 1M Tokens

Output: $1.5400 / 1M Tokens

GPT-5.3 Codex Spark Research Preview. As a lightweight version of GPT-5.3 Codex, it is our first model specifically designed for “real-time coding” scenarios.

Volume billingDialogueTool
Save 65%
Claude
claude-sonnet-4-6-thinking
Anthropic

Input: $0.0000 / 1M Tokens

Output: $0.0000 / 1M Tokens

Claude Sonnet 4.6 delivers cutting-edge intelligence at scale, specifically designed for coding, agent applications, and enterprise workflows.

Volume billingDialogueImage recognitionTool+1
Save 79%
Gemini
gemini-3.1-flash-lite-preview
Google

Input: $0.0525 / 1M Tokens

Output: $0.3150 / 1M Tokens

Our most cost-effective multimodal model delivers the fastest performance for high-frequency, lightweight tasks. Gemini 3.1 Flash-Lite is ideally suited for handling massive-scale agent workflows, straightforward data extraction tasks, and ultra-low-latency applications where budget and speed are primary constraints.

Volume billingDialogueImage recognition
Save 90%
Gemini
gemini-3.1-pro-preview
Google

Input: $0.2000 / 1M Tokens

Output: $1.2000 / 1M Tokens

Gemini 3.1 is Google’s most advanced model family to date, built on cutting-edge reasoning capabilities. It is designed to turn any idea into reality by mastering agent workflows, autonomous coding, and complex multimodal tasks. Gemini-3.1-Pro-Preview is best suited for sophisticated tasks that require broad world knowledge and high-level cross-modal reasoning.

Volume billingDialogueThinkingMultimodal
Save 91%
Claude
claude-opus-4-6
Anthropic

Input: $0.4500 / 1M Tokens

Output: $2.2500 / 1M Tokens

Claude Opus 4.6 is Anthropic’s latest flagship AI model, delivering significant advancements in professional task execution, long-context understanding, and multi-agent collaboration.

Volume billingDialogueImage recognitionTool
Save 91%
Claude
claude-opus-4-6-thinking
Anthropic

Input: $0.4500 / 1M Tokens

Output: $2.2500 / 1M Tokens

A professional-grade strategic AI brain, designed specifically for complex business decision-making and deep logical reasoning.

Volume billingDialogueThinkingTool
Save 91%
OpenAI
gpt-5.2
OpenAI

Input: $0.1575 / 1M Tokens

Output: $1.2600 / 1M Tokens

GPT-5.2 is the best model for coding and intelligent tasks across all industries.

Volume billingDialogueImage recognitionTool
Save 91%
OpenAI
gpt-5.2-chat
OpenAI

Input: $0.1575 / 1M Tokens

Output: $1.2600 / 1M Tokens

GPT-5.2-chat is the latest version designed for natural, fluent conversations and dynamic interactive experiences.

Volume billingDialogueImage recognition
Save 91%
OpenAI
gpt-5.2-chat-latest
OpenAI

Input: $0.1575 / 1M Tokens

Output: $1.2600 / 1M Tokens

GPT-5.2-chat-latest: The latest version designed for natural, fluent conversations and dynamic interactive experiences.

Volume billingDialogueImage recognition
Save 91%
OpenAI
gpt-5.2-pro
OpenAI

Input: $1.8900 / 1M Tokens

Output: $15.1200 / 1M Tokens

GPT-5.2 pro is available only through the Responses API to support multi-turn model interactions and to enable additional advanced API features before responding to API requests. Because GPT-5.2 pro is designed to tackle challenging problems, some requests may take several minutes to complete.

Volume billingDialogueThinkingImage recognition
Save 85%
Minimax
MiniMax-M2.5
Minimax

Input: $0.3150 / 1M Tokens

Output: $1.2600 / 1M Tokens

MiniMax-M2.5 has achieved or surpassed the state-of-the-art (SOTA) performance in productivity scenarios such as programming, tool invocation and search, and office work.

Volume billingDialogue
Showing 1 - 20 / 584