All models
Save 91%
584 models across multiple vendors and groups
Input: $0.0675 / 1M Tokens
Output: $0.4050 / 1M Tokens
GPT-5.4 mini combines the strengths of GPT-5.4 in a faster, more efficient model designed for high-volume workloads.
Input: $0.0180 / 1M Tokens
Output: $0.1125 / 1M Tokens
GPT-5.4 Nano is the lightest and fastest version of GPT-5.4, designed specifically for tasks with extremely high demands on speed and cost efficiency.
Input: $0.2250 / 1M Tokens
Output: $1.3500 / 1M Tokens
GPT-5.4 is our state-of-the-art model for complex professional tasks.
Input: $2.7000 / 1M Tokens
Output: $16.2000 / 1M Tokens
GPT-5.4 pro uses more compute to think more deeply and deliver consistently better answers. It is available only through the Responses API, which enables multi-turn model interactions before responding to API requests and will support other advanced API features in the future.
$0.0149 / request
Nano Banana 2 offers high-quality image generation and conversational editing at an affordable price, with low latency.
Input: $0.1575 / 1M Tokens
Output: $1.2600 / 1M Tokens
GPT-5.3-chat-latest is a high-speed conversational model optimized by OpenAI. It gives more direct, natural responses, with less lecturing and fewer refusals, and is well suited to everyday light tasks.
Input: $0.2700 / 1M Tokens
Output: $1.3500 / 1M Tokens
Claude Sonnet 4.6 delivers cutting-edge intelligence at scale, specifically designed for coding, agent applications, and enterprise workflows.
Input: $0.1925 / 1M Tokens
Output: $1.5400 / 1M Tokens
GPT-5.3 Codex Spark (Research Preview) is a lightweight version of GPT-5.3 Codex and our first model specifically designed for “real-time coding” scenarios.
Input: $0.0000 / 1M Tokens
Output: $0.0000 / 1M Tokens
Claude Sonnet 4.6 delivers cutting-edge intelligence at scale, specifically designed for coding, agent applications, and enterprise workflows.
Input: $0.0525 / 1M Tokens
Output: $0.3150 / 1M Tokens
Our most cost-effective multimodal model delivers the fastest performance for high-frequency, lightweight tasks. Gemini 3.1 Flash-Lite is ideally suited for handling massive-scale agent workflows, straightforward data extraction tasks, and ultra-low-latency applications where budget and speed are primary constraints.
Input: $0.2000 / 1M Tokens
Output: $1.2000 / 1M Tokens
Gemini 3.1 is Google’s most advanced model family to date, built on cutting-edge reasoning capabilities. It is designed to turn any idea into reality by mastering agent workflows, autonomous coding, and complex multimodal tasks. Gemini-3.1-Pro-Preview is best suited for sophisticated tasks that require broad world knowledge and high-level cross-modal reasoning.
Input: $0.4500 / 1M Tokens
Output: $2.2500 / 1M Tokens
Claude Opus 4.6 is Anthropic’s latest flagship AI model, delivering significant advancements in professional task execution, long-context understanding, and multi-agent collaboration.
Input: $0.4500 / 1M Tokens
Output: $2.2500 / 1M Tokens
A professional-grade strategic AI brain, designed specifically for complex business decision-making and deep logical reasoning.
Input: $0.1575 / 1M Tokens
Output: $1.2600 / 1M Tokens
GPT-5.2 is the best model for coding and agentic tasks across industries.
Input: $0.1575 / 1M Tokens
Output: $1.2600 / 1M Tokens
GPT-5.2-chat is the latest version designed for natural, fluent conversations and dynamic interactive experiences.
Input: $0.1575 / 1M Tokens
Output: $1.2600 / 1M Tokens
GPT-5.2-chat-latest: The latest version designed for natural, fluent conversations and dynamic interactive experiences.
Input: $1.8900 / 1M Tokens
Output: $15.1200 / 1M Tokens
GPT-5.2 pro is available only through the Responses API, which supports multi-turn model interactions before responding to API requests and enables additional advanced API features. Because GPT-5.2 pro is designed to tackle challenging problems, some requests may take several minutes to complete.
Input: $0.3150 / 1M Tokens
Output: $1.2600 / 1M Tokens
MiniMax-M2.5 matches or surpasses state-of-the-art (SOTA) performance in productivity scenarios such as programming, tool calling, search, and office work.
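The per-token prices listed above are quoted per 1M tokens, so the cost of a single request is the token count divided by 1,000,000 and multiplied by the listed rate, summed over input and output. The following is a minimal sketch of that arithmetic. The function name `estimate_cost` and the token counts are hypothetical, chosen only for illustration; the rates are the GPT-5.4 mini prices from the listing, and the sketch assumes no extra fees, caching discounts, or rounding rules apply.

```python
from decimal import Decimal

# Listed prices are quoted per 1,000,000 tokens.
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price, output_price):
    """Estimate the USD cost of one request.

    Prices are USD per 1M tokens, passed as strings so Decimal keeps
    them exact. `estimate_cost` is a hypothetical helper, not part of
    any vendor SDK.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price) / TOKENS_PER_UNIT
    output_cost = Decimal(output_tokens) * Decimal(output_price) / TOKENS_PER_UNIT
    return input_cost + output_cost


# Example with the GPT-5.4 mini rates listed above
# (Input: $0.0675, Output: $0.4050 per 1M tokens) and
# made-up token counts.
cost = estimate_cost(200_000, 50_000, "0.0675", "0.4050")
print(f"Estimated cost: ${cost}")
```

Using `Decimal` rather than floats keeps small per-request amounts exact when they are summed over many requests. Entries priced per request, such as Nano Banana 2, do not fit this formula and would simply be multiplied by the number of requests.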