Compare pricing and specifications for large language models from all major providers.
Alibaba's large language model with strong performance in reasoning, coding, and multilingual tasks.
Alibaba's flagship Qwen3 Mixture-of-Experts model with 235B total parameters (22B active). Features hybrid reasoning and supports 119 languages. (Note: Not publicly available at release).
Alibaba's Qwen3 Mixture-of-Experts model with 30B total parameters (3B active). Features hybrid reasoning and supports 119 languages. Apache 2.0 license.
Alibaba's largest dense model in the Qwen3 family with 32B parameters. Features hybrid reasoning and supports 119 languages. Apache 2.0 license.
Anthropic's most advanced model, significantly improved over Claude 3 Sonnet with enhanced reasoning, coding, and vision capabilities.
Anthropic's enhanced Claude model with improved reasoning capabilities and better performance on complex tasks, positioned between Claude 3.5 Sonnet and Claude 4.
Anthropic's most powerful model prior to 3.5 Sonnet, excelling at highly complex tasks and demonstrating near-human levels of comprehension and fluency.
A balanced model from Anthropic, offering a blend of intelligence and speed, ideal for enterprise workloads and scaled AI deployments.
Anthropic's fastest and most compact model, designed for near-instant responsiveness and high throughput tasks.
Anthropic's most powerful model, excelling in coding, advanced reasoning, and AI agent workflows. Handles complex, long-running tasks.
Anthropic's highly capable and versatile model, offering a strong balance of intelligence, speed, and cost-effectiveness for enterprise applications.
DeepSeek's latest model with strong performance across reasoning, coding, and general tasks at competitive pricing.
DeepSeek's enhanced model with improved reasoning capabilities, expanded context window, and even more competitive pricing.
Google's latest experimental model with breakthrough multimodal capabilities and enhanced reasoning at an extremely competitive price point.
Google's highly capable multimodal model with a breakthrough long context window of up to 2 million tokens. Excels at complex reasoning, problem-solving, and understanding long-form content.
Google's faster and lower-cost version of Gemini 1.5 Pro, optimized for high-volume, high-frequency tasks while retaining a large context window and multimodal capabilities.
Google's most advanced reasoning Gemini model, capable of solving complex problems. Supports text, code, image, audio, and video inputs. Features a 1M token context window (up to 2M in some versions).
Google's best model for price and performance (as of May 2025), featuring hybrid reasoning capabilities. Supports text, code, image, audio, and video inputs. 1M token context window.
Meta's latest 70B parameter model with improved performance and capabilities, offering state-of-the-art results for its size.
Meta's multimodal model combining text and vision capabilities with strong performance across various tasks.
A smaller, efficient multimodal model from Meta with vision capabilities, suitable for edge deployment and cost-sensitive applications.
Meta's largest and most capable Llama 3.1 model, designed for complex reasoning, coding, and nuanced instruction following.
A large instruction-tuned model from Meta's Llama 3.1 series, offering a strong balance of performance and efficiency for a wide range of tasks.
A highly efficient instruction-tuned model from Meta's Llama 3.1 series, suitable for fast, on-device, or edge applications.
Mistral AI's flagship model with enhanced reasoning, coding, and multilingual capabilities.
Mistral AI's cost-effective model for straightforward tasks, offering good performance and efficiency.
Mistral AI's open-weight generative model specialized for code generation, supporting 80+ languages.
Mistral AI's frontier-class multimodal model balancing SOTA performance, lower cost, and simpler deployability for enterprise usage. Excels in coding and multimodal understanding.
Mistral AI's cutting-edge language model for coding (second version). Specializes in low-latency, high-frequency tasks like fill-in-the-middle (FIM), code correction, and test generation.
A new leader in the small models category by Mistral AI, with image understanding capabilities and an extended 128k context length. Apache 2.0 license.
A 24B open-source text model from Mistral AI that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents. Apache 2.0 license.
A powerful and efficient model from Mistral AI for languages from the Middle East and South Asia.
OpenAI's most advanced reasoning model, designed for complex problem-solving with enhanced chain-of-thought capabilities.
A faster, more cost-effective version of o1 with strong reasoning capabilities, optimized for coding and STEM tasks.
OpenAI's flagship multimodal model, natively processing text, audio, and images for faster, more capable interactions.
OpenAI's most affordable and fastest model in the GPT-4o family, designed for high-volume, low-latency tasks.
OpenAI's powerful model prior to GPT-4o, with a large context window and strong performance on complex tasks. Supports vision.
OpenAI's fast, cost-effective model optimized for chat and simple tasks.
OpenAI's advanced reasoning model released in April 2025, successor to o1-preview. Features strong performance in complex problem-solving, coding, and handles both text and image inputs. Includes autonomous tool use.
A faster, more cost-effective version of o3, released in January 2025. Offers strong reasoning, coding, and vision capabilities. Optimized for math and coding tasks.
A faster, cost-efficient reasoning model, successor to o3-mini, released in April 2025. Offers strong performance on math, coding, and vision. Can process text and images, and features autonomous tool use.
Alibaba's large language model with strong performance in reasoning, coding, and multilingual tasks.
Alibaba's flagship Qwen3 Mixture-of-Experts model with 235B total parameters (22B active). Features hybrid reasoning and supports 119 languages. (Note: Not publicly available at release).
Alibaba's Qwen3 Mixture-of-Experts model with 30B total parameters (3B active). Features hybrid reasoning and supports 119 languages. Apache 2.0 license.
Alibaba's largest dense model in the Qwen3 family with 32B parameters. Features hybrid reasoning and supports 119 languages. Apache 2.0 license.
Anthropic's most advanced model, significantly improved over Claude 3 Sonnet with enhanced reasoning, coding, and vision capabilities.
Anthropic's enhanced Claude model with improved reasoning capabilities and better performance on complex tasks, positioned between Claude 3.5 Sonnet and Claude 4.
Anthropic's most powerful model prior to 3.5 Sonnet, excelling at highly complex tasks and demonstrating near-human levels of comprehension and fluency.
A balanced model from Anthropic, offering a blend of intelligence and speed, ideal for enterprise workloads and scaled AI deployments.
Anthropic's fastest and most compact model, designed for near-instant responsiveness and high throughput tasks.
Anthropic's most powerful model, excelling in coding, advanced reasoning, and AI agent workflows. Handles complex, long-running tasks.
Anthropic's highly capable and versatile model, offering a strong balance of intelligence, speed, and cost-effectiveness for enterprise applications.
DeepSeek's latest model with strong performance across reasoning, coding, and general tasks at competitive pricing.
DeepSeek's enhanced model with improved reasoning capabilities, expanded context window, and even more competitive pricing.
Google's latest experimental model with breakthrough multimodal capabilities and enhanced reasoning at an extremely competitive price point.
Google's highly capable multimodal model with a breakthrough long context window of up to 2 million tokens. Excels at complex reasoning, problem-solving, and understanding long-form content.
Google's faster and lower-cost version of Gemini 1.5 Pro, optimized for high-volume, high-frequency tasks while retaining a large context window and multimodal capabilities.
Google's most advanced reasoning Gemini model, capable of solving complex problems. Supports text, code, image, audio, and video inputs. Features a 1M token context window (up to 2M in some versions).
Google's best model for price and performance (as of May 2025), featuring hybrid reasoning capabilities. Supports text, code, image, audio, and video inputs. 1M token context window.
Meta's latest 70B parameter model with improved performance and capabilities, offering state-of-the-art results for its size.
Meta's multimodal model combining text and vision capabilities with strong performance across various tasks.
A smaller, efficient multimodal model from Meta with vision capabilities, suitable for edge deployment and cost-sensitive applications.
Meta's largest and most capable Llama 3.1 model, designed for complex reasoning, coding, and nuanced instruction following.
A large instruction-tuned model from Meta's Llama 3.1 series, offering a strong balance of performance and efficiency for a wide range of tasks.
A highly efficient instruction-tuned model from Meta's Llama 3.1 series, suitable for fast, on-device, or edge applications.
Mistral AI's flagship model with enhanced reasoning, coding, and multilingual capabilities.
Mistral AI's cost-effective model for straightforward tasks, offering good performance and efficiency.
Mistral AI's open-weight generative model specialized for code generation, supporting 80+ languages.
Mistral AI's frontier-class multimodal model balancing SOTA performance, lower cost, and simpler deployability for enterprise usage. Excels in coding and multimodal understanding.
Mistral AI's cutting-edge language model for coding (second version). Specializes in low-latency, high-frequency tasks like fill-in-the-middle (FIM), code correction, and test generation.
A new leader in the small models category by Mistral AI, with image understanding capabilities and an extended 128k context length. Apache 2.0 license.
A 24B open-source text model from Mistral AI that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents. Apache 2.0 license.
A powerful and efficient model from Mistral AI for languages from the Middle East and South Asia.
OpenAI's most advanced reasoning model, designed for complex problem-solving with enhanced chain-of-thought capabilities.
A faster, more cost-effective version of o1 with strong reasoning capabilities, optimized for coding and STEM tasks.
OpenAI's flagship multimodal model, natively processing text, audio, and images for faster, more capable interactions.
OpenAI's most affordable and fastest model in the GPT-4o family, designed for high-volume, low-latency tasks.
OpenAI's powerful model prior to GPT-4o, with a large context window and strong performance on complex tasks. Supports vision.
OpenAI's fast, cost-effective model optimized for chat and simple tasks.
OpenAI's advanced reasoning model released in April 2025, successor to o1-preview. Features strong performance in complex problem-solving, coding, and handles both text and image inputs. Includes autonomous tool use.
A faster, more cost-effective version of o3, released in January 2025. Offers strong reasoning, coding, and vision capabilities. Optimized for math and coding tasks.
A faster, cost-efficient reasoning model, successor to o3-mini, released in April 2025. Offers strong performance on math, coding, and vision. Can process text and images, and features autonomous tool use.