AI Model Changelog & Release Timeline

OpenAI

OpenAI GPT-5.2 (400k)

400K

OpenAI flagship model for coding and agentic tasks across industries.

OpenAI

OpenAI GPT-5.1 (400k)

400K

Flagship GPT model with configurable reasoning effort; predecessor to GPT-5.2.

OpenAI

OpenAI GPT-5 Nano (400k)

400K

Fastest, most cost-efficient GPT-5 variant.

OpenAI

OpenAI GPT-5.1-Codex (400k)

400K

GPT-5.1 variant optimized for agentic coding in Codex (Responses API only).

OpenAI

OpenAI GPT-5.1-Codex-Max (400k)

400K

Most intelligent Codex model optimized for long-horizon agentic coding (Responses API only).

OpenAI

OpenAI o3-pro (200k)

200K

Version of o3 with more compute for better responses (Responses API only).

OpenAI

OpenAI GPT-OSS 120B (Open-Weight)

Unknown

Open-weight model entry as listed by OpenAI (see models page). Token costs depend on where you run it.

OpenAI

OpenAI GPT-OSS 20B (Open-Weight)

Unknown

Open-weight model entry as listed by OpenAI (see models page). Token costs depend on where you run it.

Anthropic

Anthropic Claude Mythos 5 (1M)

1M

Anthropic's most powerful frontier model - the Mythos-class system with state-of-the-art agentic and cybersecurity capability. Distributed under strict access controls; Fable 5 is its safeguarded public sibling.

Anthropic

Anthropic Claude Fable 5 (1M)

1M

The first public Mythos-class model - Anthropic's most powerful generally available model, sharing Mythos 5's underlying intelligence with added safeguards that fall back to Opus 4.8 on sensitive prompts.

Anthropic

Anthropic Claude Opus 4.8 (1M)

1M

Anthropic's most capable Opus-tier model - highly autonomous, state-of-the-art on long-horizon agentic work, knowledge work and memory, with clearer, warmer writing. 1M context at standard pricing.

Google

Google Gemini 3.5 Flash (1M)

1M

Google's newest fast model (Google I/O 2026), built for agentic tasks and coding - beats Gemini 3.1 Pro on coding and agentic benchmarks at a fraction of the cost.

OpenAI

OpenAI GPT-5.5 (1M)

1M

OpenAI's flagship GPT-5.5, released April 2026. A ~1.05M token context window with strong terminal/agentic benchmarks; available in the Responses and Chat Completions APIs.

OpenAI

OpenAI GPT-5.5 Pro (1M)

1M

A higher-accuracy GPT-5.5 variant for the most demanding tasks, available via the API at premium pricing.

OpenAI

GPT-5.4

256K

OpenAI's most advanced model with enhanced reasoning, longer context, and multimodal capabilities. Top of the line for complex tasks.

OpenAI

GPT-5.4 Mini

128K

Affordable version of GPT-5.4 with strong performance for everyday tasks at a fraction of the cost.

Google

Gemini 3.1 Flash

1M

Fast and affordable Gemini 3.1 Flash optimized for high-throughput applications at minimal cost.

xAI

xAI Grok 3.5 (256k)

256K

xAI's latest flagship model with expanded context, improved reasoning, and deeper integration with real-time data from the X platform.

Anthropic

Anthropic Claude Opus 4.7 (1M)

1M

Previous-generation Opus - highly autonomous with excellent long-horizon agentic work, vision and memory. Adaptive thinking only; high-resolution vision support.

Anthropic

Anthropic Claude Opus 4.6 (1M)

1M

Anthropic's Claude Opus 4.6 with extended 1M token context window for processing entire codebases, books, and massive datasets in a single prompt.

Anthropic

Anthropic Claude Opus 4.6 (200k)

200K

Anthropic's latest flagship model with best-in-class reasoning, coding, and agentic capabilities. Supports extended thinking for complex multi-step problems.

Anthropic

Anthropic Claude Sonnet 4.6 (200k)

200K

Anthropic's balanced mid-tier model offering strong intelligence, speed, and cost-effectiveness. Excellent for everyday coding and enterprise tasks.

Google

Google Gemini 3.1 Pro (2M)

2M

Google's most capable Gemini Pro model with an industry-leading 2M token context window - ideal for large-document analysis and long multi-turn reasoning.

DeepSeek

DeepSeek V4 (128k)

128K

DeepSeek's fourth-generation open-weights model with state-of-the-art reasoning at remarkably low cost. Strong performance on coding and math benchmarks.

OpenAI

OpenAI o3 Deep Research (200k)

200K

An o3-family model optimized for deep research tasks. Autonomously browses the web, synthesizes information, and produces comprehensive research reports.

Mistral AI

Mistral Large 3 (128k)

128K

Mistral's latest flagship model with multilingual excellence, strong coding, and enterprise-grade function calling. Open-weight model.

Alibaba

Alibaba Qwen 3 (128k)

128K

Alibaba's latest Qwen 3 flagship model with strong multilingual capabilities and improved reasoning. Competitive with frontier models at lower cost.

Midjourney

Midjourney v7

N/A

Professional AI image generation with photorealistic quality and artistic control. Subscription-based: Basic $10/mo, Standard $30/mo, Pro $60/mo.

OpenAI

OpenAI GPT-5.2 Pro (400k)

400K

Higher-compute variant of GPT-5.2 for harder problems (Responses API only).

Google

Google Gemini 3 Flash Preview (1M)

1M

Gemini 3 Flash Preview on Vertex AI.

Anthropic

Anthropic Claude Opus 4.5 (200k)

200K

Claude 4.5 flagship Opus model for long-horizon coding and agentic workflows.

Google

Google Gemini 3 Pro Preview (1M)

1M

Gemini 3 Pro Preview on Vertex AI.

Anthropic

Anthropic Claude Haiku 4.5 (200k)

200K

Anthropic's fastest 4.5-series model - cost-effective for high-volume workloads with extended thinking and strong tool use.

Cognition / Windsurf

Cognition SWE-1.5 (Windsurf in-house)

Unknown

Windsurf/Cognition in-house frontier model for agentic coding. Consumed via Windsurf prompt credits (not USD per-token).

Anthropic

Anthropic Claude Haiku 4.5 (200k)

200K

Anthropic's fastest and most cost-effective 4.5-series model, optimized for high-volume workloads with strong tool use capabilities.

Anthropic

Anthropic Claude Sonnet 4.5 (200k)

200K

Anthropic's latest and most advanced Sonnet model, released September 30, 2025. Features dramatically improved coding capabilities, enhanced reasoning, and better long-context performance. Outperforms Claude Sonnet 4 on all major benchmarks while maintaining cost efficiency.

OpenAI

OpenAI GPT-5-Codex (400k)

400K

A version of GPT-5 optimized for agentic coding in Codex. Default in Codex cloud & reviews; also usable via API key. Priced the same as GPT-5.

Anthropic

Anthropic Claude Sonnet 4.5 (200k/1M beta)

1M

Anthropic's most intelligent model for agents and coding, with extended thinking and state-of-the-art performance on SWE-bench Verified.

Google

Google Gemini 3.0 Ultra (2M)

2M

Google's next-generation flagship model with breakthrough multimodal capabilities and 2M token context window. Features advanced reasoning, native code execution, and real-time multimodal understanding across text, image, audio, and video.

Google

Google Gemini 3.0 (2M)

2M

Next-generation Gemini with advanced multimodal reasoning and long context. Rates pending official pricing page.

Anthropic

Anthropic Claude Opus 4.1 (200k)

200K

Opus 4.1 (200k) - higher-cost flagship Opus tier. Newer Opus 4.5 provides a more accessible price point.

OpenAI

OpenAI GPT-5 Pro (400k)

400K

Version of GPT-5 that produces smarter and more precise responses. Responses API only; higher max output than standard GPT-5.

OpenAI

OpenAI GPT-5 (400k)

400K

Previous flagship GPT model for coding, reasoning, and agentic tasks. OpenAI recommends GPT-5.1/5.2 for newest improvements.

OpenAI

OpenAI GPT-5 Mini (400k)

400K

A faster, lower-cost GPT-5 for well-defined tasks. Text & vision with long context at a fraction of the price.

OpenAI

OpenAI GPT-5 Low (1M)

1M

Alias tier aligned with GPT-5 Mini pricing; use when your workflow targets the "low" cost tier.

OpenAI

OpenAI GPT-5 Medium (1M)

1M

Mid-tier GPT-5 variant balancing performance and cost for general-purpose workloads.

OpenAI

OpenAI GPT-5 High (1M)

1M

High-tier GPT-5 with enhanced reasoning, reliability and coding for mission-critical workloads.

Moonshot AI

Moonshot Kimi K2 Base (128k)

128K

A 1T parameter open-weight Mixture-of-Experts (MoE) model with 32B active parameters. This is the unaligned, pre-trained base model, suitable for further fine-tuning.

Moonshot AI

Moonshot Kimi K2 Instruct (128k)

128K

The instruction-tuned version of Kimi K2, optimized for chat, agentic tasks, and tool use. Aligned with RLHF for helpful and safe responses.

Alibaba

Alibaba Qwen3 Coder Flash (1M)

1M

A 30B parameter model from the Qwen3 series, excelling in coding and agentic tasks with a 1M token context length.

Zhipu AI

Zhipu GLM-4.5 (200k)

200K

GLM-4.5 agentic foundation model. Official pricing is published in RMB on BigModel; USD prices vary by provider.

Alibaba

Alibaba Qwen3 Coder (1M)

1M

Qwen3 coding-specialized model with long-context capabilities and strong tool use.

Anthropic

Anthropic Claude Opus 4 (200k)

200K

Anthropic's most powerful model, excelling in coding, advanced reasoning, and AI agent workflows. Handles complex, long-running tasks.

Anthropic

Anthropic Claude Sonnet 4 (200k)

200K

Anthropic's highly capable and versatile model, offering a strong balance of intelligence, speed, and cost-effectiveness for enterprise applications.

Google

Google Gemini 2.5 Pro (1M)

1M

Google's most advanced reasoning Gemini model, capable of solving complex problems. Supports text, code, image, audio, and video inputs. Features a 1M token context window (up to 2M in some versions).

Mistral AI

Mistral Medium 3 (128k)

128K

Mistral AI's frontier-class multimodal model balancing SOTA performance, lower cost, and simpler deployability for enterprise usage. Excels in coding and multimodal understanding.

Mistral AI

Mistral Devstral Small (128k)

128K

A 24B open-source text model from Mistral AI that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents. Apache 2.0 license.

Google

Google Gemini 2.5 Flash (1M)

1M

Google's best model for price and performance (as of May 2025), featuring hybrid reasoning capabilities. Supports text, code, image, audio, and video inputs. 1M token context window.

OpenAI

OpenAI Codex Mini (200k)

200K

OpenAI's fast coding model designed for the Codex coding agent. Optimized for rapid code generation, editing, and review within development workflows.

Pika Labs

Pika 2

N/A

Fast and creative video generation with emphasis on artistic styles and special effects. Free tier available with paid options.

OpenAI

OpenAI o3 (200k)

200K

OpenAI reasoning model for complex tasks (text+image input, text output). Succeeded by GPT-5.x for many agentic workloads.

OpenAI

OpenAI o4-mini (200k)

200K

A faster, cost-efficient reasoning model, successor to o3-mini, released in April 2025. Offers strong performance on math, coding, and vision. Can process text and images, and features autonomous tool use.

Alibaba

Alibaba Qwen3 235B MoE (128k)

128K

Alibaba's flagship Qwen3 Mixture-of-Experts model with 235B total parameters (22B active). Features hybrid reasoning and supports 119 languages. (Note: Not publicly available at release).

Alibaba

Alibaba Qwen3 30B MoE (128k)

128K

Alibaba's Qwen3 Mixture-of-Experts model with 30B total parameters (3B active). Features hybrid reasoning and supports 119 languages. Apache 2.0 license.

Alibaba

Alibaba Qwen3 32B Dense (128k)

128K

Alibaba's largest dense model in the Qwen3 family with 32B parameters. Features hybrid reasoning and supports 119 languages. Apache 2.0 license.

OpenAI

OpenAI GPT-4.1 (400k)

400K

Smartest non-reasoning GPT model (text+image in, text out).

OpenAI

OpenAI GPT-4.1 mini (Unknown context)

Unknown

Smaller, faster GPT-4.1 tier with low cost.

OpenAI

OpenAI GPT-4.1 nano (Unknown context)

Unknown

Fastest, cheapest GPT-4.1 tier.

Google

Google Gemini 2.5 Flash Preview (1M)

1M

Preview of Google's Gemini 2.5 Flash with hybrid reasoning capabilities. Ultra-fast and cost-efficient for high-volume applications.

Meta

Meta Llama 4 Maverick (1M)

1M

Meta's large Llama 4 Mixture-of-Experts model with 400B total parameters (17B active, 128 experts). Natively multimodal with a 1M token context window. Open source.

Meta

Meta Llama 4 Scout (10M)

10M

Meta's efficient Llama 4 model with an industry-leading 10M token context window. 109B total parameters (17B active, 16 experts). Natively multimodal and open source.

Meta

Meta Llama 4 Maverick (1M)

1M

Meta's Llama 4 Maverick with mixture-of-experts architecture, 1M context window, and strong multilingual support. Open-source model.

Meta

Meta Llama 4 Scout (10M)

10M

Meta's Llama 4 Scout with an industry-leading 10M token context window and a 16-expert MoE architecture. Optimized for efficiency.

Mistral AI

Mistral Small 3.1 (128k)

128K

A new leader in the small models category by Mistral AI, with image understanding capabilities and an extended 128k context length. Apache 2.0 license.

OpenAI

OpenAI GPT-4o with Search (128k)

128K

GPT-4o variant with built-in web search grounding. Provides up-to-date, cited answers by searching the web in real time.

Google

Google Gemini 2.5 Pro Preview (1M)

1M

Early preview of Google's Gemini 2.5 Pro thinking model. Excels at reasoning, coding, and multimodal tasks with a 1M token context window.

Cohere

Cohere Command A (256k)

256K

Cohere's next-generation enterprise model with an expanded 256K context window. Optimized for agentic RAG, tool use, and structured outputs.

Google

Google Gemma 3 27B (128k)

128K

Google's open-source 27B parameter model from the Gemma 3 family. Natively multimodal with strong performance on text, image, and video tasks. Free to use under open license.

Anthropic

Anthropic Claude 3.7 Sonnet (Deprecated, 200k)

200K

Hybrid reasoning Claude 3.x model (extended thinking). Deprecated; recommended replacement is Claude Sonnet 4.5.

OpenAI

OpenAI o3-mini (200k)

200K

A faster, more cost-effective version of o3, released in January 2025. Offers strong reasoning, coding, and vision capabilities. Optimized for math and coding tasks.

Mistral AI

Mistral Saba (32k)

32K

A powerful and efficient model from Mistral AI for languages from the Middle East and South Asia.

xAI

xAI Grok 3 (128k)

128K

xAI's flagship large language model with strong reasoning, coding, and math capabilities. Trained on the Colossus supercluster.

xAI

xAI Grok 3 Mini (128k)

128K

xAI's lightweight reasoning model with think mode. Faster and more cost-efficient than Grok 3 while maintaining strong reasoning capabilities.

DeepSeek

DeepSeek 3.1 (128k)

128K

DeepSeek's enhanced model with improved reasoning capabilities, expanded context window, and even more competitive pricing.

Mistral AI

Mistral Codestral 2 (256k)

256K

Mistral AI's cutting-edge language model for coding (second version). Specializes in low-latency, high-frequency tasks like fill-in-the-middle (FIM), code correction, and test generation.

DeepSeek

DeepSeek R1 (128k)

128K

DeepSeek's reasoning model trained with reinforcement learning. Excels at math, coding, and complex reasoning tasks with transparent chain-of-thought.

Google

Google Gemini 2.0 Flash (1M)

1M

Google's latest experimental model with breakthrough multimodal capabilities and enhanced reasoning at an extremely competitive price point.

Meta

Meta Llama 3.3 70B Instruct

128K

Meta's latest 70B parameter model with improved performance and capabilities, offering state-of-the-art results for its size.

DeepSeek

DeepSeek V3 (64k)

64K

DeepSeek's latest model with strong performance across reasoning, coding, and general tasks at competitive pricing.

OpenAI

OpenAI o1 (200k)

200K

Previous full o-series reasoning model (text+image in, text out).

OpenAI

OpenAI o1-pro (200k)

200K

Higher-compute variant of o1 for better responses.

Microsoft

Microsoft Phi-4 (16k)

16K

Microsoft's small language model with 14B parameters that punches well above its weight. Excels at STEM reasoning and coding despite its compact size. Open source under MIT license.

Amazon

Amazon Nova Pro (300k)

300K

Amazon's highly capable multimodal model balancing accuracy, speed, and cost. Processes text, images, and video inputs for a wide range of enterprise tasks.

Amazon

Amazon Nova Lite (300k)

300K

Amazon's very low-cost multimodal model for high-volume tasks. Processes text, images, and video at extremely competitive pricing via Amazon Bedrock.

OpenAI

OpenAI Sora Turbo

N/A

Advanced text-to-video generation with realistic motion and scene understanding. Pricing per video generation varies by length and resolution.

Anthropic

Anthropic Claude 3.5 Sonnet (200k)

200K

Anthropic's most advanced model, significantly improved over Claude 3 Sonnet with enhanced reasoning, coding, and vision capabilities.

Anthropic

Anthropic Claude 3.5 Haiku (Deprecated, 200k)

200K

Claude 3.5 Haiku snapshot. Deprecated; recommended replacement is Haiku 4.5.

OpenAI

OpenAI o1-preview (Deprecated, 128k)

128K

Deprecated preview snapshot of OpenAI's first o-series reasoning model. Kept for backwards compatibility; prefer o1 for production.

OpenAI

OpenAI o1-mini (128k)

128K

Smaller, faster o-series reasoning model. Deprecated in favor of newer reasoning models, but still supported in legacy workflows.

Meta

Meta Llama 3.2 90B Vision Instruct

128K

Meta's multimodal model combining text and vision capabilities with strong performance across various tasks.

Meta

Meta Llama 3.2 11B Vision Instruct

128K

A smaller, efficient multimodal model from Meta with vision capabilities, suitable for edge deployment and cost-sensitive applications.

Mistral AI

Mistral Small (32k)

32K

Mistral AI's cost-effective model for straightforward tasks, offering good performance and efficiency.

Alibaba

Qwen 2.5 72B Instruct

32K

Alibaba's large language model with strong performance in reasoning, coding, and multilingual tasks.

AI21 Labs

AI21 Jamba 1.5 Large (256k)

256K

AI21's large hybrid SSM-Transformer model with a 256K context window. Uses a novel Jamba architecture combining Mamba SSM layers with Transformer attention for efficient long-context processing.

OpenAI

OpenAI GPT-4o mini (128k)

128K

OpenAI's most affordable and fastest model in the GPT-4o family, designed for high-volume, low-latency tasks.

Meta

Meta Llama 3.1 405B Instruct

128K

Meta's largest and most capable Llama 3.1 model, designed for complex reasoning, coding, and nuanced instruction following.

Meta

Meta Llama 3.1 70B Instruct

128K

A large instruction-tuned model from Meta's Llama 3.1 series, offering a strong balance of performance and efficiency for a wide range of tasks.

Meta

Meta Llama 3.1 8B Instruct

128K

A highly efficient instruction-tuned model from Meta's Llama 3.1 series, suitable for fast, on-device, or edge applications.

Mistral AI

Mistral Large 2 (128k)

128K

Mistral AI's flagship model with enhanced reasoning, coding, and multilingual capabilities.

Runway

Runway Gen-3 Alpha

N/A

Professional video generation and editing AI with motion control and style consistency. Subscription-based pricing.

OpenAI

OpenAI GPT-4o (128k)

128K

OpenAI's flagship multimodal model, natively processing text, audio, and images for faster, more capable interactions.

Google

Google Gemini 1.5 Flash (1M)

1M

Google's faster and lower-cost version of Gemini 1.5 Pro, optimized for high-volume, high-frequency tasks while retaining a large context window and multimodal capabilities.

Mistral AI

Mistral Codestral (22B)

32K

Mistral AI's open-weight generative model specialized for code generation, supporting 80+ languages.

OpenAI

OpenAI GPT-4 Turbo (128k)

128K

OpenAI's powerful model prior to GPT-4o, with a large context window and strong performance on complex tasks. Supports vision.

Cohere

Cohere Command R+ (128k)

128K

Cohere's most powerful model optimized for enterprise RAG and tool use. Excels at grounded generation with citations and multi-step tool workflows.

Anthropic

Anthropic Claude 3 Opus (200k)

200K

Claude 3 Opus (deprecated): previous highest-intelligence Claude 3 model.

Anthropic

Anthropic Claude 3 Sonnet (200k)

200K

A balanced model from Anthropic, offering a blend of intelligence and speed, ideal for enterprise workloads and scaled AI deployments.

Anthropic

Anthropic Claude 3 Haiku (200k)

200K

Anthropic's fastest and most compact model, designed for near-instant responsiveness and high throughput tasks.

Google

Google Gemini 1.5 Pro (2M)

2M

Google's highly capable multimodal model with a breakthrough long context window of up to 2 million tokens. Excels at complex reasoning, problem-solving, and understanding long-form content.

Leonardo.AI

Leonardo AI

N/A

Game asset and creative image generation with consistent character and style creation. Credit-based pricing system.

OpenAI

OpenAI DALL-E 3

N/A

State-of-the-art text-to-image generation with improved prompt following and image quality. Pricing per image: HD 1024×1024 $0.040, Standard $0.020

Stability AI

Stable Diffusion XL

N/A

Open-source image generation model with fine-tuning capabilities. API pricing varies by provider, self-hosting available.

OpenAI

OpenAI GPT-3.5 Turbo (16k)

16K

OpenAI's fast, cost-effective model optimized for chat and simple tasks.

AI Model Release Timeline

OpenAI GPT-5.2 (400k)

OpenAI GPT-5.1 (400k)

OpenAI GPT-5 Nano (400k)

OpenAI GPT-5.1-Codex (400k)

OpenAI GPT-5.1-Codex-Max (400k)

OpenAI o3-pro (200k)

OpenAI GPT-OSS 120B (Open-Weight)

OpenAI GPT-OSS 20B (Open-Weight)

Anthropic Claude Mythos 5 (1M)

Anthropic Claude Fable 5 (1M)

Anthropic Claude Opus 4.8 (1M)

Google Gemini 3.5 Flash (1M)

OpenAI GPT-5.5 (1M)

OpenAI GPT-5.5 Pro (1M)

GPT-5.4

GPT-5.4 Mini

Gemini 3.1 Flash

xAI Grok 3.5 (256k)

Anthropic Claude Opus 4.7 (1M)

Anthropic Claude Opus 4.6 (1M)

Anthropic Claude Opus 4.6 (200k)

Anthropic Claude Sonnet 4.6 (200k)

Google Gemini 3.1 Pro (2M)

DeepSeek V4 (128k)

OpenAI o3 Deep Research (200k)

Mistral Large 3 (128k)

Alibaba Qwen 3 (128k)

Midjourney v7

OpenAI GPT-5.2 Pro (400k)

Google Gemini 3 Flash Preview (1M)

Anthropic Claude Opus 4.5 (200k)

Google Gemini 3 Pro Preview (1M)

Anthropic Claude Haiku 4.5 (200k)

Cognition SWE-1.5 (Windsurf in-house)

Anthropic Claude Haiku 4.5 (200k)

Anthropic Claude Sonnet 4.5 (200k)

OpenAI GPT-5-Codex (400k)

Anthropic Claude Sonnet 4.5 (200k/1M beta)

Google Gemini 3.0 Ultra (2M)

Google Gemini 3.0 (2M)