AI Model Release Timeline

Track the latest AI model releases from all major providers in one place.

Unknown
OpenAI

OpenAI GPT-5.2 (400k)

400K

OpenAI flagship model for coding and agentic tasks across industries.

OpenAI

OpenAI GPT-5.1 (400k)

400K

Flagship GPT model with configurable reasoning effort; predecessor to GPT-5.2.

OpenAI

OpenAI GPT-5 Nano (400k)

400K

Fastest, most cost-efficient GPT-5 variant.

OpenAI

OpenAI GPT-5.1-Codex (400k)

400K

GPT-5.1 variant optimized for agentic coding in Codex (Responses API only).

OpenAI

OpenAI GPT-5.1-Codex-Max (400k)

400K

Most intelligent Codex model optimized for long-horizon agentic coding (Responses API only).

OpenAI

OpenAI o3-pro (200k)

200K

Version of o3 with more compute for better responses (Responses API only).

OpenAI

OpenAI GPT-OSS 120B (Open-Weight)

Unknown

Open-weight model entry as listed by OpenAI (see models page). Token costs depend on where you run it.

OpenAI

OpenAI GPT-OSS 20B (Open-Weight)

Unknown

Open-weight model entry as listed by OpenAI (see models page). Token costs depend on where you run it.

April 2026
OpenAI

OpenAI GPT-5.5

512K

OpenAI's April 2026 flagship: GPT-5.5 ships materially better long-context reasoning than 5.4, a 512K default context window, doubled multimodal capability, and lower API pricing. Targets premium production workloads where 5.4 was leaving capability on the table.

Anthropic

Anthropic Claude Opus 4.7

1M

Anthropic's April 2026 flagship. Drops the long-context premium SKU — 1M token context is the default tier. ~30% lower median latency than 4.6 on long-context requests, materially better long-context retrieval (96.4% at 1M vs 91% for 4.6), and adds a `thinking_budget_tokens` parameter for explicit cost control on extended reasoning. Released alongside the easing of the March 2026 peak-hour Pro/Max throttle.

March 2026
OpenAI

GPT-5.4

256K

OpenAI's most advanced model with enhanced reasoning, longer context, and multimodal capabilities. Top of the line for complex tasks.

OpenAI

GPT-5.4 Mini

128K

Affordable version of GPT-5.4 with strong performance for everyday tasks at a fraction of the cost.

Google

Gemini 3.1 Pro

2M

Google's latest Gemini 3.1 Pro with 2M context window, improved code understanding, and enhanced factual accuracy.

Google

Gemini 3.1 Flash

1M

Fast and affordable Gemini 3.1 Flash optimized for high-throughput applications at minimal cost.

xAI

xAI Grok 3.5 (256k)

256K

xAI's latest flagship model with expanded context, improved reasoning, and deeper integration with real-time data from X platform.

February 2026
Anthropic

Anthropic Claude Opus 4.6 (1M)

1M

Anthropic's Claude Opus 4.6 with extended 1M token context window for processing entire codebases, books, and massive datasets in a single prompt.

Anthropic

Anthropic Claude Opus 4.6 (200k)

200K

Anthropic's latest flagship model with best-in-class reasoning, coding, and agentic capabilities. Supports extended thinking for complex multi-step problems.

Anthropic

Anthropic Claude Sonnet 4.6 (200k)

200K

Anthropic's balanced mid-tier model offering strong intelligence, speed, and cost-effectiveness. Excellent for everyday coding and enterprise tasks.

DeepSeek

DeepSeek V4 (128k)

128K

DeepSeek's fourth-generation open-weights model with state-of-the-art reasoning at remarkably low cost. Strong performance on coding and math benchmarks.

January 2026
OpenAI

OpenAI o3 Deep Research (200k)

200K

An o3-family model optimized for deep research tasks. Autonomously browses the web, synthesizes information, and produces comprehensive research reports.

Mistral AI

Mistral Large 3 (128k)

128K

Mistral's latest flagship model with multilingual excellence, strong coding, and enterprise-grade function calling. Open-weight model.

Alibaba

Alibaba Qwen 3 (128k)

128K

Alibaba's latest Qwen 3 flagship model with strong multilingual capabilities and improved reasoning. Competitive with frontier models at lower cost.

Midjourney

Midjourney v7

N/A

Professional AI image generation with photorealistic quality and artistic control. Subscription-based: Basic $10/mo, Standard $30/mo, Pro $60/mo

December 2025
OpenAI

OpenAI GPT-5.2 Pro (400k)

400K

Higher-compute variant of GPT-5.2 for harder problems (Responses API only).

Google

Google Gemini 3 Flash Preview (1M)

1M

Gemini 3 Flash Preview on Vertex AI.

November 2025
Anthropic

Anthropic Claude Opus 4.5 (200k)

200K

Claude 4.5 flagship Opus model for long-horizon coding and agentic workflows.

Google

Google Gemini 3 Pro Preview (1M)

1M

Gemini 3 Pro Preview on Vertex AI.

October 2025
Anthropic

Anthropic Claude Haiku 4.5 (200k)

200K

Anthropic's fastest 4.5-series model, cost-effective for high-volume workloads with extended thinking and strong tool use.

Cognition / Windsurf

Cognition SWE-1.5 (Windsurf in-house)

Unknown

Windsurf/Cognition in-house frontier model for agentic coding. Consumed via Windsurf prompt credits (not USD per-token).

Anthropic

Anthropic Claude Haiku 4.5 (200k)

200K

Anthropic's fastest and most cost-effective 4.5-series model, optimized for high-volume workloads with strong tool use capabilities.

September 30, 2025
Anthropic

Anthropic Claude Sonnet 4.5 (200k)

200K

Anthropic's Sonnet 4.5 model, released September 30, 2025. Features dramatically improved coding capabilities, enhanced reasoning, and better long-context performance. Outperforms Claude Sonnet 4 on all major benchmarks while maintaining cost efficiency.

September 2025
OpenAI

OpenAI GPT-5-Codex (400k)

400K

A version of GPT-5 optimized for agentic coding in Codex. Default in Codex cloud & reviews; also usable via API key. Priced the same as GPT-5.

Anthropic

Anthropic Claude Sonnet 4.5 (200k/1M beta)

1M

Anthropic's most intelligent model for agents and coding, with extended thinking and state-of-the-art performance on SWE-bench Verified.

Google

Google Gemini 3.0 Ultra (2M)

2M

Google's next-generation flagship model with breakthrough multimodal capabilities and 2M token context window. Features advanced reasoning, native code execution, and real-time multimodal understanding across text, image, audio, and video.

Google

Google Gemini 3.0 (2M)

2M

Next-generation Gemini with advanced multimodal reasoning and long context. Rates pending official pricing page.

August 2025
Anthropic

Anthropic Claude Opus 4.1 (200k)

200K

Opus 4.1 (200k): higher-cost flagship Opus tier. The newer Opus 4.5 provides a more accessible price point.

OpenAI

OpenAI GPT-5 Pro (400k)

400K

Version of GPT-5 that produces smarter and more precise responses. Responses API only; higher max output than standard GPT-5.

OpenAI

OpenAI GPT-5 (400k)

400K

Previous flagship GPT model for coding, reasoning, and agentic tasks. OpenAI recommends GPT-5.1/5.2 for newest improvements.

OpenAI

OpenAI GPT-5 Mini (400k)

400K

A faster, lower-cost GPT-5 for well-defined tasks. Text & vision with long context at a fraction of the price.

OpenAI

OpenAI GPT-5 Low (1M)

1M

Alias tier aligned with GPT-5 Mini pricing; use when your workflow targets the "low" cost tier.

OpenAI

OpenAI GPT-5 Medium (1M)

1M

Mid-tier GPT-5 variant balancing performance and cost for general-purpose workloads.

OpenAI

OpenAI GPT-5 High (1M)

1M

High-tier GPT-5 with enhanced reasoning, reliability and coding for mission-critical workloads.

July 2025
Moonshot AI

Moonshot Kimi K2 Base (128k)

128K

A 1T parameter open-weight Mixture-of-Experts (MoE) model with 32B active parameters. This is the unaligned, pre-trained base model, suitable for further fine-tuning.

Moonshot AI

Moonshot Kimi K2 Instruct (128k)

128K

The instruction-tuned version of Kimi K2, optimized for chat, agentic tasks, and tool use. Aligned with RLHF for helpful and safe responses.

Alibaba

Alibaba Qwen3 Coder Flash (1M)

1M

A 30B parameter model from the Qwen3 series, excelling in coding and agentic tasks with a 1M token context length.

Zhipu AI

Zhipu GLM-4.5 (200k)

200K

GLM-4.5 agentic foundation model. Official pricing is published in RMB on BigModel; USD prices vary by provider.

Alibaba

Alibaba Qwen3 Coder (1M)

1M

Qwen3 coding-specialized model with long-context capabilities and strong tool-use.

May 2025
Anthropic

Anthropic Claude Opus 4 (200k)

200K

Anthropic's most powerful model, excelling in coding, advanced reasoning, and AI agent workflows. Handles complex, long-running tasks.

Anthropic

Anthropic Claude Sonnet 4 (200k)

200K

Anthropic's highly capable and versatile model, offering a strong balance of intelligence, speed, and cost-effectiveness for enterprise applications.

Google

Google Gemini 2.5 Pro (1M)

1M

Google's most advanced reasoning Gemini model, capable of solving complex problems. Supports text, code, image, audio, and video inputs. Features a 1M token context window (up to 2M in some versions).

Mistral AI

Mistral Medium 3 (128k)

128K

Mistral AI's frontier-class multimodal model balancing SOTA performance, lower cost, and simpler deployability for enterprise usage. Excels in coding and multimodal understanding.

Mistral AI

Mistral Devstral Small (128k)

128K

A 24B open-source text model from Mistral AI that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents. Apache 2.0 license.

Google

Google Gemini 2.5 Flash (1M)

1M

Google's best model for price and performance (as of May 2025), featuring hybrid reasoning capabilities. Supports text, code, image, audio, and video inputs. 1M token context window.

OpenAI

OpenAI Codex Mini (200k)

200K

OpenAI's fast coding model designed for the Codex coding agent. Optimized for rapid code generation, editing, and review within development workflows.

Pika Labs

Pika 2

N/A

Fast and creative video generation with emphasis on artistic styles and special effects. Free tier available with paid options.

April 2025
OpenAI

OpenAI o3 (200k)

200K

OpenAI reasoning model for complex tasks (text+image input, text output). Succeeded by GPT-5.x for many agentic workloads.

OpenAI

OpenAI o4-mini (200k)

200K

A faster, cost-efficient reasoning model, successor to o3-mini, released in April 2025. Offers strong performance on math, coding, and vision. Can process text and images, and features autonomous tool use.

Alibaba

Alibaba Qwen3 235B MoE (128k)

128K

Alibaba's flagship Qwen3 Mixture-of-Experts model with 235B total parameters (22B active). Features hybrid reasoning and supports 119 languages. (Note: Not publicly available at release).

Alibaba

Alibaba Qwen3 30B MoE (128k)

128K

Alibaba's Qwen3 Mixture-of-Experts model with 30B total parameters (3B active). Features hybrid reasoning and supports 119 languages. Apache 2.0 license.

Alibaba

Alibaba Qwen3 32B Dense (128k)

128K

Alibaba's largest dense model in the Qwen3 family with 32B parameters. Features hybrid reasoning and supports 119 languages. Apache 2.0 license.

OpenAI

OpenAI GPT-4.1 (400k)

400K

Smartest non-reasoning GPT model (text+image in, text out).

OpenAI

OpenAI GPT-4.1 mini (Unknown context)

Unknown

Smaller, faster GPT-4.1 tier with low cost.

OpenAI

OpenAI GPT-4.1 nano (Unknown context)

Unknown

Fastest, cheapest GPT-4.1 tier.

Google

Google Gemini 2.5 Flash Preview (1M)

1M

Preview of Google's Gemini 2.5 Flash with hybrid reasoning capabilities. Ultra-fast and cost-efficient for high-volume applications.

Meta

Meta Llama 4 Maverick (1M)

1M

Meta's large Llama 4 Mixture-of-Experts model with 400B total parameters (17B active, 128 experts). Natively multimodal with a 1M token context window. Open source.

Meta

Meta Llama 4 Scout (10M)

10M

Meta's efficient Llama 4 model with an industry-leading 10M token context window. 109B total parameters (17B active, 16 experts). Natively multimodal and open source.

Meta

Meta Llama 4 Maverick (1M)

1M

Meta's Llama 4 Maverick with mixture-of-experts architecture, 1M context window, and strong multilingual support. Open-source model.

Meta

Meta Llama 4 Scout (10M)

10M

Meta's Llama 4 Scout with an industry-leading 10M token context window and a 16-expert MoE architecture. Optimized for efficiency.

March 2025
Mistral AI

Mistral Small 3.1 (128k)

128K

A new leader in the small models category by Mistral AI, with image understanding capabilities and an extended 128k context length. Apache 2.0 license.

OpenAI

OpenAI GPT-4o with Search (128k)

128K

GPT-4o variant with built-in web search grounding. Provides up-to-date, cited answers by searching the web in real time.

Google

Google Gemini 2.5 Pro Preview (1M)

1M

Early preview of Google's Gemini 2.5 Pro thinking model. Excels at reasoning, coding, and multimodal tasks with a 1M token context window.

Cohere

Cohere Command A (256k)

256K

Cohere's next-generation enterprise model with an expanded 256K context window. Optimized for agentic RAG, tool use, and structured outputs.

Google

Google Gemma 3 27B (128k)

128K

Google's open-source 27B parameter model from the Gemma 3 family. Natively multimodal with strong performance on text, image, and video tasks. Free to use under open license.

February 2025
Anthropic

Anthropic Claude 3.7 Sonnet (Deprecated, 200k)

200K

Hybrid reasoning Claude 3.x model (extended thinking). Deprecated; recommended replacement is Claude Sonnet 4.5.

OpenAI

OpenAI o3-mini (200k)

200K

A faster, more cost-effective version of o3, released in January 2025. Offers strong reasoning, coding, and vision capabilities. Optimized for math and coding tasks.

Mistral AI

Mistral Saba (32k)

32K

A powerful and efficient model from Mistral AI for languages from the Middle East and South Asia.

xAI

xAI Grok 3 (128k)

128K

xAI's flagship large language model with strong reasoning, coding, and math capabilities. Trained on the Colossus supercluster.

xAI

xAI Grok 3 Mini (128k)

128K

xAI's lightweight reasoning model with think mode. Faster and more cost-efficient than Grok 3 while maintaining strong reasoning capabilities.

January 2025
DeepSeek

DeepSeek 3.1 (128k)

128K

DeepSeek's enhanced model with improved reasoning capabilities, expanded context window, and even more competitive pricing.

Mistral AI

Mistral Codestral 2 (256k)

256K

Mistral AI's cutting-edge language model for coding (second version). Specializes in low-latency, high-frequency tasks like fill-in-the-middle (FIM), code correction, and test generation.

DeepSeek

DeepSeek R1 (128k)

128K

DeepSeek's reasoning model trained with reinforcement learning. Excels at math, coding, and complex reasoning tasks with transparent chain-of-thought.

December 2024
Google

Google Gemini 2.0 Flash (1M)

1M

Google's latest experimental model with breakthrough multimodal capabilities and enhanced reasoning at an extremely competitive price point.

Meta

Meta Llama 3.3 70B Instruct

128K

Meta's latest 70B parameter model with improved performance and capabilities, offering state-of-the-art results for its size.

DeepSeek

DeepSeek V3 (64k)

64K

DeepSeek's latest model with strong performance across reasoning, coding, and general tasks at competitive pricing.

OpenAI

OpenAI o1 (200k)

200K

Previous full o-series reasoning model (text+image in, text out).

OpenAI

OpenAI o1-pro (200k)

200K

Higher-compute variant of o1 for better responses.

Microsoft

Microsoft Phi-4 (16k)

16K

Microsoft's small language model with 14B parameters that punches well above its weight. Excels at STEM reasoning and coding despite its compact size. Open source under MIT license.

Amazon

Amazon Nova Pro (300k)

300K

Amazon's highly capable multimodal model balancing accuracy, speed, and cost. Processes text, images, and video inputs for a wide range of enterprise tasks.

Amazon

Amazon Nova Lite (300k)

300K

Amazon's very low-cost multimodal model for high-volume tasks. Processes text, images, and video at extremely competitive pricing via Amazon Bedrock.

OpenAI

OpenAI Sora Turbo

N/A

Advanced text-to-video generation with realistic motion and scene understanding. Pricing per video generation varies by length and resolution.

October 2024
Anthropic

Anthropic Claude 3.5 Sonnet (200k)

200K

Anthropic's most advanced model, significantly improved over Claude 3 Sonnet with enhanced reasoning, coding, and vision capabilities.

Anthropic

Anthropic Claude 3.5 Haiku (Deprecated, 200k)

200K

Claude 3.5 Haiku snapshot. Deprecated; recommended replacement is Haiku 4.5.

September 2024
OpenAI

OpenAI o1-preview (Deprecated, 128k)

128K

Deprecated preview snapshot of OpenAI's first o-series reasoning model. Kept for backwards compatibility; prefer o1 for production.

OpenAI

OpenAI o1-mini (128k)

128K

Smaller, faster o-series reasoning model. Deprecated in favor of newer reasoning models, but still supported in legacy workflows.

Meta

Meta Llama 3.2 90B Vision Instruct

128K

Meta's multimodal model combining text and vision capabilities with strong performance across various tasks.

Meta

Meta Llama 3.2 11B Vision Instruct

128K

A smaller, efficient multimodal model from Meta with vision capabilities, suitable for edge deployment and cost-sensitive applications.

Mistral AI

Mistral Small (32k)

32K

Mistral AI's cost-effective model for straightforward tasks, offering good performance and efficiency.

Alibaba

Qwen 2.5 72B Instruct

32K

Alibaba's large language model with strong performance in reasoning, coding, and multilingual tasks.

August 2024
AI21 Labs

AI21 Jamba 1.5 Large (256k)

256K

AI21's large hybrid SSM-Transformer model with a 256K context window. Uses a novel Jamba architecture combining Mamba SSM layers with Transformer attention for efficient long-context processing.

July 2024
OpenAI

OpenAI GPT-4o mini (128k)

128K

OpenAI's most affordable and fastest model in the GPT-4o family, designed for high-volume, low-latency tasks.

Meta

Meta Llama 3.1 405B Instruct

128K

Meta's largest and most capable Llama 3.1 model, designed for complex reasoning, coding, and nuanced instruction following.

Meta

Meta Llama 3.1 70B Instruct

128K

A large instruction-tuned model from Meta's Llama 3.1 series, offering a strong balance of performance and efficiency for a wide range of tasks.

Meta

Meta Llama 3.1 8B Instruct

128K

A highly efficient instruction-tuned model from Meta's Llama 3.1 series, suitable for fast, on-device, or edge applications.

Mistral AI

Mistral Large 2 (128k)

128K

Mistral AI's flagship model with enhanced reasoning, coding, and multilingual capabilities.

June 2024
Runway

Runway Gen-3 Alpha

N/A

Professional video generation and editing AI with motion control and style consistency. Subscription-based pricing.

May 2024
OpenAI

OpenAI GPT-4o (128k)

128K

OpenAI's flagship multimodal model, natively processing text, audio, and images for faster, more capable interactions.

Google

Google Gemini 1.5 Flash (1M)

1M

Google's faster and lower-cost version of Gemini 1.5 Pro, optimized for high-volume, high-frequency tasks while retaining a large context window and multimodal capabilities.

Mistral AI

Mistral Codestral (22B)

32K

Mistral AI's open-weight generative model specialized for code generation, supporting 80+ languages.

April 2024
OpenAI

OpenAI GPT-4 Turbo (128k)

128K

OpenAI's powerful model prior to GPT-4o, with a large context window and strong performance on complex tasks. Supports vision.

Cohere

Cohere Command R+ (128k)

128K

Cohere's most powerful model optimized for enterprise RAG and tool use. Excels at grounded generation with citations and multi-step tool workflows.

March 2024
Anthropic

Anthropic Claude 3 Opus (200k)

200K

Claude 3 Opus (deprecated): the previous highest-intelligence Claude 3 model.

Anthropic

Anthropic Claude 3 Sonnet (200k)

200K

A balanced model from Anthropic, offering a blend of intelligence and speed, ideal for enterprise workloads and scaled AI deployments.

Anthropic

Anthropic Claude 3 Haiku (200k)

200K

Anthropic's fastest and most compact model, designed for near-instant responsiveness and high throughput tasks.

February 2024
Google

Google Gemini 1.5 Pro (2M)

2M

Google's highly capable multimodal model with a breakthrough long context window of up to 2 million tokens. Excels at complex reasoning, problem-solving, and understanding long-form content.

November 2023
Leonardo.AI

Leonardo AI

N/A

Game asset and creative image generation with consistent character and style creation. Credit-based pricing system.

October 2023
OpenAI

OpenAI DALL-E 3

N/A

State-of-the-art text-to-image generation with improved prompt following and image quality. Pricing per image: HD 1024×1024 $0.040, Standard $0.020

July 2023
Stability AI

Stable Diffusion XL

N/A

Open-source image generation model with fine-tuning capabilities. API pricing varies by provider, self-hosting available.

March 2023
OpenAI

OpenAI GPT-3.5 Turbo (16k)

16K

OpenAI's fast, cost-effective model optimized for chat and simple tasks.

Total Models: 118
Months Tracked: 31
Providers: 20
Latest Update: Unknown
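
The page does not say how the summary counts above are computed. One reading that is consistent with the listing is to count every entry (including repeated model names), every distinct group heading (including "Unknown"), and every distinct provider. The sketch below illustrates that reading on a small sample copied from the timeline; the `ENTRIES` structure and the `summarize` helper are hypothetical, not part of the site.

```python
# Hypothetical sample: (group heading, provider, model) for a few entries
# copied from the timeline. The tuple layout is an assumption for illustration.
ENTRIES = [
    ("Unknown", "OpenAI", "OpenAI o3-pro (200k)"),
    ("April 2026", "OpenAI", "OpenAI GPT-5.5"),
    ("April 2026", "Anthropic", "Anthropic Claude Opus 4.7"),
    ("March 2026", "Google", "Gemini 3.1 Pro"),
    ("March 2026", "xAI", "xAI Grok 3.5 (256k)"),
]


def summarize(entries):
    """Count entries, distinct group headings, and distinct providers."""
    groups = {group for group, _, _ in entries}
    providers = {provider for _, provider, _ in entries}
    return {
        "total_models": len(entries),     # every listed entry counts once
        "months_tracked": len(groups),    # "Unknown" is treated as its own group
        "providers": len(providers),
    }


summary = summarize(ENTRIES)
print(summary)
```

Whether the "Unknown" group should count toward months tracked is a judgment call; dropping it from `groups` would lower that figure by one.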