Anthropic's April 2026 flagship. Drops the long-context premium SKU: the 1M-token context window is now the default tier. Median latency on long-context requests is ~30% lower than 4.6, long-context retrieval is materially better (96.4% at 1M tokens vs. 91% for 4.6), and a new `thinking_budget_tokens` parameter gives explicit cost control over extended reasoning. Released alongside the easing of the March 2026 peak-hour Pro/Max throttle.