Tracking quality regressions and performance changes across major AI models. Reports are based on community observations, benchmark comparisons, and direct testing by the TokenCalculator team.
Anthropic / claude.ai & API · Reported: March 27, 2026 · Status: Active
Starting March 27, 2026, users across Claude Free, Pro, Max, and Team plans began reporting significantly faster depletion of their session and weekly usage allowances during weekday afternoon hours. Anthropic confirmed that a peak-hour usage adjustment is in effect: during the window of 1:00 PM – 7:00 PM UTC, Monday through Friday, each session consumes the weekly allowance at an accelerated rate. Enterprise plans are not affected. Weekends are fully off-peak and unaffected.
| City / Region | Peak Window (Local) | Zone |
|---|---|---|
| New York | 9:00 AM – 3:00 PM | EDT (UTC−4) |
| San Francisco | 6:00 AM – 12:00 PM | PDT (UTC−7) |
| São Paulo | 10:00 AM – 4:00 PM | BRT (UTC−3) |
| London | 2:00 PM – 8:00 PM | BST (UTC+1) |
| Paris / Berlin | 3:00 PM – 9:00 PM | CEST (UTC+2) |
| Istanbul | 4:00 PM – 10:00 PM | TRT (UTC+3) |
| New Delhi | 6:30 PM – 12:30 AM | IST (UTC+5:30) |
| Beijing | 9:00 PM – 3:00 AM | CST (UTC+8) |
| Tokyo / Seoul | 10:00 PM – 4:00 AM | JST/KST (UTC+9) |
Weekends (Sat–Sun) are fully off-peak at all times worldwide.
More details: See our full blog post on this change — Claude Peak Hours 2026: Why Your Weekly Limit Drains Faster on Weekday Afternoons.
Google DeepMind / Vertex AI & Google AI Studio · Reported: April 2026 · Status: Under Investigation
A growing number of developers and researchers have reported that Gemini 3.1 Pro underperforms Gemini 3.0 on several key task categories despite the higher version number. The pattern — where a newer model version is subjectively worse than its predecessor on specific workloads — is sometimes called a "quality regression" and is a known phenomenon in LLM development cycles. Google has not officially acknowledged the regression.
Quality regressions in LLM updates typically stem from one of several causes: changes to RLHF (reinforcement learning from human feedback) data that inadvertently overfit for different preferences; architecture changes that improve some capabilities while degrading others; or inference-time optimizations (quantization, speculative decoding) that trade subtle quality for speed. Higher version numbers do not guarantee better performance on all tasks — they reflect a different set of trade-offs.
Note: This report is based on community observations and third-party evaluations. Google has not officially confirmed a regression. We will update this report when official information is available. Compare models directly on our models page.
About this page: Degradation reports are maintained by the TokenCalculator team based on developer community reports, benchmark comparisons, and internal testing. Reports do not constitute official statements from any AI provider. Follow @tokencalculator on X for updates.