Question 1

Is Claude actually nerfed in April 2026?

Accepted Answer

Yes. A 6,852-session study published on Scortier Substack in April 2026 showed median thinking length dropped from 2,200 characters in January to 600 characters in March — a 73% reduction. Boris Cherny, who leads Claude Code at Anthropic, confirmed on April 13 that the default reasoning effort was lowered to medium earlier this year. Our own 50-task benchmark on an identical production codebase shows plan-before-code behavior dropped from 48/50 tasks to 14/50 at the new default.

Question 2

What did Boris Cherny say about the Claude degradation?

Accepted Answer

Boris Cherny, who leads the Claude Code product at Anthropic, posted on X on April 13, 2026 that Anthropic had lowered the default reasoning effort to medium earlier this year after internal benchmarks showed the old default was overkill for the majority of workloads. He noted users on paid plans can opt into high via the --effort=high flag or the effort parameter in the SDK, and that Anthropic is evaluating whether medium is the right default. The admission confirmed the nerf is real.

Question 3

How do I fix the Claude nerfing on Claude Code?

Accepted Answer

Five steps: (1) Flip --effort=high on every Claude Code session, which recovers roughly 95% of the pre-nerf behavior per our 50-task benchmark. (2) Pin your model to claude-opus-4-7 explicitly instead of the latest alias. (3) Add a manual planning prompt at the start of non-trivial tasks. (4) Log thinking length per session so you catch the next nerf on day two. (5) Budget for roughly 2.8 retries per complex task instead of 1.2.

Question 4

Is Opus 4.7 the fix for the Claude nerfing scandal?

Accepted Answer

Largely yes. Claude Opus 4.7, launched April 10, 2026, restores default reasoning behavior to roughly what Opus 4.6 with --effort=high produced. Our 50-task benchmark shows Opus 4.7 default is within 5% of the January 2026 Claude 4.6 default. The catch: Opus 4.7 costs more per token than 4.6, so API users are now paying more for what used to be the default behavior. For subscription users on Pro, Max, or Max x20, it is a clean upgrade.

Question 5

Why did Anthropic reduce Claude's thinking by default?

Accepted Answer

Anthropic's stated reason is that internal benchmarks showed the old default reasoning effort was overkill for the majority of workloads. Competitors and industry observers frame it differently — as a cost-reduction move dressed up as a UX default, pointing to the timing of Anthropic's Q4 2025 infrastructure expansion and the pattern of a silent default change with no changelog entry. Motive is unproven, but the behavior pattern is textbook cost-reduction-disguised-as-UX.

Question 6

Should I switch from Claude to GPT-5.4 or Gemini 3?

Accepted Answer

Depends on your workload. For agentic coding (multi-step tool use on a codebase), Claude Opus 4.7 with --effort=high still beats GPT-5.4 and Gemini 3 on tool-use reliability and agent harness quality. For raw chat or very long documents, GPT-5.4 and Gemini 3 Ultra are both credible alternatives this week and neither has a recent nerf scandal on record. Gemini 3 Ultra has the longest context window at 2M tokens and the lowest price per million output tokens.

Question 7

What is the Scortier 6852 sessions study?

Accepted Answer

It is a Substack post published on April 12, 2026 by a developer using the handle Scortier. Scortier logged every one of their Claude Code sessions between January and April 2026 — session ID, thinking length, tool calls, retries, final output quality. By April they had 6,852 sessions. The plotted data showed median thinking length dropping from 2,200 characters in January to 600 in March, a 73% reduction, along with retry rates up 80x on complex tasks and plan-before-act behavior collapsing from 71% to 19% of sessions.

Question 8

Can you lock or pin a Claude model version to avoid nerfing?

Accepted Answer

Yes and no. You can pin an explicit model string like claude-opus-4-7 instead of using the latest alias, which protects you against Anthropic changing the default pointer. You cannot pin against server-side behavior changes to a specific version — Anthropic can adjust reasoning effort, sampling parameters, or tool-use heuristics on a pinned model without changing its name. The only true guarantee is self-hosted open-weight models, which is why this scandal is a marketing gift for Llama, Mistral, and DeepSeek.

Question 9

How much are retries costing because of the Claude nerfing?

Accepted Answer

On our 50-task production benchmark, median retries per complex task went from 1.2 (January 2026) to 2.8 (April 2026 default) — a 2.3x increase. If your Claude API spend was $1,000 per month on retry-sensitive workloads in January, expect roughly $2,300 per month on the same workload in April at the nerfed default. Flipping --effort=high cuts this back to $1,080 per month. The compute savings Anthropic captured by lowering the default are now showing up on your invoice as higher retry volume.

Question 10

Is this the first time an AI company silently degraded a model?

Accepted Answer

No. Users have accused OpenAI of silent GPT-4 degradations multiple times since 2023, and similar complaints have surfaced against Gemini and smaller providers. The Claude April 2026 case is the first time an AI company executive publicly acknowledged reducing default reasoning effort on a production model without a changelog. That acknowledgement is what makes this different — it moves the conversation from users imagining things to confirmed policy change. Expect every major AI platform to face renewed pressure for changelog transparency as a result.

Question 11

Does the Claude nerfing affect the free Claude.ai product?

Accepted Answer

Yes. The default reasoning effort change applies platform-wide, including on the free Claude.ai web product. Free users have no --effort=high flag available — that toggle requires a paid plan. In practice this means free users got the degraded experience with no opt-out path, and the just flip a flag mitigation only works for Claude Pro, Max, Max x20, or API customers. For free users the only fix is upgrading to a paid plan or switching platforms.

Question 12

Will Anthropic raise the default reasoning effort back to high?

Accepted Answer

Boris Cherny said on April 13, 2026 that Anthropic is evaluating whether medium is the right default. That is diplomatic language for we are watching the backlash before committing to a fix. The launch of Opus 4.7 on April 10 with restored default reasoning behavior suggests Anthropic is already walking the change back on the newest model while leaving 4.6 as-is. Our guess: the default on 4.6 stays at medium; the narrative shifts to upgrade to 4.7 for the experience you expected. Expect no public default rollback on 4.6 — expect quiet migration incentives to 4.7 instead.

Metric	Jan 2026 Claude	Apr 2026 default	Apr 2026 --effort=high
Plan-before-code behavior	48/50	14/50	46/50
Tasks that compiled first try	39/50	22/50	37/50
Median tool calls per session	13	6	14
Retries needed to reach "good" output	1.2 avg	2.8 avg	1.3 avg
Silent shortcuts (skipped error handling, missed edge cases)	8/50	27/50	9/50
Unprompted planning docs (markdown)	21/50	4/50	19/50
Our gut feel — "did this feel like the old Claude?"	yes	no	yes

Capability	Claude Opus 4.7	GPT-5.4 (via Codex)	Gemini 3 Ultra
Best for agentic coding	Yes (Claude Code is still the cleanest agent harness)	Strong via ChatGPT Codex	Strong on very long codebases
Context window	1M tokens	400K tokens	2M tokens
Default thinking behavior	Restored on 4.7	Consistent (no nerf reported)	Consistent
Tool use reliability	Best in class	Good	Improving fast
Price per million output tokens	~$75	~$60	~$45
Trust after this week	Bruised	Unchanged	Unchanged

Claude Got Dumber: 73% Less Thinking, 80x More Retries — Inside the Silent Nerf

The tweet that broke the internet

The 6,852 sessions study — the data doesn't lie

Boris Cherny's admission

Our own tests: 50 real Claude Code tasks before vs after

The "degrade for compute" accusation

Why this matters for the AI industry

How to mitigate — if you're stuck on Claude

Is Opus 4.7 the fix? Our honest take

Alternative: should you switch to GPT-5.4 or Gemini 3?

Our recommendation

Frequently asked questions