Claude Opus 4.8 Release: Benchmarks and Fast Mode

Q: What about dynamic Workflows: one conversation, two modes?

The marquee feature is Dynamic Workflows, which lets Opus 4.8 switch between quick responses and deep reasoning inside a single conversation. Earlier models forced users to choose a reasoning mode up front; 4.8 decides per turn. For developers, that means a fast answer to a syntax question and an extended reasoning pass on an architecture problem — without restarting the session or swapping models. The friction of mode-switching, a quiet tax on every long coding session, largely disappears.

Q: What about fast Mode cuts cost by 3x?

A new Fast Mode offers Claude Sonnet-level speed at roughly one-third the cost for simpler tasks. It is a direct answer to the criticism that Anthropic’s frontier models are priced for premium workloads only — a complaint Microsoft’s AI chief made publicly. By routing easy queries to a cheaper tier inside the same model family, Anthropic keeps high-volume usage on Claude rather than ceding it to lower-cost rivals. The pricing move matters more than any single benchmark for teams ru

Q: What about 4% of every public GitHub commit?

Anthropic disclosed that Claude Code now authors 4% of all public GitHub commits worldwide, double the figure from a month earlier. More than half of Claude Code revenue comes from enterprise customers. That penetration helps explain both the release pace and the valuation behind it — Anthropic recently raised at a $965 billion valuation. When a coding tool writes a measurable share of the world’s code, faster model shipping is a revenue strategy, not a vanity metric.

Anthropic released Claude Opus 4.8 on May 28, 2026, just 41 days after Opus 4.7. The company calls it its “most honest model ever,” and the benchmarks back a second claim: it has reclaimed the lead on coding.

The 41-day cadence is itself the headline. Frontier labs are now shipping flagship upgrades on a roughly six-week clock, and the gap between releases has become the real competitive battleground.

The benchmarks that matter

Opus 4.8 posted gains on the two benchmarks closest to real software work. A single-release jump of this size is unusual for a point upgrade.

Benchmark	Opus 4.7	Opus 4.8	Change
Terminal-Bench 2.1	65.8%	74.2%	+8.4 pts
SWE-bench Pro	baseline	improved	+4.9%

The coding lead is the strategic prize. Software development is the highest-revenue use case for frontier models, and benchmark leadership translates directly into enterprise seat sales.

Dynamic Workflows: one conversation, two modes

The marquee feature is Dynamic Workflows, which lets Opus 4.8 switch between quick responses and deep reasoning inside a single conversation. Earlier models forced users to choose a reasoning mode up front; 4.8 decides per turn.

For developers, that means a fast answer to a syntax question and an extended reasoning pass on an architecture problem — without restarting the session or swapping models. The friction of mode-switching, a quiet tax on every long coding session, largely disappears.

Fast Mode cuts cost by 3x

A new Fast Mode offers Claude Sonnet-level speed at roughly one-third the cost for simpler tasks. It is a direct answer to the criticism that Anthropic’s frontier models are priced for premium workloads only — a complaint Microsoft’s AI chief made publicly.

By routing easy queries to a cheaper tier inside the same model family, Anthropic keeps high-volume usage on Claude rather than ceding it to lower-cost rivals. The pricing move matters more than any single benchmark for teams running millions of calls a month.

What “most honest” actually means

The “most honest model” label is technical, not marketing. Anthropic optimized 4.8 to detect and refuse erroneous prompts rather than confidently produce a wrong answer. In practice, the model is tuned to say “I can’t verify that” instead of fabricating a plausible response.

For enterprise deployments, refusal-over-fabrication is a feature. A model that flags its own uncertainty is easier to trust in workflows where a confident error costs more than a hedge — legal review, medical triage, financial reporting.

4% of every public GitHub commit

Anthropic disclosed that Claude Code now authors 4% of all public GitHub commits worldwide, double the figure from a month earlier. More than half of Claude Code revenue comes from enterprise customers.

That penetration helps explain both the release pace and the valuation behind it — Anthropic recently raised at a $965 billion valuation. When a coding tool writes a measurable share of the world’s code, faster model shipping is a revenue strategy, not a vanity metric.

The takeaway for teams: Fast Mode changes the cost calculus enough to re-test Claude on workloads previously routed elsewhere for price reasons.

Claude Opus 4.8 Ships: Anthropic’s ‘Most Honest Model’ Reclaims Coding

The benchmarks that matter

Dynamic Workflows: one conversation, two modes

Fast Mode cuts cost by 3x

What “most honest” actually means

4% of every public GitHub commit

Enjoyed this story?

Claude Opus 4.8 Ships: Anthropic’s ‘Most Honest Model’ Reclaims Coding

The benchmarks that matter

Dynamic Workflows: one conversation, two modes

Fast Mode cuts cost by 3x

What “most honest” actually means

4% of every public GitHub commit

Enjoyed this story?

Claude Sonnet 5 Just Launched at $2/$10 — The Price That Ends the AI Budget Crisis [Migration Guide]

China’s Kimi K3 Just Took #1 on the Coding Leaderboard — 2.8 Trillion Parameters, Free, and It Beats Claude in 76% of Matchups

ChatGPT Now Rewrites Its Own Memories While You Sleep: Inside Dreaming V3