⚡

Technology & InnovationNeutral

95% confidence

Claude Opus 4.8 Excels at Coding, Stumbles in Creativity

Anthropic's Claude Opus 4.8 shows major gains in coding and math tests, fixing bugs autonomously, but creative writing remains flat and token consumption is extreme—one prompt drained a Pro plan quota. The review notes the model's strengths are sharper, but limitations hamper practical use.

Jun 7, 2026, 1:01 PM UTCDecryptJose Antonio Lanz

Quick Take

Coding now top-tier: autonomous bug fixing and good multi-shot improvement

Creative writing barely improved over 4.7, missing narrative fluidity

High token usage makes large projects impractical on Pro plan

Market Impact Analysis

Neutral

The article is a review of an AI model with no direct implications for cryptocurrency markets.

Timeframeshort

Speculation Analysis

Factuality75/100

RumorsVerified

Speculation Trigger5/100

MinimalExtreme FOMO

Key Takeaways

Claude Opus 4.8 delivers significant gains in coding and math, excelling at autonomous bug fixing and producing a polished zombie game in a single prompt.
Creative writing performance stalled, with prose that lacks the narrative fluidity and imagination seen in rival models like MiMo v2.5.
A single coding prompt drained an entire Pro plan token quota, making the model impractical for large projects without upgrading to Max or heavy API spend.
Token pricing remains unchanged at $5 per million input and $25 per million output, but excessive consumption sharply increases real-world costs.

Input Token Cost $5 per million tokens

Output Token Cost $25 per million tokens

Release Cycle 6 weeks after Opus 4.7

Token Quota Drain 1 prompt drained Pro plan

What Happened

Six weeks after Opus 4.7, Anthropic shipped Claude Opus 4.8 with higher safety scores and unchanged per-token pricing. The model showed clear improvement in math and coding benchmarks—producing the cleanest one-prompt game design ever seen from an Anthropic model—but creative writing scores barely moved. In a head-to-head prompt asking for a time-travel story anchored to the writer’s cultural background, Opus 4.8 set the scene in the Orinoco delta in the year 1000. The narrative paradox was structurally sound, yet the prose felt descriptive rather than alive, trailing the fluency of MiMo v2.5. Most critically, a single coding prompt consumed the entire token quota of a Pro subscription, rendering the model unusable for substantial projects without a higher-tier plan or API access.

The Numbers

Token prices stayed at $5 per million input and $25 per million output. The coding test—a zombie-typing game called Typing Dead—showed strong graphics and design, but the prompt used so many tokens it instantly hit Pro plan limits. Creative writing evaluation saw Opus 4.8 match Opus 4.7 on a default pass, with no measurable advance. In math and multi-shot reasoning, however, Opus 4.8 demonstrated clearly superior error correction and task completion, confirming the model’s technical edge has been sharpened.

Why It Happened

Anthropic’s rapid-release cadence prioritized safety and technical capability, leaving creative fluency and token efficiency on the back burner. Opus 4.8’s enhanced reasoning likely requires deeper internal processing, bloating output length for complex tasks. The model excels when given multiple attempts or guided thinking, but its default one-shot performance—especially in creative domains—remains flat. This trade-off favors developers and researchers with Max subscriptions or API keys, while Pro users face a hard ceiling on practical use.

Broader Impact

The token gluttony sets up a tension between advancing capability and real-world usability. Competitors like MiMo v2.5 already offer more balanced creative and technical performance at lower effective cost. If Anthropic can’t optimize token consumption soon, it risks losing prosumer users to leaner rivals, potentially forcing pricing adjustments or efficiency patches in future iterations.

What to Watch Next

Look for updates that cut token usage per request, which could dramatically improve Pro plan utility without a price change.
Track whether Anthropic introduces a mid-tier pricing adjustment or efficiency-focused release cycle.
Monitor creative writing benchmarks: if the next version stays flat, it will signal a strategic decision to deprioritize narrative ability.

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt

Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

Claude Opus 4.8 Excels at Coding, Stumbles in Creativity

Quick Take

Market Impact Analysis

Speculation Analysis

Key Takeaways

What Happened

The Numbers

Why It Happened

Broader Impact

What to Watch Next

Always late to trends?

TAGS

Read Next

KelpDAO $292M Exploit Triggers Aave Bank Run, DeFi in Crisis

Ethereum Risks $1.5K Drop from Vitalik's ETH Sales

Most Read

Bitcoin ETF Outflows Surge as Institutions Flip Bearish at $60K

NYDIG: AI, IPOs, Quantum, Strategy Sale Hit Bitcoin

Hayes Dumps HYPE, Warns of Wall Street Threat to Hyperliquid

Abra Plans Nasdaq Listing as Tokenization Hype Builds

Bitcoin Bear Bottom Elusive as Realized Losses Lag 2022 Record

Claude Opus 4.8 Excels at Coding, Stumbles in Creativity

Ethereum Foundation Restructuring Is Evolution, Not Crisis: Lubin

Platform

Company

Legal