Technology & InnovationNeutral
33

Claude Opus 4.8 Excels at Coding, Stumbles in Creativity

Anthropic's Claude Opus 4.8 shows major gains in coding and math tests, fixing bugs autonomously, but creative writing remains flat and token consumption is extreme—one prompt drained a Pro plan quota. The review notes the model's strengths are sharper, but limitations hamper practical use.

DecryptJose Antonio Lanz

Quick Take

1

Coding now top-tier: autonomous bug fixing and good multi-shot improvement

2

Creative writing barely improved over 4.7, missing narrative fluidity

3

High token usage makes large projects impractical on Pro plan

Market Impact Analysis

Neutral

The article is a review of an AI model with no direct implications for cryptocurrency markets.

Timeframeshort

Speculation Analysis

Factuality75/100
RumorsVerified
Speculation Trigger5/100
MinimalExtreme FOMO

Key Takeaways

  • Claude Opus 4.8 delivers significant gains in coding and math, excelling at autonomous bug fixing and producing a polished zombie game in a single prompt.
  • Creative writing performance stalled, with prose that lacks the narrative fluidity and imagination seen in rival models like MiMo v2.5.
  • A single coding prompt drained an entire Pro plan token quota, making the model impractical for large projects without upgrading to Max or heavy API spend.
  • Token pricing remains unchanged at $5 per million input and $25 per million output, but excessive consumption sharply increases real-world costs.
Input Token Cost $5 per million tokens
Output Token Cost $25 per million tokens
Release Cycle 6 weeks after Opus 4.7
Token Quota Drain 1 prompt drained Pro plan

What Happened

Six weeks after Opus 4.7, Anthropic shipped Claude Opus 4.8 with higher safety scores and unchanged per-token pricing. The model showed clear improvement in math and coding benchmarks—producing the cleanest one-prompt game design ever seen from an Anthropic model—but creative writing scores barely moved. In a head-to-head prompt asking for a time-travel story anchored to the writer’s cultural background, Opus 4.8 set the scene in the Orinoco delta in the year 1000. The narrative paradox was structurally sound, yet the prose felt descriptive rather than alive, trailing the fluency of MiMo v2.5. Most critically, a single coding prompt consumed the entire token quota of a Pro subscription, rendering the model unusable for substantial projects without a higher-tier plan or API access.

The Numbers

Token prices stayed at $5 per million input and $25 per million output. The coding test—a zombie-typing game called Typing Dead—showed strong graphics and design, but the prompt used so many tokens it instantly hit Pro plan limits. Creative writing evaluation saw Opus 4.8 match Opus 4.7 on a default pass, with no measurable advance. In math and multi-shot reasoning, however, Opus 4.8 demonstrated clearly superior error correction and task completion, confirming the model’s technical edge has been sharpened.

Why It Happened

Anthropic’s rapid-release cadence prioritized safety and technical capability, leaving creative fluency and token efficiency on the back burner. Opus 4.8’s enhanced reasoning likely requires deeper internal processing, bloating output length for complex tasks. The model excels when given multiple attempts or guided thinking, but its default one-shot performance—especially in creative domains—remains flat. This trade-off favors developers and researchers with Max subscriptions or API keys, while Pro users face a hard ceiling on practical use.

Broader Impact

The token gluttony sets up a tension between advancing capability and real-world usability. Competitors like MiMo v2.5 already offer more balanced creative and technical performance at lower effective cost. If Anthropic can’t optimize token consumption soon, it risks losing prosumer users to leaner rivals, potentially forcing pricing adjustments or efficiency patches in future iterations.

What to Watch Next

  • Look for updates that cut token usage per request, which could dramatically improve Pro plan utility without a price change.
  • Track whether Anthropic introduces a mid-tier pricing adjustment or efficiency-focused release cycle.
  • Monitor creative writing benchmarks: if the next version stays flat, it will signal a strategic decision to deprioritize narrative ability.

Source: Decrypt

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt
Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

© 2026 Bytewit. All Rights Reserved. This article is for informational purposes only.

Read Next

Most Read

🏛️
Market AnalysisBearish
79

Bitcoin ETF Outflows Surge as Institutions Flip Bearish at $60K

Bitcoin hovers near $60,000 but this time institutional investors are aggressively selling, with spot ETFs recording a record $1.72 billion in weekly outflows. In February, when BTC hit similar levels, outflows slowed, signaling a bullish dip-buying sentiment. Now, accelerating outflows suggest a fundamental shift in market dynamics.

BTC
90% confidence
Jun 7, 2026, 4:14 PM UTC · CoinDesk
Claude Opus 4.8 Boosts Coding, Drains Token Quota | Bytewit