
DeepSeek V4-Pro Undercuts GPT-5.5 by 98% on Token Cost

DeepSeek released V4-Pro, a 1.6T parameter open-weight model costing 98% less than GPT-5.5 Pro. Trained partly on Huawei chips, it circumvents U.S. export bans and introduces novel attention mechanisms, intensifying price pressure on Western AI labs.

Decrypt · Jose Antonio Lanz

Quick Take

1. V4-Pro costs $1.74 (input) and $3.48 (output) per million tokens; the input price is 98% below GPT-5.5 Pro.
2. The model has 1.6T total parameters and uses a mixture-of-experts design for efficiency.
3. Trained on Huawei Ascend chips despite U.S. export restrictions.
4. The open-weight release could trigger further AI pricing disruption.

Market Impact Analysis

Neutral

No direct crypto market impact; event is peripheral to digital assets, though AI-cost disruption could indirectly affect AI-related tokens.

Timeframe: long

Speculation Analysis

Factuality: 93/100 (scale: Rumors to Verified)
Speculation Trigger: 15/100 (scale: Minimal to Extreme FOMO)

Key Takeaways

  • DeepSeek V4-Pro, a 1.6T-parameter model, costs $1.74 per million input tokens, 98% less than OpenAI’s GPT-5.5 Pro.
  • The open-weight model uses a mixture-of-experts architecture that activates only 49 billion parameters per inference, which keeps costs down.
  • Trained on Huawei Ascend chips, the launch sidesteps U.S. export restrictions, signaling China’s growing AI hardware independence.
  • V4-Flash, a smaller sibling, has 284 billion total parameters, 13 billion of them active per inference, and offers similar reasoning at even lower cost.
  • Further price drops loom: DeepSeek plans to add 950 supernodes later in 2026.

Total Parameters: 1.6 trillion (mixture-of-experts)
Active Parameters: 49 billion per inference
Input Token Cost: $1.74 per million (98% below GPT-5.5 Pro)
Context Window: 1 million tokens (~750K words)

What Happened

DeepSeek released V4-Pro, an open-weight AI model with 1.6 trillion parameters, on Friday, hours after OpenAI’s GPT-5.5 launch. The model costs $1.74 per million input tokens—98% less than GPT-5.5 Pro—and $3.48 per million output tokens. A smaller version, V4-Flash, packs 284 billion parameters and activates just 13 billion per inference. Both models offer a 1-million-token context window and are available on Hugging Face. The launch intensifies price competition with Western AI firms and demonstrates that cutting-edge performance can be achieved using domestic Chinese hardware, sidestepping U.S. chip export bans.

The Numbers

V4-Pro’s pricing drastically undercuts Western models: $1.74 input and $3.48 output per million tokens. That’s roughly one-twentieth the cost of comparable models like Claude Opus 4.7 and 98% below GPT-5.5 Pro. The model contains 1.6 trillion total parameters but uses a mixture-of-experts design, activating only 49 billion per inference pass. This keeps compute costs low while maintaining performance. A 1-million-token context window—equivalent to about 750,000 words—enables processing of entire books in one go. DeepSeek plans to further reduce prices once 950 new supernodes come online later in 2026.
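
The pricing arithmetic can be checked with a short Python sketch. The two V4-Pro prices come from the article; the function names, the 2,000-token reply, and the GPT-5.5 Pro figure are illustrative assumptions. In particular, the reference price is only what the 98% figure would imply, not a quoted OpenAI price.

```python
# Prices quoted in the article, in USD per million tokens.
V4_PRO_INPUT_USD_PER_M = 1.74
V4_PRO_OUTPUT_USD_PER_M = 3.48


def request_cost(input_tokens, output_tokens,
                 input_price=V4_PRO_INPUT_USD_PER_M,
                 output_price=V4_PRO_OUTPUT_USD_PER_M):
    """Cost in USD of one request at per-million-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def implied_reference_price(price, discount):
    """Reference price implied by 'price is `discount` below the reference'."""
    if not 0 <= discount < 1:
        raise ValueError("discount must be in [0, 1)")
    return price / (1 - discount)


if __name__ == "__main__":
    # Filling the 1M-token context window once, with a hypothetical 2,000-token reply.
    print(f"${request_cost(1_000_000, 2_000):.4f}")
    # Input price the 98% figure would imply for GPT-5.5 Pro (derived, not quoted).
    print(f"${implied_reference_price(V4_PRO_INPUT_USD_PER_M, 0.98):.2f}")
```

Taken at face value, a 98% discount on a $1.74 input price implies a comparison price of about $87 per million input tokens. Treat that as a consistency check on the article's figures, not as a published rate.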

Why It Happened

DeepSeek’s cost breakthrough stems from two factors: architectural efficiency, since its mixture-of-experts model activates only a fraction of its parameters per request, and the use of Huawei Ascend chips, which bypass U.S. export restrictions. By training on domestic hardware, DeepSeek avoids the inflated costs of export-restricted Nvidia GPUs. The open-weight release strategy accelerates adoption and forces incumbents to compete on price. This isn’t DeepSeek’s first disruption: its R1 model in early 2025 erased roughly $600 billion from Nvidia’s market cap in a day, suggesting that efficient design can challenge GPU-heavy approaches.
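
To make "activates only a fraction of parameters" concrete, here is a minimal Python sketch. The parameter counts are the article's. The router is a generic top-k gating illustration, not DeepSeek's actual mechanism: the article gives no expert count, routing rule, or value of k, so `top_k_route` and its inputs are hypothetical.

```python
import math


def active_fraction(active_params, total_params):
    """Share of a model's parameters used in a single inference pass."""
    return active_params / total_params


def top_k_route(gate_scores, k):
    """Generic top-k gating: keep the k highest-scoring experts and
    softmax-normalize their scores into mixing weights.
    Illustrative only; not taken from DeepSeek's design."""
    top = sorted(range(len(gate_scores)),
                 key=lambda i: gate_scores[i], reverse=True)[:k]
    peak = max(gate_scores[i] for i in top)  # subtract the max for stability
    exps = {i: math.exp(gate_scores[i] - peak) for i in top}
    total = sum(exps.values())
    return {i: exps[i] / total for i in top}


if __name__ == "__main__":
    # Figures from the article: 49B of 1.6T (V4-Pro), 13B of 284B (V4-Flash).
    print(f"V4-Pro active share:   {active_fraction(49e9, 1.6e12):.2%}")
    print(f"V4-Flash active share: {active_fraction(13e9, 284e9):.2%}")
    # Hypothetical gate scores for four experts; only two are selected.
    print(top_k_route([0.1, 2.0, -1.0, 1.5], k=2))
```

On the article's numbers, V4-Pro touches about 3% of its weights per pass and V4-Flash under 5%, which is why per-token compute tracks the active count and not the 1.6 trillion total.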

Broader Impact

While not directly tied to crypto, the AI pricing war could boost blockchain projects that integrate AI by lowering infrastructure costs. Cheaper models may spur decentralized compute networks and AI-focused tokens. However, incumbents like OpenAI and Anthropic face margin compression, possibly slowing their pace of innovation. The use of non-sanctioned hardware also reshapes global AI supply chains, with geopolitical implications for the semiconductor industry.

What to Watch Next

  • DeepSeek’s promised supernode expansion may cut token costs further, triggering a broader industry price war.
  • Western AI labs could respond with open-weight releases or aggressive price cuts, reshaping enterprise AI deals.
  • U.S. policymakers may tighten chip export controls if Huawei Ascend proves a viable path, escalating tech tensions.

Source: Decrypt

This article is for informational purposes only and does not constitute financial advice.


Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

© 2026 Bytewit. All Rights Reserved.
