
Technology & Innovation | Neutral

10% confidence

DeepSeek V4 Pro Undercuts GPT-5.5 by 98% on Token Cost

DeepSeek released V4-Pro, a 1.6T parameter open-weight model costing 98% less than GPT-5.5 Pro. Trained partly on Huawei chips, it circumvents U.S. export bans and introduces novel attention mechanisms, intensifying price pressure on Western AI labs.

Apr 24, 2026, 5:34 PM UTC | Decrypt | Jose Antonio Lanz

Quick Take

V4-Pro costs $1.74 (input) and $3.48 (output) per million tokens, 98% below GPT-5.5 Pro.

Model uses 1.6T parameters with mixture-of-experts for efficiency.

Trained on Huawei Ascend chips despite U.S. export restrictions.

Open-weight release could trigger further AI pricing disruption.

Market Impact Analysis

Neutral

No direct crypto market impact; event is peripheral to digital assets, though AI-cost disruption could indirectly affect AI-related tokens.

Timeframe: long

Speculation Analysis

Factuality: 93/100 (scale: Rumors to Verified)

Speculation Trigger: 15/100 (scale: Minimal to Extreme FOMO)

Key Takeaways

DeepSeek V4-Pro, a 1.6T-parameter model, costs $1.74 per million input tokens, 98% less than OpenAI’s GPT-5.5 Pro.
The open-weight model uses a mixture-of-experts architecture, activating only 49 billion parameters per inference for cost efficiency.
Trained on Huawei Ascend chips, the launch sidesteps U.S. export restrictions, signaling China’s growing AI hardware independence.
V4-Flash, a smaller sibling, has 284 billion total parameters with 13 billion active, offering similar reasoning at even lower cost.
Further price drops loom as DeepSeek plans to add 950 supernodes later in 2026, intensifying market disruption.

Total Parameters: 1.6 trillion (mixture-of-experts)

Active Parameters: 49 billion per inference

Input Token Cost: $1.74 per million (98% below GPT-5.5 Pro)

Context Window: 1 million tokens (~750K words)

What Happened

DeepSeek released V4-Pro, an open-weight AI model with 1.6 trillion parameters, on Friday, hours after OpenAI’s GPT-5.5 launch. The model costs $1.74 per million input tokens—98% less than GPT-5.5 Pro—and $3.48 per million output tokens. A smaller version, V4-Flash, packs 284 billion parameters and activates just 13 billion per inference. Both models offer a 1-million-token context window and are available on Hugging Face. The launch intensifies price competition with Western AI firms and demonstrates that cutting-edge performance can be achieved using domestic Chinese hardware, sidestepping U.S. chip export bans.

The Numbers

V4-Pro’s pricing sharply undercuts Western models: $1.74 input and $3.48 output per million tokens. That is roughly one-twentieth the cost of comparable models like Claude Opus 4.7 and 98% below GPT-5.5 Pro. The model contains 1.6 trillion total parameters but uses a mixture-of-experts design, activating only 49 billion (about 3% of the total) per inference pass. This keeps compute costs low while maintaining performance. A 1-million-token context window, equivalent to about 750,000 words, allows entire books to be processed in a single request. DeepSeek plans to further reduce prices once 950 new supernodes come online later in 2026.
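The pricing and sizing figures above reduce to simple arithmetic. The sketch below is a minimal illustration in Python that uses only numbers quoted in this article. The function names are hypothetical, and the baseline price it derives is merely what a 98% discount would imply, not a published GPT-5.5 Pro rate.

```python
# Figures quoted in the article. Prices are USD per one million tokens.
V4_PRO_INPUT_USD_PER_M = 1.74
V4_PRO_OUTPUT_USD_PER_M = 3.48
TOTAL_PARAMS = 1.6e12        # 1.6 trillion total parameters
ACTIVE_PARAMS = 49e9         # 49 billion active per inference pass
CONTEXT_TOKENS = 1_000_000   # context window size in tokens
CONTEXT_WORDS = 750_000      # the article's rough word equivalent


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost of a single request at the quoted V4-Pro rates."""
    total = (input_tokens * V4_PRO_INPUT_USD_PER_M
             + output_tokens * V4_PRO_OUTPUT_USD_PER_M)
    return total / 1_000_000


def active_fraction() -> float:
    """Share of the model's parameters used on each inference pass."""
    return ACTIVE_PARAMS / TOTAL_PARAMS


def words_per_token() -> float:
    """Words-per-token ratio implied by the article's context-window estimate."""
    return CONTEXT_WORDS / CONTEXT_TOKENS


def implied_baseline_price(discounted: float, discount: float) -> float:
    """Price that `discounted` would be `discount` (e.g. 0.98) below.

    Assumption: this only inverts the article's percentage claim.
    """
    return discounted / (1.0 - discount)


if __name__ == "__main__":
    print(f"Full-context prompt + 2,000 output tokens: "
          f"${request_cost_usd(CONTEXT_TOKENS, 2_000):.4f}")
    print(f"Active parameter share: {active_fraction():.2%}")
    print(f"Implied baseline input price: "
          f"${implied_baseline_price(V4_PRO_INPUT_USD_PER_M, 0.98):.2f} per million")
```

At these rates, a prompt that fills the entire 1-million-token window and returns 2,000 tokens would cost about $1.75.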

Why It Happened

DeepSeek’s cost breakthrough stems from two factors: architectural efficiency, since its mixture-of-experts model activates only a fraction of its parameters per request, and the use of Huawei Ascend chips, which lets it sidestep U.S. export restrictions. By training on domestic hardware, DeepSeek avoids the inflated costs of export-restricted Nvidia GPUs. The open-weight strategy accelerates adoption and forces incumbents to compete on price. This isn’t DeepSeek’s first disruption: its R1 model in early 2025 erased $600 billion from Nvidia’s market cap in a day, showing that efficient design can challenge GPU-heavy approaches.
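To make the "activates only a fraction of parameters" idea concrete, here is a toy sketch of top-k expert routing, the general technique behind mixture-of-experts models. It is not DeepSeek's implementation: the expert count, the value of k, the scalar "experts", and the random gate scores are all hypothetical stand-ins, and in a real model the gate scores come from a learned router applied to each token.

```python
import math
import random


def top_k_route(gate_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    top = sorted(range(len(gate_logits)),
                 key=lambda i: gate_logits[i], reverse=True)[:k]
    # Softmax over the selected logits only, so the k weights sum to 1.
    peak = max(gate_logits[i] for i in top)
    exps = [math.exp(gate_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_forward(x, experts, gate_logits, k):
    """Weighted sum of the outputs of only the k routed experts.

    Experts that are not selected are never called, which is where
    the compute saving comes from.
    """
    routed = top_k_route(gate_logits, k)
    output = sum(w * experts[i](x) for i, w in routed)
    return output, [i for i, _ in routed]


# Hypothetical toy setup: 8 experts, 2 active per token.
# These numbers are illustrative and are not V4-Pro's configuration.
NUM_EXPERTS, K = 8, 2
experts = [lambda x, s=s: s * x for s in range(1, NUM_EXPERTS + 1)]
rng = random.Random(0)
gate_logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

output, used = moe_forward(1.0, experts, gate_logits, K)
print(f"experts evaluated: {len(used)} of {NUM_EXPERTS}")
```

In this toy, only 2 of the 8 expert functions run for a given input. The same principle, at far larger scale, is how a model can hold 1.6 trillion parameters while computing with 49 billion per pass.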

Broader Impact

While not directly tied to crypto, the AI pricing war could boost blockchain projects that integrate AI by lowering infrastructure costs. Cheaper models may spur decentralized compute networks and AI-focused tokens. However, incumbents like OpenAI and Anthropic face margin compression, possibly slowing their pace of innovation. The use of domestic Chinese hardware also reshapes global AI supply chains, with geopolitical implications for the semiconductor industry.

What to Watch Next

DeepSeek’s promised supernode expansion may cut token costs further, triggering a broader industry price war.
Western AI labs could respond with open-weight releases or aggressive price cuts, reshaping enterprise AI deals.
U.S. policymakers may tighten chip export controls if Huawei Ascend proves a viable path, escalating tech tensions.

This article is for informational purposes only and does not constitute financial advice.

Source: Decrypt


Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.
