⚡

Technology & InnovationNeutral

90% confidence

Rio's AI Model Claimed to Beat DeepSeek, Revealed as Nex Merge

IplanRIO released Rio 3.5, a 397B parameter AI model with benchmark scores surpassing DeepSeek. However, Nex proved it was a direct weight merge of Nex and Qwen models. The city agency later retracted its benchmark claims and credited Nex, sparking controversy.

Jun 15, 2026, 7:43 PM UTCDecryptJose Antonio Lanz

Quick Take

IplanRIO released Rio 3.5, claiming frontier performance with SwiReasoning framework.

Nex proved the model is a 0.6/0.4 weight merge of Nex N2 Pro and Qwen.

IplanRIO updated the model card, credited Nex, and retracted benchmark figures.

Market Impact Analysis

Neutral

No direct crypto market implications; the story is about an AI model controversy.

Timeframeshort

Speculation Analysis

Factuality80/100

RumorsVerified

Speculation Trigger20/100

MinimalExtreme FOMO

Key Takeaways

IplanRIO’s Rio 3.5 AI model appeared to outperform DeepSeek and Qwen, but was later exposed as a simple weight merge of existing models.
Nex mathematically proved the 397B-parameter model was a 0.6 Nex / 0.4 Qwen blend, prompting IplanRIO to retract its benchmark claims.
The incident raises questions about transparency and credibility in AI development, especially as open-source merging becomes more prevalent.

Model Size 397B parameters Mixture-of-Experts

Development Cost R$500,000 (~$100K USD) claimed cost

Merge Ratio 0.6 Nex / 0.4 Qwen proven by Nex

Benchmark Claim Terminal-Bench 70.8% later retracted

What Happened

On June 13, IplanRIO released Rio 3.5 Open, a 397-billion-parameter AI model with a permissive MIT license. The city of Rio de Janeiro’s IT agency touted it as a government-built frontier model, claiming its SwiReasoning framework outperformed top models like DeepSeek v4 Pro and Qwen 3.7 Plus on key benchmarks. The mayor amplified the news, framing it as a Global South tech triumph. But within days, AI company Nex published a mathematical proof that Rio 3.5 wasn’t newly trained—it was a direct weight merge of Nex N2 Pro and Qwen models at a 0.6/0.4 ratio. IplanRIO quickly updated the model card, credited Nex, and removed all benchmark claims, blaming an “incorrect upload.”

The Numbers

Self-reported scores were striking: Terminal-Bench 2.1 at 70.8% (vs. Qwen 3.7 Plus’s 70.3%), IMOAnswerBench at 89.5%, and HLE at 36.5%. But these figures were pulled shortly after release. Nex’s analysis revealed that weight merging—a technique combining multiple models’ parameters—required no training, contradicting IplanRIO’s original description of a “post-train” with a new reasoning layer. The claimed development cost of R$500,000 (~$100,000) now appears inflated, given that merging is computationally trivial. The model’s Mixture-of-Experts architecture (17B active parameters per token) made inference cheap, but its benchmark-lifting trick was merely blending existing strengths.

Why It Happened

The controversy underscores the ease of fabricating benchmark wins in the open-source AI race. Weight merging can produce models that superficially perform well on standard tests by averaging strengths, but it doesn’t create genuinely novel capabilities. IplanRIO likely aimed to generate political capital and attention for Rio’s tech ambitions, riding the viral timing of Brazil’s World Cup opener. The initial lack of technical scrutiny—until Nex stepped in—reveals how claims from non-traditional AI players can gain traction if benchmark results look good on paper. As open-source licensing makes model mixing trivial, the barrier to making fraudulent performance claims drops.

Broader Impact

This episode will intensify demands for transparent model documentation and independent verification. Expect heightened skepticism toward government-announced AI breakthroughs, especially from the Global South where such projects carry symbolic weight. For the AI community, it highlights the double-edged nature of open-source merging: while democratizing access, it also enables credibility-hacking. Future benchmark systems may need to incorporate anti-merge detection.

What to Watch Next

Check for third-party audits of Rio 3.5’s actual performance versus the retracted benchmarks.
Monitor IplanRIO’s future AI releases for genuine training efforts or further merge-based shortcuts.
Watch for industry-wide calls for benchmark integrity standards, particularly from organizations like Hugging Face or EleutherAI.

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt

Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

Rio's AI Model Claimed to Beat DeepSeek, Revealed as Nex Merge

Quick Take

Market Impact Analysis

Speculation Analysis

Key Takeaways

What Happened

The Numbers

Why It Happened

Broader Impact

What to Watch Next

Always late to trends?

TAGS

Read Next

KelpDAO $292M Exploit Triggers Aave Bank Run, DeFi in Crisis

Ethereum Risks $1.5K Drop from Vitalik's ETH Sales

Most Read

Saylor: Bitcoin Needs No Staking or Yield, Financial Products Suffice

IREN Expands Into Europe With Nostrum Deal, AI Pivot Gains Speed

Bitcoin's Recovery Hinges on US-Iran Deal, On-Chain Weakness

Strategy's $100M BTC Buy, US-Iran Deal Key for Recovery

Bybit Launches Dedicated Options Market for Tether Gold

Bitcoin Recovery Hinges on US-Iran Peace Deal, Metrics Weak

GAO Urges FDIC to Coordinate Crypto Risk Oversight

Platform

Company

Legal