Technology & InnovationNeutral
37

Rio's AI Model Claimed to Beat DeepSeek, Revealed as Nex Merge

IplanRIO released Rio 3.5, a 397B parameter AI model with benchmark scores surpassing DeepSeek. However, Nex proved it was a direct weight merge of Nex and Qwen models. The city agency later retracted its benchmark claims and credited Nex, sparking controversy.

DecryptJose Antonio Lanz

Quick Take

1

IplanRIO released Rio 3.5, claiming frontier performance with SwiReasoning framework.

2

Nex proved the model is a 0.6/0.4 weight merge of Nex N2 Pro and Qwen.

3

IplanRIO updated the model card, credited Nex, and retracted benchmark figures.

Market Impact Analysis

Neutral

No direct crypto market implications; the story is about an AI model controversy.

Timeframeshort

Speculation Analysis

Factuality80/100
RumorsVerified
Speculation Trigger20/100
MinimalExtreme FOMO

Key Takeaways

  • IplanRIO’s Rio 3.5 AI model appeared to outperform DeepSeek and Qwen, but was later exposed as a simple weight merge of existing models.
  • Nex mathematically proved the 397B-parameter model was a 0.6 Nex / 0.4 Qwen blend, prompting IplanRIO to retract its benchmark claims.
  • The incident raises questions about transparency and credibility in AI development, especially as open-source merging becomes more prevalent.
Model Size 397B parameters Mixture-of-Experts
Development Cost R$500,000 (~$100K USD) claimed cost
Merge Ratio 0.6 Nex / 0.4 Qwen proven by Nex
Benchmark Claim Terminal-Bench 70.8% later retracted

What Happened

On June 13, IplanRIO released Rio 3.5 Open, a 397-billion-parameter AI model with a permissive MIT license. The city of Rio de Janeiro’s IT agency touted it as a government-built frontier model, claiming its SwiReasoning framework outperformed top models like DeepSeek v4 Pro and Qwen 3.7 Plus on key benchmarks. The mayor amplified the news, framing it as a Global South tech triumph. But within days, AI company Nex published a mathematical proof that Rio 3.5 wasn’t newly trained—it was a direct weight merge of Nex N2 Pro and Qwen models at a 0.6/0.4 ratio. IplanRIO quickly updated the model card, credited Nex, and removed all benchmark claims, blaming an “incorrect upload.”

The Numbers

Self-reported scores were striking: Terminal-Bench 2.1 at 70.8% (vs. Qwen 3.7 Plus’s 70.3%), IMOAnswerBench at 89.5%, and HLE at 36.5%. But these figures were pulled shortly after release. Nex’s analysis revealed that weight merging—a technique combining multiple models’ parameters—required no training, contradicting IplanRIO’s original description of a “post-train” with a new reasoning layer. The claimed development cost of R$500,000 (~$100,000) now appears inflated, given that merging is computationally trivial. The model’s Mixture-of-Experts architecture (17B active parameters per token) made inference cheap, but its benchmark-lifting trick was merely blending existing strengths.

Why It Happened

The controversy underscores the ease of fabricating benchmark wins in the open-source AI race. Weight merging can produce models that superficially perform well on standard tests by averaging strengths, but it doesn’t create genuinely novel capabilities. IplanRIO likely aimed to generate political capital and attention for Rio’s tech ambitions, riding the viral timing of Brazil’s World Cup opener. The initial lack of technical scrutiny—until Nex stepped in—reveals how claims from non-traditional AI players can gain traction if benchmark results look good on paper. As open-source licensing makes model mixing trivial, the barrier to making fraudulent performance claims drops.

Broader Impact

This episode will intensify demands for transparent model documentation and independent verification. Expect heightened skepticism toward government-announced AI breakthroughs, especially from the Global South where such projects carry symbolic weight. For the AI community, it highlights the double-edged nature of open-source merging: while democratizing access, it also enables credibility-hacking. Future benchmark systems may need to incorporate anti-merge detection.

What to Watch Next

  • Check for third-party audits of Rio 3.5’s actual performance versus the retracted benchmarks.
  • Monitor IplanRIO’s future AI releases for genuine training efforts or further merge-based shortcuts.
  • Watch for industry-wide calls for benchmark integrity standards, particularly from organizations like Hugging Face or EleutherAI.

Source: Decrypt

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt
Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

© 2026 Bytewit. All Rights Reserved. This article is for informational purposes only.

Read Next

Most Read

🏛️
Top StoriesBullish
60

Saylor: Bitcoin Needs No Staking or Yield, Financial Products Suffice

Michael Saylor outlines a 'Digital Asset Stack' positioning Bitcoin as pure capital, with returns generated through financial products like Strategy's STRC preferred stock rather than protocol-based yields, reinforcing BTC's role as a treasury reserve asset.

BTC
80% confidence
Jun 16, 2026, 10:11 AM UTC · Cointelegraph
Rio AI Model Claimed to Beat DeepSeek, Was Merge | Bytewit