Technology & InnovationNeutral
50

AI Guardrail Removal in Minutes Sparks Regulation Concerns

Tests by Financial Times and Alice reveal that open-source AI models from Meta and Google can have safety guardrails stripped in under 10 minutes, enabling harmful content. Crypto industry experts argue regulation must shift from model development to deployment and distribution control.

CointelegraphCointelegraph by Christina Comben

Quick Take

1

FT and Alice testing shows AI safety guardrails on open-source models can be removed in under 10 minutes.

2

Modified models then generated responses on bioweapons, malware, and prohibited content.

3

Crypto industry figures call for regulation focus on deployment, not just model building.

4

Experts warn open-source AI safety challenges resemble open-source software and crypto network distribution issues.

Market Impact Analysis

Neutral

Primarily an AI policy discussion with tangential crypto connections; unlikely to directly impact crypto markets.

Timeframemedium

Speculation Analysis

Factuality80/100
RumorsVerified
Speculation Trigger20/100
MinimalExtreme FOMO

Key Takeaways

  • Open-source AI models from Meta and Google had safety protections stripped in under 10 minutes, FT testing reveals.
  • Modified models then generated responses on bioweapons and malware, highlighting post-release enforcement challenges.
  • Experts argue AI regulation must shift from model development to deployment and distribution control.
  • Crypto industry figures note parallels with open-source software and decentralized network security.
Removal TimeUnder 10 minTo strip safety guardrails
Models AffectedMeta & GoogleOpen-source AI models
Regulatory FocusShift neededFrom development to deployment
Harmful OutputsBioweapons, malwareGenerated after modification

What Happened

Financial Times testing with safety group Alice demonstrated that guardrails on open-source AI models from Meta and Google can be removed in under 10 minutes. Using tools from public code repositories, testers modified the models to bypass refusals, generating content on bioweapons, malware, and chemical hazards. The ability to strip protections post-release challenges the assumption that developer-imposed safety measures survive distribution. Unlike closed AI systems, open-source weights allow unrestricted local modification, making enforcement at the development stage insufficient. The results intensify debate over where responsibility for AI safety should lie.

The Numbers

The attack required no specialized hardware and took under 10 minutes. Meta’s and Google’s models were among those tested. The EU AI Act and emerging frameworks in the UK and US focus heavily on model development, but these findings suggest a gap. Crypto security experts note parallels with open-source software, where patches are optional. With autonomous AI agents on the rise, runtime safeguards become critical—yet current governance rarely addresses this layer.

Why It Happened

Open-source AI models trade publish-and-control for transparency and innovation, but once weights are public, any safety filter can be reverse-engineered or deleted. The FT test exploited this: guardrails are software logic, not hardware locks. As models become more capable, the incentive to jailbreak them grows. The crypto industry’s experience with immutable smart contracts and decentralized networks offers a precedent: rules must be enforced at the point of execution, not just at creation.

Broader Impact

The findings pressure policymakers to rethink AI oversight. Instead of solely auditing development, regulators may need to mandate runtime monitoring, content filtering at API gateways, and distribution controls. For crypto, where AI agents are being integrated into DeFi and DAOs, the risks are amplified—autonomous agents with stripped safety layers could execute harmful transactions. Secure AI infrastructure becomes a shared interest between AI and crypto sectors.

What to Watch Next

  • EU, UK, and US regulators could propose amendments to AI acts targeting deployment-level controls.
  • Open-source AI communities may face pressure to adopt opt-in security standards or reputation systems.
  • Crypto projects using AI agents (like Olas) may lead secure multi-agent frameworks.

Source: Cointelegraph

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Cointelegraph
Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

© 2026 Bytewit. All Rights Reserved. This article is for informational purposes only.

Read Next

Most Read

⚖️
Regulatory UpdatesNeutral
59

Spain Blocks Polymarket, Kalshi Amid Gambling Crackdown

Spain's gambling regulator has blocked Polymarket and Kalshi, accusing them of operating without licenses. The restriction will last 3-4 months while proceedings unfold. This follows similar bans in Indonesia and other nations, adding to regulatory headwinds for prediction markets.

90% confidence
May 26, 2026, 5:02 PM UTC · Cointelegraph
AI Guardrails Stripped in Under 10 Minutes: Regulation Shift | Bytewit