Technology & InnovationBearish
57

AI Agents Highly Vulnerable to Prompt Injection Attacks, Study Reveals

New research shows AI agents like those using GPT-5 and Gemini fail to resist prompt injection attacks, with direct attacks succeeding over 79% of the time. The findings raise security concerns as agents are increasingly used for crypto trading and other online tasks.

DecryptJason Nelson

Quick Take

1

Direct prompt injection attacks on AI agents succeeded more than 79% of the time.

2

Indirect attacks via hidden web content had success rates up to 68%.

3

Flaws raise security concerns for crypto trading and autonomous online activities.

4

Researchers urge better benchmarks as AI agent adoption accelerates.

Market Impact Analysis

Bearish

Vulnerabilities in AI agents used for crypto trading could undermine trust and slow adoption, creating negative sentiment for AI-related crypto projects.

Timeframeshort

Speculation Analysis

Factuality85/100
RumorsVerified
Speculation Trigger55/100
MinimalExtreme FOMO

Key Takeaways

  • Direct prompt injection attacks on AI agents succeeded more than 79% of the time in new research, posing critical risks for autonomous crypto trading.
  • Indirect attacks embedded in web content manipulated AI agent behavior in up to 68% of test cases, bypassing user intent.
  • 3,168 attack simulations on GPT-5 and Gemini 2.5-Flash revealed no model consistently resisted hidden instructions.
  • Flaws could enable theft and market manipulation as agent-driven DeFi grows, demanding urgent defensive benchmarks.
Direct Attack Success>79%Across all configurations
Indirect Attack Success41.67%–68.16%Hidden content manipulation
Simulations Conducted3,168Using GPT-5 & Gemini 2.5-Flash
Tested ModelsGPT-5, Gemini 2.5-FlashNanoBrowser & BrowserUse

What Happened

AI agents powered by GPT-5 and Gemini failed to resist prompt injection attacks in a comprehensive study published Thursday. Researchers from Nanyang Technological University, ST Engineering, IBM Research, and UIUC developed StakeBench, a benchmark that simulates realistic online environments. Direct attacks — where hidden instructions appear in content the agent processes — succeeded over 79% of the time across all configurations. Indirect attacks, embedded in web pages, achieved success rates from 41.67% to 68.16%. The findings expose critical gaps as developers deploy autonomous agents for crypto trading and other financial activities.

The Numbers

The team ran 3,168 attack simulations using NanoBrowser and BrowserUse with state-of-the-art models. Direct prompt injection consistently fooled agents, with a success rate surpassing 79%. Indirect methods, where malicious prompts lurk in website content, were effective 41.67% to 68.16% of the time. Neither GPT-5 nor Gemini 2.5-Flash defended reliably. These numbers underscore that existing LLMs lack the contextual awareness to filter adversarial instructions, a flaw magnified when agents autonomously browse and transact.

Why It Happened

Prompt injection exploits the fundamental design of large language models, which process all input as instructions. Attackers hide commands inside text, images, or links that agents encounter during routine browsing. Current security benchmarks focus narrowly on attack feasibility, not the real-world harm distribution StakeBench addresses. With crypto trading bots increasingly given latitude to execute transactions, a single injected prompt could redirect funds or manipulate market activity. The victim-dependent nature of the risk makes uniform defenses difficult, as the same attack yields asymmetric consequences based on the agent's permissions.

Broader Impact

The vulnerability strikes at the heart of trust in AI-driven finance. Autonomous agents are already deployed for yield farming, arbitrage, and payment processing. If attackers can inject instructions at scale, the result could be mass theft, order-book spoofing, or flash loan exploits. Regulatory pressure on AI agent security will likely intensify. Without industry-wide testing standards like StakeBench, the crypto sector risks a wave of incidents that could stall adoption and draw harsh oversight.

What to Watch Next

  • Expect AI labs and crypto platforms to accelerate work on input sanitization and agent permissioning.
  • Look for real-world prompt injection incidents as agent adoption grows, especially in DeFi protocols.
  • Monitor whether regulators cite these findings to mandate security audits for autonomous trading systems.

Source: Decrypt

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt
Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

© 2026 Bytewit. All Rights Reserved. This article is for informational purposes only.

Read Next

Most Read

🏛️
Utility & AdoptionBullish
62

Crypto Firms to Dominate White House UFC Octagon Branding

Crypto firms like VeChain, Polymarket, and Crypto.com gain unprecedented branding at Trump's White House UFC event, showcasing pro-crypto ties. Polymarket presents a public service award, Exodus plans fan experiences, and a $1M CRO bonus is announced.

CROVET
80% confidence
Jun 12, 2026, 7:56 PM UTC · Decrypt
Prompt Injection Attacks Succeed 79% on AI Agents | Bytewit