⚡

Technology & InnovationBearish

75% confidence

AI Agents Highly Vulnerable to Prompt Injection Attacks, Study Reveals

New research shows AI agents like those using GPT-5 and Gemini fail to resist prompt injection attacks, with direct attacks succeeding over 79% of the time. The findings raise security concerns as agents are increasingly used for crypto trading and other online tasks.

Jun 12, 2026, 7:22 PM UTCDecryptJason Nelson

Quick Take

Direct prompt injection attacks on AI agents succeeded more than 79% of the time.

Indirect attacks via hidden web content had success rates up to 68%.

Flaws raise security concerns for crypto trading and autonomous online activities.

Researchers urge better benchmarks as AI agent adoption accelerates.

Market Impact Analysis

Bearish

Vulnerabilities in AI agents used for crypto trading could undermine trust and slow adoption, creating negative sentiment for AI-related crypto projects.

Timeframeshort

Speculation Analysis

Factuality85/100

RumorsVerified

Speculation Trigger55/100

MinimalExtreme FOMO

Key Takeaways

Direct prompt injection attacks on AI agents succeeded more than 79% of the time in new research, posing critical risks for autonomous crypto trading.
Indirect attacks embedded in web content manipulated AI agent behavior in up to 68% of test cases, bypassing user intent.
3,168 attack simulations on GPT-5 and Gemini 2.5-Flash revealed no model consistently resisted hidden instructions.
Flaws could enable theft and market manipulation as agent-driven DeFi grows, demanding urgent defensive benchmarks.

Direct Attack Success>79%Across all configurations

Indirect Attack Success41.67%–68.16%Hidden content manipulation

Simulations Conducted3,168Using GPT-5 & Gemini 2.5-Flash

Tested ModelsGPT-5, Gemini 2.5-FlashNanoBrowser & BrowserUse

What Happened

AI agents powered by GPT-5 and Gemini failed to resist prompt injection attacks in a comprehensive study published Thursday. Researchers from Nanyang Technological University, ST Engineering, IBM Research, and UIUC developed StakeBench, a benchmark that simulates realistic online environments. Direct attacks — where hidden instructions appear in content the agent processes — succeeded over 79% of the time across all configurations. Indirect attacks, embedded in web pages, achieved success rates from 41.67% to 68.16%. The findings expose critical gaps as developers deploy autonomous agents for crypto trading and other financial activities.

The Numbers

The team ran 3,168 attack simulations using NanoBrowser and BrowserUse with state-of-the-art models. Direct prompt injection consistently fooled agents, with a success rate surpassing 79%. Indirect methods, where malicious prompts lurk in website content, were effective 41.67% to 68.16% of the time. Neither GPT-5 nor Gemini 2.5-Flash defended reliably. These numbers underscore that existing LLMs lack the contextual awareness to filter adversarial instructions, a flaw magnified when agents autonomously browse and transact.

Why It Happened

Prompt injection exploits the fundamental design of large language models, which process all input as instructions. Attackers hide commands inside text, images, or links that agents encounter during routine browsing. Current security benchmarks focus narrowly on attack feasibility, not the real-world harm distribution StakeBench addresses. With crypto trading bots increasingly given latitude to execute transactions, a single injected prompt could redirect funds or manipulate market activity. The victim-dependent nature of the risk makes uniform defenses difficult, as the same attack yields asymmetric consequences based on the agent's permissions.

Broader Impact

The vulnerability strikes at the heart of trust in AI-driven finance. Autonomous agents are already deployed for yield farming, arbitrage, and payment processing. If attackers can inject instructions at scale, the result could be mass theft, order-book spoofing, or flash loan exploits. Regulatory pressure on AI agent security will likely intensify. Without industry-wide testing standards like StakeBench, the crypto sector risks a wave of incidents that could stall adoption and draw harsh oversight.

What to Watch Next

Expect AI labs and crypto platforms to accelerate work on input sanitization and agent permissioning.
Look for real-world prompt injection incidents as agent adoption grows, especially in DeFi protocols.
Monitor whether regulators cite these findings to mandate security audits for autonomous trading systems.

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt

Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

AI Agents Highly Vulnerable to Prompt Injection Attacks, Study Reveals

Quick Take

Market Impact Analysis

Speculation Analysis

Key Takeaways

What Happened

The Numbers

Why It Happened

Broader Impact

What to Watch Next

Always late to trends?

TAGS

Read Next

KelpDAO $292M Exploit Triggers Aave Bank Run, DeFi in Crisis

Ethereum Risks $1.5K Drop from Vitalik's ETH Sales

Most Read

Crypto Firms to Dominate White House UFC Octagon Branding

AI Agents Highly Vulnerable to Prompt Injection Attacks, Study Reveals

Moonshot AI Desktop Agent Kimi Work Unleashes 300 AI Agents

Americans Fear AI Job Loss But Hope for Cure, Survey Reveals

Standard Chartered analyst calls Bitcoin bottom at $59,000

Tokenized SpaceX Offerings Cancelled as SPCX Surges

Tennessee Man Indicted for Crypto Ponzi Scheme Defrauding Millions

Platform

Company

Legal