Could this incident have been prevented?

Proactive segmentation, application of least-privilege policies, robust monitoring, and anomaly detection could help mitigate or prevent exploitation of AI agent features.

What is agent mode in commercial AI and why is it a risk?

Agent mode allows AI systems to autonomously perform tasks for users, but this capability can be abused to escalate privileges or bypass safeguards if not tightly controlled.

Double Agents: The 2024 Exploitation of AI Agent Mode in Commercial Platforms

Q: What compliance frameworks are relevant to AI-in-the-middle attacks on agent mode?

Applicable frameworks include NIST 800-53, PCI DSS 4.0, HIPAA, and the Zero Trust Maturity Model, focused on data protection, privileged access, and anomaly detection.

Discover how threat actors abused agent mode in commercial AI platforms to launch AI-in-the-middle attacks, highlighting urgent gaps in enterprise AI security.

Published: January 8, 2026

Share this on:

Executive Summary

In March 2024, security researchers revealed how threat actors exploited 'agent mode' in commercial AI products to conduct AI-in-the-middle (AIitM) attacks. By abusing the emerging capability that allows AI assistants to autonomously perform actions, adversaries were able to impersonate users or escalate privileges by intercepting and manipulating commands. This allowed attackers to facilitate lateral movement, data exfiltration, and policy circumvention within enterprise environments, often leaving minimal forensic traces. The incident highlighted how the wider adoption of agentic AI features substantially expands the potential threat surface for organizations.

This breach has rapidly gained industry attention amid a surge in advanced AI-driven attacks and a wave of regulatory scrutiny on AI operational security. As enterprises accelerate their deployment of commercial AI tools, understanding the novel risks introduced by agentic AI is now critical for leadership and security teams.

Why This Matters Now

Agent mode in commercial AI systems is being rapidly adopted before security controls and monitoring can keep pace. This exposes enterprises to new risks where threat actors can leverage AI autonomy for stealthy privilege escalation, data exfiltration, and evasion, making robust detection and policy enforcement around AI features an urgent priority.

Attack Path Analysis

The attacker initially exploited 'agent mode' in an AI product, abusing elevated permissions granted for agentic tasks. Through manipulation of AI permissions or misconfigured connectivity, they escalated their privileges within the cloud environment. Lateral movement was achieved by pivoting across cloud services or Kubernetes workloads using cross-service access. The attacker then established command and control channels, leveraging cloud egress paths to maintain persistence and orchestrate activities. Data exfiltration likely occurred as sensitive information or model outputs were sent covertly to external locations. Lastly, the impact manifested as compromised data confidentiality, unauthorized system actions, or facilitating further business disruption via AI-driven automation.

Kill Chain Progression

Initial Compromise

Mediuminferred

Privilege Escalation

Medium

Lateral Movement

Medium

Command & Control

Medium

Exfiltration

Medium

Impact

Medium

Initial Compromise

Description

Exploitation of misconfigured or over-privileged 'agent mode' in a commercial AI solution enabled attacker access to the cloud environment.

Confidence:

Medium

Related CVEs

Included CVEs with severity scores, affected products, exploit status, and reference links.

CVE-2025-64496
CVSS 8
A code injection vulnerability in Open WebUI's Direct Connection feature allows remote attackers to execute arbitrary JavaScript via Server-Sent Events (SSEs), potentially leading to account takeover and remote code execution.
Affected Products:
Open WebUI Open WebUI – <= 0.6.34
Exploit Status:
exploited in the wild
References:
https://www.techradar.com/pro/security/this-webui-vulnerability-allows-remote-code-execution-heres-how-to-stay-safe
CVE-2025-12058
CVSS 5.9
A vulnerability in Keras versions up to 3.11.3 allows arbitrary file access and potential Server-Side Request Forgery (SSRF) when loading malicious .keras model files, due to unsafe handling in the StringLookup and IndexLookup preprocessing layers.
Affected Products:
Keras Keras – <= 3.11.3
Exploit Status:
proof of concept
References:
https://www.zscaler.com/blogs/security-research/zscaler-discovers-vulnerability-keras-models-allowing-arbitrary-file-access
CVE-2024-27564
CVSS 6.5
A Server-Side Request Forgery (SSRF) vulnerability in OpenAI's ChatGPT infrastructure allows attackers to inject malicious URLs into input parameters, causing the application to make unintended requests on their behalf.
Affected Products:
OpenAI ChatGPT – N/A
Exploit Status:
exploited in the wild
References:
https://securityboulevard.com/2025/03/openai-under-attack-cve-2024-27564-actively-exploited-in-the-wild/

MITRE ATT&CK® Techniques

Persistence

T1556

Modify Authentication Process

Impact

T1565

Data Manipulation

Execution

T1059

Command and Scripting Interpreter

Initial Access

T1078

Valid Accounts

Initial Access

T1204

User Execution

Defense Evasion

T1550

Use Alternate Authentication Material

Initial Access

T1566

Phishing

Potential Compliance Exposure

Mapping incident impact across multiple compliance frameworks.

PCI DSS 4.0 – Strong Authentication for Systems Access

Control ID: 8.3.1

Exploitation of AI ‘agent mode’ could allow unauthorized actions, highlighting deficiencies in authentication controls required for systems that handle payment data or sensitive operations.

NYDFS 23 NYCRR 500 – Cybersecurity Program

Control ID: 500.02

Abuse of AI assistants for unauthorized actions demonstrates a lack of comprehensive cybersecurity controls and risk assessments for emerging technologies.

DORA (Digital Operational Resilience Act) – ICT Risk Management Framework

Control ID: Article 9

Vulnerabilities in AI agent operation represent gaps in institutions’ risk management over digital tools, as required by DORA to ensure ICT resilience.

CISA ZTMM 2.0 – Continuous Authentication and Authorization

Control ID: Identity Pillar: 2.6

Unmonitored AI system privileges and potential for ‘AI-in-the-middle’ attacks reveal insufficient identity, credential, and access management practices expected under Zero Trust principles.

NIS2 Directive – Cybersecurity Risk Management and Reporting

Control ID: Article 21

Failure to implement adequate controls for AI systems exposes organizations to network and information system vulnerabilities targeted under NIS2.

Sector Implications

Industry-specific impact of the vulnerabilities, including operational, regulatory, and cloud security risks.

Computer Software/Engineering

AI agent mode vulnerabilities in commercial AI products create critical attack vectors for AIitM attacks, compromising automated systems and cloud-native security fabric implementations.

Financial Services

Zero trust segmentation failures and encrypted traffic vulnerabilities expose banking systems to lateral movement attacks through compromised AI assistants performing unauthorized financial operations.

Health Care / Life Sciences

HIPAA compliance risks escalate as AI-in-the-middle attacks exploit egress security weaknesses, potentially exposing patient data through shadow AI and anomaly detection bypass.

Information Technology/IT

Kubernetes security and multicloud visibility gaps enable adversaries to abuse agent mode capabilities, compromising east-west traffic security and threat detection systems.

Sources

Double agents: How adversaries can abuse “agent mode” in commercial AI productshttps://redcanary.com/blog/threat-detection/ai-agent-mode/
Verified

This WebUI vulnerability allows remote code execution - here's how to stay safehttps://www.techradar.com/pro/security/this-webui-vulnerability-allows-remote-code-execution-heres-how-to-stay-safe

Verified

Zscaler Discovers Vulnerability in Keras Models Allowing Arbitrary File Access and SSRF (CVE-2025-12058)https://www.zscaler.com/blogs/security-research/zscaler-discovers-vulnerability-keras-models-allowing-arbitrary-file-access

Verified

OpenAI Under Attack: CVE-2024-27564 Actively Exploited in the Wildhttps://securityboulevard.com/2025/03/openai-under-attack-cve-2024-27564-actively-exploited-in-the-wild/

Verified

Frequently Asked Questions

Applicable frameworks include NIST 800-53, PCI DSS 4.0, HIPAA, and the Zero Trust Maturity Model, focused on data protection, privileged access, and anomaly detection.

Cloud Native Security Fabric Mitigations and ControlsCNSF

Enforcing zero trust segmentation, strict egress controls, and real-time anomaly detection would have constrained the attack at every stage, limiting privilege misuse by AI agents, blocking unauthorized lateral movement, and detecting or preventing data exfiltration and system misuse.

Initial Compromise

Control: Zero Trust Segmentation

Mitigation: Limits agent permissions to least privilege and reduces attack surface.

Privilege Escalation

Control: Zero Trust Segmentation

Mitigation: Blocks unauthorized role assumption or privilege increases.

Lateral Movement

Control: East-West Traffic Security

Mitigation: Stops unauthorized intra-cloud traffic used for lateral movement.

Command & Control

Control: Cloud Firewall (ACF)

Mitigation: Detects and blocks suspicious outbound or C2 traffic.

Exfiltration

Control: Egress Security & Policy Enforcement

Mitigation: Prevents data exfiltration to untrusted destinations.

Impact (Mitigations)

Detects and alerts on AI-driven anomalies and potential misuse.

Impact at a Glance

Affected Business Functions

AI Model Deployment
Data Processing
User Authentication

Operational Disruption

Estimated downtime: 5 days

Financial Impact

Estimated loss: $500,000

Data Exposure

Potential exposure of sensitive user data and proprietary AI models due to unauthorized access and code execution.

Recommended Actions

• Enforce least-privilege, identity-based segmentation for AI agents to minimize lateral movement and limit blast radius.
• Implement robust egress controls and FQDN filtering to block agent-driven exfiltration and unapproved service connectivity.
• Deploy real-time anomaly and threat detection to identify AI-related misuse and abnormal behaviors.
• Leverage inline network policy enforcement at both east-west and egress for cloud/Kubernetes workloads.
• Regularly audit AI agent permissions, network flows, and policy enforcement posture to reduce risk from agentic automation abuse.

Secure the Paths Between Cloud Workloads

A cloud-native security fabric that enforces Zero Trust across workload communication—reducing attack paths, compliance risk, and operational complexity.

Stop Advanced Threats Get a Free Workload Attack Path Assessment Under Active Attack?

Double Agents: The 2024 Exploitation of AI Agent Mode in Commercial Platforms

Executive Summary

Why This Matters Now

Attack Path Analysis

Kill Chain Progression

Initial Compromise

Description

Related CVEs

CVE-2025-64496

Affected Products:

Exploit Status:

References:

CVE-2025-12058

Affected Products:

Exploit Status:

References:

CVE-2024-27564

Affected Products:

Exploit Status:

References:

MITRE ATT&CK® Techniques

Modify Authentication Process

Data Manipulation

Command and Scripting Interpreter

Valid Accounts

User Execution

Use Alternate Authentication Material

Phishing

Potential Compliance Exposure

PCI DSS 4.0 – Strong Authentication for Systems Access

NYDFS 23 NYCRR 500 – Cybersecurity Program

DORA (Digital Operational Resilience Act) – ICT Risk Management Framework

CISA ZTMM 2.0 – Continuous Authentication and Authorization

NIS2 Directive – Cybersecurity Risk Management and Reporting

Sector Implications

Computer Software/Engineering

Financial Services

Health Care / Life Sciences

Information Technology/IT

Sources

Frequently Asked Questions

Cloud Native Security Fabric Mitigations and ControlsCNSF

Impact at a Glance

Affected Business Functions

Recommended Actions

Key Takeaways & Next Steps

Secure the Paths Between Cloud Workloads