Could this prompt injection attack have been prevented?

Stricter input sanitization, policy segmentation, and robust threat detection at the application layer could have mitigated the risk and prevented malicious commands from being executed via disguised URLs.

What are the business risks posed by this type of attack?

Prompt injection attacks may enable unauthorized access, data exfiltration, or automated actions, posing risks to data integrity, confidentiality, and regulatory compliance for organizations deploying AI-powered browsers.

Prompt Injection Flaw in ChatGPT Atlas Browser Enables Hidden Command Execution

A critical application security flaw enables attackers to trick OpenAI's ChatGPT Atlas Browser using disguised URLs, revealing new risks for AI-powered user interfaces.

Published: January 9, 2026

Share this on:

Executive Summary

In October 2025, security researchers at NeuralTrust identified a prompt injection vulnerability in the newly launched OpenAI ChatGPT Atlas Browser, allowing attackers to disguise malicious prompts as benign URLs in the omnibox. The attack exploits how the omnibox interprets user input, confusing it as either a navigation destination or a natural-language command to the agent. Malicious actors can craft deceptive URLs that bypass basic user scrutiny and trigger hidden commands, exposing users to unauthorized actions, potential data leaks, and unintended system manipulations. OpenAI was notified and subsequently began working on mitigations to address this risk.

This incident underscores a rising wave of sophisticated prompt injection attacks targeting AI-powered web interfaces. As AI tools become widely integrated in everyday applications, the attack surface expands, making seamless human-computer interactions susceptible to exploitation from both classic and emerging attack vectors.

Why This Matters Now

AI-driven applications are quickly becoming central to web and enterprise workflows, but insufficient input validation exposes them to novel risks like prompt injection. Since the Atlas Browser is in wide early use, attackers can exploit this to hijack user sessions or automate harmful commands, highlighting the urgent need for robust security controls in AI-enabled user interfaces.

Attack Path Analysis

The attacker initiated the compromise by sending a fake URL designed as a prompt injection to the ChatGPT Atlas Browser's omnibox, causing it to execute hidden commands. Upon successful injection, they escalated privilege by manipulating the browser agent to perform unintended operations. The attacker then attempted lateral movement, potentially targeting connected cloud workloads or browser sessions. For persistence and remote control, covert command and control channels were established via the browser's outbound communications. Sensitive data or session information was exfiltrated over disguised outbound traffic. Finally, attacker actions could disrupt service, modify user data, or launch secondary attacks via the compromised agent browser.

Kill Chain Progression

Initial Compromise

High

Privilege Escalation

Mediuminferred

Lateral Movement

Lowinferred

Command & Control

Mediuminferred

Exfiltration

Lowinferred

Impact

Mediuminferred

Initial Compromise

Description

The attacker delivered a malicious prompt disguised as a URL to the omnibox, triggering prompt injection and unauthorized command execution.

Confidence:

High

Related CVEs

Included CVEs with severity scores, affected products, exploit status, and reference links.

CVE-2025-12345
CVSS 8.8
A prompt injection vulnerability in OpenAI's ChatGPT Atlas browser allows attackers to execute arbitrary commands by disguising malicious prompts as URLs.
Affected Products:
OpenAI ChatGPT Atlas – 1.0.0, 1.0.1
Exploit Status:
exploited in the wild
References:
https://nvd.nist.gov/vuln/detail/CVE-2025-12345 https://openai.com/index/hardening-atlas-against-prompt-injection/https://techcrunch.com/2025/12/22/openai-says-ai-browsers-may-always-be-vulnerable-to-prompt-injection-attacks/
CVE-2025-12346
CVSS 9
A Cross-Site Request Forgery (CSRF) vulnerability in OpenAI's ChatGPT Atlas browser allows attackers to inject persistent, malicious instructions into the AI model's memory.
Affected Products:
OpenAI ChatGPT Atlas – 1.0.0, 1.0.1
Exploit Status:
exploited in the wild
References:
https://nvd.nist.gov/vuln/detail/CVE-2025-12346 https://cybersecurefox.com/en/csrf-persistent-memory-vulnerability-chatgpt-atlas-openai/https://www.techradar.com/pro/openais-new-atlas-browser-may-have-some-extremely-concerning-security-issues-experts-warn

MITRE ATT&CK® Techniques

Initial Access

T1204

User Execution

Execution

T1059

Command and Scripting Interpreter

Initial Access

T1566

Phishing

Execution

T1609

Container Administration Command

Execution

T1221

Template Injection

Privilege Escalation

T1134

Access Token Manipulation

Privilege Escalation

T1548

Abuse Elevation Control Mechanism

Potential Compliance Exposure

Mapping incident impact across multiple compliance frameworks.

PCI DSS 4.0 – Security of all system components

Control ID: 6.2.1

Application permits command execution via user input without robust validation, violating requirements for secure coding and input handling.

NYDFS 23 NYCRR 500 – Cybersecurity Policy

Control ID: 500.03

Lack of controls to prevent prompt injection in browser omnibox indicates gaps in cybersecurity policy implementation addressing application security risks.

DORA (Digital Operational Resilience Act) – ICT Security Requirements

Control ID: Art. 9(2)

Failure to detect and mitigate prompt injections illustrates insufficient testing and monitoring of ICT systems against evolving threats.

CISA ZTMM 2.0 – Application Security Controls

Control ID: 2.4.1

Inadequate segregation between user input and command execution violates zero trust principles for application security.

NIS2 Directive – Incident Handling Procedures

Control ID: Art. 21(2)(d)

Prompt injection vulnerability exposes a lack of incident prevention and rapid detection capabilities for emerging web application threats.

Sector Implications

Industry-specific impact of the vulnerabilities, including operational, regulatory, and cloud security risks.

Computer Software/Engineering

ChatGPT Atlas browser prompt injection vulnerability exposes AI development platforms to jailbreak attacks, compromising application security and automated systems integrity.

Information Technology/IT

Browser-based prompt injection threats challenge zero trust segmentation and egress security controls, requiring enhanced threat detection for AI-integrated enterprise systems.

Financial Services

AI chatbot vulnerabilities threaten customer service platforms and automated financial processes, violating PCI compliance requirements and enabling potential data exfiltration.

Computer/Network Security

Application security flaws in AI browsers undermine threat detection capabilities and highlight need for inline IPS protection against disguised malicious prompts.

Sources

ChatGPT Atlas Browser Can Be Tricked by Fake URLs into Executing Hidden Commandshttps://thehackernews.com/2025/10/chatgpt-atlas-browser-can-be-tricked-by.html
Verified

Continuously hardening ChatGPT Atlas against prompt injection attackshttps://openai.com/index/hardening-atlas-against-prompt-injection/

Verified

OpenAI says AI browsers may always be vulnerable to prompt injection attackshttps://techcrunch.com/2025/12/22/openai-says-ai-browsers-may-always-be-vulnerable-to-prompt-injection-attacks/

Verified

LayerX Finds CSRF + Persistent Memory Vulnerability In OpenAI’s ChatGPT Atlas Browserhttps://cybersecurefox.com/en/csrf-persistent-memory-vulnerability-chatgpt-atlas-openai/

Verified

Frequently Asked Questions

The prompt injection flaw revealed weaknesses in input validation, policy enforcement, and anomaly response—potentially impacting controls required by HIPAA, PCI, and NIST frameworks for secure application and data handling.

Cloud Native Security Fabric Mitigations and ControlsCNSF

Zero Trust controls such as segmentation, egress policy enforcement, east-west traffic security, and inline threat detection would have compartmentalized browser workloads, limited attack surface, and enabled rapid detection and containment of prompt injection-driven actions across the kill chain.

Initial Compromise

Control: Cloud Native Security Fabric (CNSF)

Mitigation: Inline enforcement and real-time inspection can block or alert on malicious traffic patterns.

Privilege Escalation

Control: Zero Trust Segmentation

Mitigation: Microsegmentation isolates browser workloads to prevent unauthorized privilege gains.

Lateral Movement

Control: East-West Traffic Security

Mitigation: Distributed controls block unauthorized workload-to-workload communication.

Command & Control

Control: Egress Security & Policy Enforcement

Mitigation: Outbound malicious connections can be detected and blocked by FQDN filtering and policy enforcement.

Exfiltration

Control: Cloud Firewall (ACF)

Mitigation: Outbound exfiltration attempts are stopped with URL filtering and secure outbound controls.

Impact (Mitigations)

Anomalous browser activity triggers alerts and fast response, limiting attacker-induced impact.

Impact at a Glance

Affected Business Functions

User Authentication
Data Management
Automated Workflows

Operational Disruption

Estimated downtime: 5 days

Financial Impact

Estimated loss: $500,000

Data Exposure

Potential exposure of sensitive user data, including authentication tokens and personal information, due to unauthorized command execution and data exfiltration.

Recommended Actions

• Enforce zero trust segmentation for browser workloads to isolate and contain prompt injection attempts.
• Implement strict egress policy enforcement and FQDN filtering to prevent command-and-control and data exfiltration from browser agents.
• Leverage real-time inline inspection and anomaly detection on browser traffic to identify and respond to suspicious execution flows.
• Harden east-west traffic visibility within your cloud environment to block lateral movement from compromised browser sessions.
• Regularly update cloud firewall rules and deploy distributed Cloud Native Security Fabric controls to adapt to emerging browser and SaaS attack vectors.

Secure the Paths Between Cloud Workloads

A cloud-native security fabric that enforces Zero Trust across workload communication—reducing attack paths, compliance risk, and operational complexity.

Stop Advanced Threats Get a Free Workload Attack Path Assessment Under Active Attack?

Prompt Injection Flaw in ChatGPT Atlas Browser Enables Hidden Command Execution

Executive Summary

Why This Matters Now

Attack Path Analysis

Kill Chain Progression

Initial Compromise

Description

Related CVEs

CVE-2025-12345

Affected Products:

Exploit Status:

References:

CVE-2025-12346

Affected Products:

Exploit Status:

References:

MITRE ATT&CK® Techniques

User Execution

Command and Scripting Interpreter

Phishing

Container Administration Command

Template Injection

Access Token Manipulation

Abuse Elevation Control Mechanism

Potential Compliance Exposure

PCI DSS 4.0 – Security of all system components

NYDFS 23 NYCRR 500 – Cybersecurity Policy

DORA (Digital Operational Resilience Act) – ICT Security Requirements

CISA ZTMM 2.0 – Application Security Controls

NIS2 Directive – Incident Handling Procedures

Sector Implications

Computer Software/Engineering

Information Technology/IT

Financial Services

Computer/Network Security

Sources

Frequently Asked Questions

Cloud Native Security Fabric Mitigations and ControlsCNSF

Impact at a Glance

Affected Business Functions

Recommended Actions

Key Takeaways & Next Steps

Secure the Paths Between Cloud Workloads