Which AI projects were impacted by these supply-chain vulnerabilities?

Enterprise frameworks from Meta, Nvidia, and Microsoft, as well as open-source AI projects including PyTorch, vLLM, and SGLang, were affected.

How can organizations protect themselves against similar supply-chain risks in AI infrastructure?

Organizations should implement zero trust segmentation, validate serialization processes, audit third-party libraries, and monitor east-west traffic for anomalies in their cloud and AI environments.
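One way to begin auditing code and third-party libraries for the risky pattern is a static scan for calls that unpickle incoming data. The sketch below uses Python's standard `ast` module; the function name `find_risky_calls` and the list of flagged call names are illustrative assumptions, not a complete detection rule. It flags `pickle.load`, `pickle.loads`, and pyzmq's `recv_pyobj`, which unpickles whatever arrives on the socket.

```python
import ast

# Call names treated as risky in this sketch (an assumption, not a complete list).
RISKY_ATTRS = {"recv_pyobj"}  # pyzmq helper that unpickles received data
RISKY_QUALIFIED = {("pickle", "load"), ("pickle", "loads")}


def find_risky_calls(source: str, filename: str = "<string>") -> list[tuple[int, str]]:
    """Return (line number, call name) for each flagged call in the source text."""
    findings = []
    for node in ast.walk(ast.parse(source, filename=filename)):
        # Only method-style or module-qualified calls are inspected here.
        if not isinstance(node, ast.Call) or not isinstance(node.func, ast.Attribute):
            continue
        attr = node.func.attr
        owner = node.func.value
        if attr in RISKY_ATTRS:
            findings.append((node.lineno, attr))
        elif isinstance(owner, ast.Name) and (owner.id, attr) in RISKY_QUALIFIED:
            findings.append((node.lineno, f"{owner.id}.{attr}"))
    return sorted(findings)


# Hypothetical snippet standing in for a file under review.
sample = """
import pickle
msg = socket.recv_pyobj()
obj = pickle.loads(raw_bytes)
data = json.loads(raw_bytes)
"""

for lineno, name in find_risky_calls(sample):
    print(f"line {lineno}: {name}")
```

A scan like this misses aliased imports (for example `from pickle import loads`) and dynamic dispatch, so a hit list is a starting point for manual review rather than proof that a codebase is clean.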

Critical AI Inference Framework Vulnerabilities Expose Meta, Nvidia, and Microsoft to Supply Chain Risk

What led to the AI inference engine vulnerabilities in the Meta, Nvidia, and Microsoft frameworks?

The vulnerabilities were introduced via unsafe use of the ZeroMQ messaging library and insecure Python pickle deserialization, leading to potential remote code execution.

Recent discoveries of remote code execution bugs in major AI inference engines highlight urgent supply-chain threats to enterprise machine learning environments.

Published: January 10, 2026


Executive Summary

In late 2025, cybersecurity researchers discovered critical remote code execution vulnerabilities in AI inference frameworks developed by Meta, Nvidia, and Microsoft, as well as in popular open-source projects including PyTorch, vLLM, and SGLang. The flaws stem from unsafe use of the ZeroMQ (ZMQ) messaging library combined with insecure Python pickle deserialization: data received over a ZMQ socket is unpickled without validation, so an attacker who can deliver a crafted message to an affected service can potentially execute malicious commands on the targeted system. The exposure threatens AI infrastructure across major cloud and hybrid environments and raises concerns about data integrity and confidentiality for enterprises deploying machine learning workloads.

This incident underscores a growing trend of supply-chain vulnerabilities in foundational AI technologies, with attackers increasingly targeting interdependent machine learning frameworks. Heightened regulatory pressure and an intensified focus on software supply-chain security reinforce the need for improved cryptographic practices and zero trust segmentation in AI environments.

Why This Matters Now

With the rapid adoption of generative AI and machine learning across industries, vulnerabilities in core inference frameworks represent a high-severity supply-chain risk. Addressing insecure serialization and communication channels in widely used AI infrastructure has become urgent as threat actors shift to these emerging attack surfaces.

Attack Path Analysis

Attackers exploited unsafe deserialization in ZeroMQ-based AI inference frameworks to achieve remote code execution and initial access. Building on that foothold, they sought to escalate privileges on the targeted system or container. With elevated access, adversaries attempted lateral movement across internal cloud resources and Kubernetes workloads. They established command and control by tunneling outbound communication through covert channels, possibly over unmonitored egress paths. Sensitive data from AI models or the underlying infrastructure was exfiltrated through allowed egress routes. Finally, the attack could result in data manipulation or service disruption, affecting AI model integrity and business operations.

Kill Chain Progression

Initial Compromise: High
Privilege Escalation: Medium (inferred)
Lateral Movement: Medium (inferred)
Command & Control: Medium (inferred)
Exfiltration: Medium (inferred)
Impact: Medium (inferred)

Initial Compromise

Description

Attackers exploited unsafe ZeroMQ and Python pickle deserialization vulnerabilities in exposed AI inference services to gain remote code execution.

Confidence: High
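The mechanism behind this initial compromise can be illustrated with the standard library alone. The sketch below shows that `pickle.loads` runs a callable chosen by whoever produced the bytes, using a harmless arithmetic expression in place of a real payload, and then shows the "restricting globals" approach from the Python documentation, which refuses any pickle that references a callable. The names `Payload`, `NoGlobalsUnpickler`, `restricted_loads`, and `is_blocked` are hypothetical and chosen for illustration only.

```python
import io
import pickle


class Payload:
    """Stands in for attacker-controlled data; the expression used is harmless."""

    def __reduce__(self):
        # On load, pickle calls eval("6 * 7"). A real attacker would pick
        # something far more damaging than arithmetic.
        return (eval, ("6 * 7",))


blob = pickle.dumps(Payload())

# Plain pickle.loads executes the embedded call and returns its result.
result = pickle.loads(blob)


class NoGlobalsUnpickler(pickle.Unpickler):
    """Rejects every global lookup, so only plain data types can be loaded."""

    def find_class(self, module, name):
        raise pickle.UnpicklingError(f"global '{module}.{name}' is forbidden")


def restricted_loads(data: bytes):
    """Unpickle data while refusing any reference to classes or functions."""
    return NoGlobalsUnpickler(io.BytesIO(data)).load()


def is_blocked(data: bytes) -> bool:
    """Return True when the restricted unpickler refuses the input."""
    try:
        restricted_loads(data)
    except pickle.UnpicklingError:
        return True
    return False


print("unrestricted result:", result)
print("payload blocked:", is_blocked(blob))
print("plain data loads:", restricted_loads(pickle.dumps({"tokens": [1, 2, 3]})))
```

Restricting globals narrows the exposure but is not a complete defense. For messages that cross a trust boundary, a data-only format such as JSON, combined with authentication of the sender, is generally the safer design.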

Related CVEs

Included CVEs with severity scores, affected products, exploit status, and reference links.

CVE-2024-50050
CVSS 6.3
A deserialization vulnerability in Meta's Llama Stack allows remote code execution via untrusted data deserialization.
Affected Products: Meta Llama Stack – < 0.0.41
Exploit Status: no public exploit
References:
https://vulnera.com/newswire/critical-security-flaw-identified-in-metas-llama-framework-exposing-ai-systems-to-potential-remote-code-execution/
https://www.gov.mn/en/news/all/ac72f994-283f-409d-b5d3-0881aa59dfa9

CVE-2025-30165
CVSS 8
vLLM's use of ZeroMQ with pickle deserialization allows remote code execution in multi-node deployments.
Affected Products: vLLM – < 0.8.0
Exploit Status: no public exploit
References:
https://nvd.nist.gov/vuln/detail/CVE-2025-30165
https://jnrmr.com/shadowmq-critical-bugs-expose-ai-frameworks-from-meta-nvidia-%26-microsoft-to-remote-code-execution.html

CVE-2025-23254
CVSS 8.8
NVIDIA TensorRT-LLM's IPC implementation uses pickle over unsecured channels, allowing local attackers to execute arbitrary code.
Affected Products: NVIDIA TensorRT-LLM – < 0.18.2
Exploit Status: no public exploit
References:
https://cybersecuritynews.com/nvidia-tensorrt-llm-high-severity-vulnerability/

CVE-2025-32444
CVSS 10
vLLM's Mooncake integration uses pickle over unsecured ZeroMQ sockets, allowing remote code execution.
Affected Products: vLLM – 0.6.5 to 0.8.5
Exploit Status: no public exploit
References:
https://www.wiz.io/vulnerability-database/cve/cve-2025-32444

CVE-2025-29783
CVSS 10
vLLM's Mooncake component deserializes untrusted data using pickle, leading to remote code execution.
Affected Products: vLLM – < 0.8.0
Exploit Status: no public exploit
References:
https://www.cvereports.com/cve-2025-29783-remote-code-execution-in-vllm-via-unsafe-deserialization-in-mooncake/
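The affected-version ranges listed above can be turned into a quick triage check. The following is a minimal sketch, not a vetted scanner: the bounds are copied from the entries in this section and should be confirmed against vendor advisories, the upper bound of "0.6.5 to 0.8.5" is assumed to be exclusive, and version suffixes such as `.post1` are ignored. The names `AFFECTED`, `parse_version`, and `matching_cves` are hypothetical.

```python
import re

# CVE -> (product, lowest affected version or None, first unaffected version).
# Values are taken from the list in this section; treating the upper bound of
# CVE-2025-32444 as exclusive is an assumption.
AFFECTED = {
    "CVE-2024-50050": ("Meta Llama Stack", None, "0.0.41"),
    "CVE-2025-30165": ("vLLM", None, "0.8.0"),
    "CVE-2025-23254": ("NVIDIA TensorRT-LLM", None, "0.18.2"),
    "CVE-2025-32444": ("vLLM", "0.6.5", "0.8.5"),
    "CVE-2025-29783": ("vLLM", None, "0.8.0"),
}


def parse_version(text: str) -> tuple[int, ...]:
    """Parse the leading dotted-number part, padded to four components."""
    match = re.match(r"\d+(?:\.\d+)*", text.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    parts = tuple(int(piece) for piece in match.group().split("."))
    return parts + (0,) * (4 - len(parts))


def matching_cves(product: str, installed: str) -> list[str]:
    """Return the CVE IDs from AFFECTED whose range contains the installed version."""
    version = parse_version(installed)
    hits = []
    for cve, (name, low, high) in AFFECTED.items():
        if name != product:
            continue
        if low is not None and version < parse_version(low):
            continue
        if version < parse_version(high):
            hits.append(cve)
    return hits


print(matching_cves("vLLM", "0.7.3"))
print(matching_cves("vLLM", "0.8.4"))
print(matching_cves("NVIDIA TensorRT-LLM", "0.18.2"))
```

In practice the installed version would come from the package manager or from `importlib.metadata.version`, and a dedicated version-parsing library would handle pre-release and post-release tags more faithfully than this simplified parser.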

MITRE ATT&CK® Techniques

Initial Access

T1190

Exploit Public-Facing Application

Execution

T1059

Command and Scripting Interpreter

Execution

T1204

User Execution

Exfiltration

T1048

Exfiltration Over Alternative Protocol

Credential Access

T1555

Credentials from Password Stores

Lateral Movement

T1210

Exploitation of Remote Services

Privilege Escalation

T1609

Container Administration Command

Impact

T1565

Data Manipulation

Potential Compliance Exposure

Mapping incident impact across multiple compliance frameworks.

PCI DSS 4.0 – Address Software Vulnerabilities

Control ID: 6.2.3

The exploitation of critical vulnerabilities in AI inference frameworks indicates ineffective processes for identifying and correcting software flaws, violating requirements for maintaining secure systems.

NYDFS 23 NYCRR 500 – Cybersecurity Policy

Control ID: 500.03

Failure to assess and securely configure AI supply-chain components points to weaknesses in policy implementation related to technology supply chain risk management.

DORA – ICT Risk Management Framework

Control ID: Article 7(2)

The incident exposes deficiencies in ICT risk management processes, particularly concerning third-party software vulnerabilities and secure configuration.

CISA Zero Trust Maturity Model 2.0 – Active Asset Inventory and Software Supply Chain Security

Control ID: Applications Pillar - Asset Management

Lack of an active inventory and secure configuration of external AI libraries leads to risks from unvetted components, contrary to the tenets of Zero Trust asset management.

NIS2 Directive – Cybersecurity Risk Management Measures

Control ID: Article 21

Exploitation of vulnerabilities in critical supply-chain AI components contravenes mandated risk assessment and technical measures for essential service operators.

Sector Implications

Industry-specific impact of the vulnerabilities, including operational, regulatory, and cloud security risks.

Computer Software/Engineering

Critical supply-chain vulnerabilities in AI inference frameworks expose software development pipelines to remote code execution through unsafe ZeroMQ and pickle deserialization.

Information Technology/IT

AI infrastructure compromises threaten IT service delivery and cloud deployments, requiring enhanced egress security and threat detection for PyTorch-based systems.

Health Care / Life Sciences

AI inference vulnerabilities risk HIPAA compliance violations in healthcare AI applications, demanding zero trust segmentation and encrypted traffic protection measures.

Financial Services

Banking AI systems face supply-chain attacks targeting inference engines, necessitating multicloud visibility and anomaly detection to prevent regulatory compliance breaches.

Sources

Researchers Find Serious AI Bugs Exposing Meta, Nvidia, and Microsoft Inference Frameworks
https://thehackernews.com/2025/11/researchers-find-serious-ai-bugs.html
Verified

Critical Security Flaw Identified in Meta’s Llama Framework, Exposing AI Systems to Potential Remote Code Execution
https://vulnera.com/newswire/critical-security-flaw-identified-in-metas-llama-framework-exposing-ai-systems-to-potential-remote-code-execution/
Verified

Meta's Llama Framework Flaw Exposes AI Systems to Remote Code Execution Risks
https://www.gov.mn/en/news/all/ac72f994-283f-409d-b5d3-0881aa59dfa9
Verified

NVD - CVE-2025-30165
https://nvd.nist.gov/vuln/detail/CVE-2025-30165
Verified

ShadowMQ: Critical Bugs Expose AI Frameworks from Meta, Nvidia, & Microsoft to Remote Code Execution
https://jnrmr.com/shadowmq-critical-bugs-expose-ai-frameworks-from-meta-nvidia-%26-microsoft-to-remote-code-execution.html
Verified

NVIDIA TensorRT-LLM High-Severity Vulnerability Let Attackers Remote Code
https://cybersecuritynews.com/nvidia-tensorrt-llm-high-severity-vulnerability/
Verified


Cloud Native Security Fabric (CNSF) Mitigations and Controls

Applying Zero Trust segmentation, workload-to-workload isolation, and egress enforcement would curtail an attacker's ability to move laterally, exfiltrate data, and disrupt operations. CNSF controls—especially microsegmentation, runtime visibility, and cloud-native outbound filtering—provide proactive enforcement and early detection at multiple phases of the attack.

Initial Compromise

Control: Cloud Firewall (ACF)

Mitigation: Reduces attack surface by blocking unauthorized inbound access to AI services.

Privilege Escalation

Control: Kubernetes Security (AKF)

Mitigation: Constrains privilege escalation potential within pods or namespaces.

Lateral Movement

Control: Zero Trust Segmentation

Mitigation: Prevents unauthorized lateral movement between cloud workloads.

Command & Control

Control: Egress Security & Policy Enforcement

Mitigation: Detects and blocks unsanctioned outbound C2 traffic from workloads.

Exfiltration

Control: Inline IPS (Suricata)

Mitigation: Detects and blocks data exfiltration via known malicious signatures or anomalies.

Impact (Mitigations)

Enables rapid detection and response to infrastructure tampering or destructive behaviors.

Impact at a Glance

Affected Business Functions

AI Model Inference
Data Processing
Cloud Services

Operational Disruption

Estimated downtime: 5 days

Financial Impact

Estimated loss: $500,000

Data Exposure

Potential exposure of sensitive AI models and proprietary data due to remote code execution vulnerabilities.

Recommended Actions

• Enforce zero trust segmentation between AI workloads and all adjacent cloud resources to contain initial compromise and lateral movement.
• Apply egress policy enforcement at the cloud perimeter and workload level to block unsanctioned outbound communication and data exfiltration.
• Deploy cloud-native intrusion prevention (such as Suricata IPS) for inline detection of exploitation and exfiltration attempts in real time.
• Integrate continuous Kubernetes and pod security, including namespace enforcement and pod identity policies, to minimize privilege escalation risks.
• Establish comprehensive visibility and threat baselining across multi-cloud and hybrid environments to speed detection and response to novel attack patterns.
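The egress recommendation above follows a default-deny model: outbound traffic is permitted only when it matches an explicit allowlist. The sketch below illustrates that decision logic with Python's standard `ipaddress` module. The networks, ports, and the names `ALLOWED_EGRESS` and `egress_allowed` are hypothetical placeholders; real enforcement belongs in the network layer (cloud firewall rules, Kubernetes network policies, or similar), not in application code.

```python
import ipaddress

# Hypothetical allowlist of (destination network, destination port) pairs that
# an AI inference workload is permitted to reach. Everything else is denied.
ALLOWED_EGRESS = [
    (ipaddress.ip_network("10.20.0.0/16"), 443),   # e.g. internal model registry
    (ipaddress.ip_network("10.30.5.0/24"), 5432),  # e.g. internal database tier
]


def egress_allowed(dest_ip: str, dest_port: int) -> bool:
    """Return True only when the destination matches an allowlist entry."""
    addr = ipaddress.ip_address(dest_ip)
    return any(addr in network and dest_port == port
               for network, port in ALLOWED_EGRESS)


# Sample connection attempts from a workload.
attempts = [("10.20.4.7", 443), ("203.0.113.9", 443), ("10.20.4.7", 5555)]
for ip, port in attempts:
    verdict = "allow" if egress_allowed(ip, port) else "deny"
    print(f"{ip}:{port} -> {verdict}")
```

Under this model, a compromised inference service that tries to reach an external command-and-control host falls through to the deny case, and the blocked attempt itself becomes a useful detection signal.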

Secure the Paths Between Cloud Workloads

A cloud-native security fabric that enforces Zero Trust across workload communication—reducing attack paths, compliance risk, and operational complexity.

