How can organizations mitigate this vulnerability?

Organizations should update SGLang to a patched version and implement secure template rendering practices, such as using ImmutableSandboxedEnvironment for Jinja2 templates.

What are the potential impacts of this vulnerability?

Exploitation could lead to data exfiltration, system manipulation, denial-of-service attacks, or full system compromise.

Critical RCE Vulnerability in SGLang via Malicious GGUF Model Files

Discover how CVE-2026-5760 exposes SGLang to remote code execution and learn the steps to secure your AI deployments.

Published: April 20, 2026

Share this on:

Executive Summary

In April 2026, a critical vulnerability (CVE-2026-5760) was identified in SGLang, an open-source framework for serving large language models. The flaw resides in the reranking endpoint (/v1/rerank), where unsandboxed Jinja2 template rendering allows remote code execution (RCE) when processing malicious GPT-Generated Unified Format (GGUF) model files. Exploitation enables attackers to execute arbitrary code on the server, potentially leading to data exfiltration, system manipulation, or denial-of-service attacks. (kb.cert.org)

This incident underscores the importance of secure template rendering practices in AI model serving frameworks. Organizations utilizing SGLang should promptly update to a patched version and implement recommended mitigations to prevent exploitation. (thehackernews.com)

Why This Matters Now

The rapid adoption of AI and machine learning frameworks increases the attack surface for cyber threats. Ensuring the security of these systems is paramount to prevent potential breaches and maintain trust in AI deployments.

Attack Path Analysis

An attacker crafts a malicious GGUF model file with a Jinja2 SSTI payload and distributes it through public repositories. A victim downloads and loads the malicious model into SGLang, triggering the unsandboxed template rendering vulnerability. The attacker gains remote code execution on the SGLang server, potentially escalating privileges to access sensitive data. The compromised server allows the attacker to move laterally within the network, targeting other systems. The attacker establishes a command and control channel to maintain persistent access. Finally, the attacker exfiltrates sensitive data from the compromised systems.

Kill Chain Progression

Initial Compromise

High

Privilege Escalation

Medium

Lateral Movement

Medium

Command & Control

Medium

Exfiltration

Medium

Impact

Lowinferred

Initial Compromise

Description

An attacker crafts a malicious GGUF model file with a Jinja2 SSTI payload and distributes it through public repositories.

Confidence:

High

Related CVEs

Included CVEs with severity scores, affected products, exploit status, and reference links.

CVE-2026-3060
CVSS 9.8
SGLang's encoder parallel disaggregation system is vulnerable to unauthenticated remote code execution through the disaggregation module, which deserializes untrusted data using pickle.loads() without authentication.
Affected Products:
LMSys SGLang – 0.5.5, 0.5.6, 0.5.7, 0.5.8, 0.5.9
Exploit Status:
proof of concept
References:
https://nvd.nist.gov/vuln/detail/CVE-2026-3060 https://github.com/sgl-project/sglang/releases/tag/v0.5.10 https://orca.security/resources/blog/sglang-llm-framework-rce-vulnerabilities/
CVE-2026-3059
CVSS 9.8
SGLang's multimodal generation module is vulnerable to unauthenticated remote code execution through the ZMQ broker, which deserializes untrusted data using pickle.loads() without authentication.
Affected Products:
LMSys SGLang – 0.5.5, 0.5.6, 0.5.7, 0.5.8, 0.5.9
Exploit Status:
proof of concept
References:
https://nvd.nist.gov/vuln/detail/CVE-2026-3059 https://github.com/sgl-project/sglang/releases/tag/v0.5.10 https://orca.security/resources/blog/sglang-llm-framework-rce-vulnerabilities/

MITRE ATT&CK® Techniques

Execution

T1203

Exploitation for Client Execution

Execution

T1059.006

Command and Scripting Interpreter: Python

Initial Access

T1190

Exploit Public-Facing Application

Defense Evasion

T1202

Indirect Command Execution

Defense Evasion

T1055

Process Injection

Potential Compliance Exposure

Mapping incident impact across multiple compliance frameworks.

PCI DSS 4.0 – Ensure all system components are protected from known vulnerabilities

Control ID: 6.2

The exploitation of CVE-2026-5760 indicates a failure to protect system components from known vulnerabilities, violating PCI DSS requirement 6.2.

NYDFS 23 NYCRR 500 – Cybersecurity Policy

Control ID: 500.03

The incident suggests inadequacies in the cybersecurity policy, particularly in addressing vulnerabilities and implementing secure coding practices as required by NYDFS 500.03.

DORA – ICT Risk Management Framework

Control ID: Article 5

The breach highlights deficiencies in the ICT risk management framework, failing to identify and mitigate risks associated with third-party software components, contravening DORA Article 5.

CISA ZTMM 2.0 – Applications and Workloads

Control ID: Pillar 3

The exploitation of the vulnerability indicates a lapse in securing applications and workloads, undermining the principles outlined in CISA ZTMM 2.0 Pillar 3.

NIS2 Directive – Cybersecurity Risk Management Measures

Control ID: Article 21

The incident reflects a failure to implement appropriate cybersecurity risk management measures, as mandated by NIS2 Directive Article 21.

Sector Implications

Industry-specific impact of the vulnerabilities, including operational, regulatory, and cloud security risks.

Computer Software/Engineering

SGLang CVE-2026-5760 RCE vulnerability directly impacts software development organizations using AI/ML frameworks, requiring immediate supply-chain security controls and inline inspection capabilities.

Information Technology/IT

Critical CVSS 9.8 command injection vulnerability threatens IT infrastructure deploying AI models, necessitating enhanced egress filtering and zero trust segmentation policies.

Health Care / Life Sciences

Supply-chain compromise via malicious GGUF models poses severe HIPAA compliance risks, demanding encrypted traffic monitoring and multicloud visibility for protected health information.

Financial Services

Remote code execution through compromised AI models creates substantial financial data exposure risks, requiring threat detection capabilities and PCI compliance enforcement mechanisms.

Sources

SGLang CVE-2026-5760 (CVSS 9.8) Enables RCE via Malicious GGUF Model Fileshttps://thehackernews.com/2026/04/sglang-cve-2026-5760-cvss-98-enables.html
Verified

SGLang LLM Framework RCE Vulnerabilitieshttps://orca.security/resources/blog/sglang-llm-framework-rce-vulnerabilities/

Verified

SGLang Release v0.5.10https://github.com/sgl-project/sglang/releases/tag/v0.5.10

Verified

Frequently Asked Questions

CVE-2026-5760 is a critical vulnerability in SGLang's reranking endpoint that allows remote code execution via malicious GGUF model files.

Cloud Native Security Fabric Mitigations and ControlsCNSF

Aviatrix Zero Trust CNSF is pertinent to this incident as it embeds security directly into the cloud fabric, potentially reducing the attacker's ability to move laterally and exfiltrate data.

Initial Compromise

Control: Cloud Native Security Fabric (CNSF)

Mitigation: The attacker's ability to distribute malicious files through public repositories would likely be constrained, reducing the risk of initial compromise.

Privilege Escalation

Control: Zero Trust Segmentation

Mitigation: The attacker's ability to escalate privileges and access sensitive data would likely be constrained, reducing the risk of unauthorized access.

Lateral Movement

Control: East-West Traffic Security

Mitigation: The attacker's ability to move laterally within the network would likely be constrained, reducing the risk of further system compromises.

Command & Control

Control: Multicloud Visibility & Control

Mitigation: The attacker's ability to establish and maintain command and control channels would likely be constrained, reducing the risk of persistent access.

Exfiltration

Control: Egress Security & Policy Enforcement

Mitigation: The attacker's ability to exfiltrate sensitive data would likely be constrained, reducing the risk of data loss.

Impact (Mitigations)

The attacker's ability to disrupt services or deploy ransomware would likely be constrained, reducing the risk of operational impact.

Impact at a Glance

Affected Business Functions

Model Deployment
Inference Services
Data Processing Pipelines

Operational Disruption

Estimated downtime: 7 days

Financial Impact

Estimated loss: $500,000

Data Exposure

Potential exposure of proprietary model data and sensitive client information.

Recommended Actions

• Implement Zero Trust Segmentation to restrict access between workloads and prevent lateral movement.
• Deploy Inline IPS (Suricata) to detect and block malicious payloads in network traffic.
• Utilize Cloud Firewall (ACF) to enforce egress filtering and prevent unauthorized outbound connections.
• Enhance Threat Detection & Anomaly Response capabilities to identify and respond to suspicious activities promptly.
• Regularly update and patch systems to mitigate known vulnerabilities and reduce the attack surface.

Secure the Paths Between Cloud Workloads

A cloud-native security fabric that enforces Zero Trust across workload communication—reducing attack paths, compliance risk, and operational complexity.

Stop Advanced Threats Get a Free Workload Attack Path Assessment Under Active Attack?

Critical RCE Vulnerability in SGLang via Malicious GGUF Model Files

Executive Summary

Why This Matters Now

Attack Path Analysis

Kill Chain Progression

Initial Compromise

Description

Related CVEs

CVE-2026-3060

Affected Products:

Exploit Status:

References:

CVE-2026-3059

Affected Products:

Exploit Status:

References:

MITRE ATT&CK® Techniques

Exploitation for Client Execution

Command and Scripting Interpreter: Python

Exploit Public-Facing Application

Indirect Command Execution

Process Injection

Potential Compliance Exposure

PCI DSS 4.0 – Ensure all system components are protected from known vulnerabilities

NYDFS 23 NYCRR 500 – Cybersecurity Policy

DORA – ICT Risk Management Framework

CISA ZTMM 2.0 – Applications and Workloads

NIS2 Directive – Cybersecurity Risk Management Measures

Sector Implications

Computer Software/Engineering

Information Technology/IT

Health Care / Life Sciences

Financial Services

Sources

Frequently Asked Questions

Cloud Native Security Fabric Mitigations and ControlsCNSF

Impact at a Glance

Affected Business Functions

Recommended Actions

Key Takeaways & Next Steps

Secure the Paths Between Cloud Workloads