Executive Summary
In 2025, a significant AI model extraction attack was identified in which adversaries systematically queried a proprietary machine learning model's API to replicate its functionality. By sending carefully crafted inputs and analyzing the outputs, the attackers reconstructed a substitute model that closely mirrored the original's behavior. The breach exposed the model's intellectual property, creating potential competitive disadvantages and financial losses for the organization, and it underscores the vulnerabilities inherent in exposing AI models through APIs without adequate security measures. (techtarget.com)
The rise of such model extraction attacks highlights the urgent need for organizations to implement robust defenses, including rate limiting, output perturbation, and behavioral monitoring, to protect their AI assets from unauthorized replication and misuse. (snyk.io)
Why This Matters Now
As AI models become integral to business operations, the threat of model extraction attacks poses significant risks to intellectual property and competitive advantage. Organizations must prioritize securing their AI systems to prevent unauthorized replication and potential misuse.
Attack Path Analysis
An adversary exploited unrestricted access to a machine learning model's API to systematically query the model, collecting input-output pairs. Using this data, they trained a substitute model that closely mimicked the original's behavior. The attacker then utilized this replica model to develop adversarial inputs, potentially compromising the integrity of the original system.
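The query-and-replicate loop described above can be sketched end to end with a stand-in victim model. Everything here is illustrative, not drawn from the incident: the victim's weights, the `query_victim` function, and the simple perceptron substitute are assumptions used to show the mechanics.

```python
# Sketch of a model extraction attack against a prediction API.
# The "victim" is simulated locally; in a real attack, query_victim
# would be an HTTP call to the exposed model endpoint.
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for the proprietary model behind the API: a fixed linear rule.
VICTIM_WEIGHTS = np.array([1.5, -2.0])

def query_victim(x: np.ndarray) -> int:
    """Simulates one API call: input in, label out, no internals exposed."""
    return int(x @ VICTIM_WEIGHTS > 0)

# Step 1: systematically query the API with crafted inputs,
# collecting input-output pairs.
queries = rng.uniform(-1, 1, size=(500, 2))
labels = np.array([query_victim(x) for x in queries])

# Step 2: train a substitute model on the collected pairs
# (a plain perceptron suffices for this linearly separable toy case).
w = np.zeros(2)
for _ in range(20):
    for x, y in zip(queries, labels):
        pred = int(x @ w > 0)
        w += (y - pred) * x

# Step 3: the substitute now mirrors the victim's decision boundary and
# can be probed offline, e.g. to craft adversarial inputs.
test_points = rng.uniform(-1, 1, size=(200, 2))
agreement = float(np.mean(
    [int(x @ w > 0) == query_victim(x) for x in test_points]))
print(f"substitute/victim agreement: {agreement:.2f}")
```

Note that the attacker never sees the victim's weights; agreement between the two models comes purely from observed input-output behavior, which is why rate limiting and output perturbation (discussed later) target exactly this query loop.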
Kill Chain Progression
Initial Compromise
Description
The adversary gained access to the machine learning model's API, which lacked proper access controls, allowing unrestricted querying.
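The missing control at this stage is basic client authentication in front of the prediction endpoint. A minimal, framework-free sketch follows; the key store, client IDs, and handler names are hypothetical, and the "model" is a placeholder.

```python
# Minimal sketch: require a per-client API key before serving predictions.
import hashlib
import hmac

# Server-side store of hashed keys (never store raw keys).
API_KEY_HASHES = {"client-a": hashlib.sha256(b"secret-key-a").hexdigest()}

def is_authorized(client_id: str, presented_key: str) -> bool:
    """Constant-time comparison of the presented key's hash against the store."""
    expected = API_KEY_HASHES.get(client_id)
    if expected is None:
        return False
    presented = hashlib.sha256(presented_key.encode()).hexdigest()
    return hmac.compare_digest(expected, presented)

def handle_predict(client_id: str, presented_key: str, features):
    """Reject the request before the model is ever invoked."""
    if not is_authorized(client_id, presented_key):
        return {"status": 401, "error": "unauthorized"}
    # ... invoke the real model only after the caller is authenticated ...
    return {"status": 200, "prediction": sum(features) > 0}  # placeholder model
```

Authentication alone does not stop extraction by a credentialed client, but it ties every query to an identity, which is the prerequisite for the rate limiting and usage auditing recommended later in this report.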
Related CVEs
CVE-2025-12058
CVSS 5.9 – A vulnerability in Keras models allows arbitrary file access and server-side request forgery (SSRF) due to improper handling of model imports.
Affected Products:
Keras – all versions prior to the fix
Exploit Status:
proof of concept

CVE-2024-6868
CVSS 9.8 – mudler/LocalAI version 2.17.1 allows arbitrary file write due to improper handling of automatic archive extraction, leading to potential remote code execution.
Affected Products:
mudler LocalAI – 2.17.1
Exploit Status:
no public exploit

CVE-2021-41127
CVSS 7.1 – Rasa versions prior to 2.8.10 contain a Zip Slip vulnerability in model loading functionality, allowing potential arbitrary writes within specific directories.
Affected Products:
Rasa – versions before 2.8.10
Exploit Status:
no public exploit

CVE-2023-48299
CVSS 5.3 – TorchServe versions 0.1.0 to 0.9.0 contain a Zip Slip vulnerability in the model/workflow management API, allowing extraction of harmful archives to any location on the filesystem.
Affected Products:
TorchServe – 0.1.0 to 0.9.0
Exploit Status:
no public exploit
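The Zip Slip flaws above (CVE-2021-41127, CVE-2023-48299) arise when archive entries with path-traversal names like `../../etc/cron.d/x` are extracted without validation. A minimal defensive check, sketched with Python's standard `zipfile`, resolves each entry's real path and rejects anything that escapes the destination directory; the function name is illustrative. (Recent CPython versions of `ZipFile.extract` already strip such components, but the explicit check shows the validation the vulnerable model loaders lacked.)

```python
# Sketch: refuse to extract archive entries that resolve outside dest_dir.
import os
import zipfile

def safe_extract(archive_path: str, dest_dir: str) -> None:
    """Extract a zip archive, rejecting Zip Slip (path traversal) entries."""
    dest = os.path.realpath(dest_dir)
    with zipfile.ZipFile(archive_path) as zf:
        for info in zf.infolist():
            target = os.path.realpath(os.path.join(dest, info.filename))
            # An entry like "../evil.txt" resolves outside dest: reject it
            # before anything is written to disk.
            if os.path.commonpath([dest, target]) != dest:
                raise ValueError(f"blocked path traversal entry: {info.filename}")
        zf.extractall(dest)
```

The key design point is validating every entry before extracting any of them, so a mixed archive cannot partially write files before the malicious entry is detected.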
MITRE ATT&CK® Techniques
Obtain Capabilities: Artificial Intelligence
Application Layer Protocol: Web Protocols
Brute Force: Password Spraying
Remote Services: Remote Desktop Protocol
Data from Local System
Exfiltration Over C2 Channel
Potential Compliance Exposure
Mapping incident impact across multiple compliance frameworks.
PCI DSS 4.0 – Ensure that security policies and operational procedures for managing vulnerabilities are documented, in use, and known to all affected parties.
Control ID: 6.4.3
NYDFS 23 NYCRR 500 – Cybersecurity Policy
Control ID: 500.03
DORA – ICT Risk Management Framework
Control ID: Article 5
CISA ZTMM 2.0 – Implement robust authentication and authorization mechanisms.
Control ID: Identity Pillar: Authentication and Authorization
NIS2 Directive – Cybersecurity Risk Management Measures
Control ID: Article 21
Sector Implications
Industry-specific impact of the vulnerabilities, including operational, regulatory, and cloud security risks.
Computer Software/Engineering
AI model extraction attacks threaten proprietary ML models exposed via APIs, enabling intellectual property theft and competitive disadvantage through behavioral replication techniques.
Health Care / Life Sciences
Medical imaging and diagnostic ML models face extraction risks, potentially compromising patient data privacy and enabling unauthorized replication of specialized healthcare AI systems.
Financial Services
Fraud detection and risk assessment models are vulnerable to extraction attacks, allowing adversaries to understand decision boundaries and develop evasion techniques against financial controls.
Computer/Network Security
Security ML models protecting against threats become targets themselves, with extracted models revealing detection capabilities and enabling adversaries to craft targeted attack payloads.
Sources
- Stealing AI Models Through the API: A Practical Model Extraction Attack – https://www.praetorian.com/blog/stealing-ai-models-through-the-api-a-practical-model-extraction-attack/ (verified)
- Zscaler Discovers Vulnerability in Keras Models Allowing Arbitrary File Access and SSRF (CVE-2025-12058) – https://www.zscaler.com/blogs/security-research/zscaler-discovers-vulnerability-keras-models-allowing-arbitrary-file-access (verified)
- NVD - CVE-2024-6868 – https://nvd.nist.gov/vuln/detail/CVE-2024-6868 (verified)
- CVE-2021-41127 Impact, Exploitability, and Mitigation Steps | Wiz – https://www.wiz.io/vulnerability-database/cve/cve-2021-41127 (verified)
- CVE-2023-48299 Impact, Exploitability, and Mitigation Steps | Wiz – https://www.wiz.io/vulnerability-database/cve/cve-2023-48299 (verified)
Frequently Asked Questions
Cloud Native Security Fabric (CNSF) Mitigations and Controls
Aviatrix Zero Trust CNSF is pertinent to this incident as it could have restricted unauthorized access to the machine learning model's API, thereby limiting the attacker's ability to extract data and develop adversarial inputs.
Control: Cloud Native Security Fabric (CNSF)
Mitigation: Implementing Aviatrix CNSF would likely have restricted unauthorized access to the API, thereby preventing the adversary from initiating unrestricted queries.
Control: Zero Trust Segmentation
Mitigation: With Zero Trust Segmentation, the attacker's ability to escalate privileges by sending crafted inputs would likely have been constrained.
Control: East-West Traffic Security
Mitigation: East-West Traffic Security would likely have restricted the attacker's ability to move laterally and access other resources to train a substitute model.
Control: Multicloud Visibility & Control
Mitigation: Multicloud Visibility & Control would likely have detected and restricted unauthorized communications between the replica model and the original system.
Control: Egress Security & Policy Enforcement
Mitigation: Egress Security & Policy Enforcement would likely have prevented the exfiltration of sensitive data by controlling outbound traffic.
While prior controls would likely have mitigated earlier stages, the residual risk includes potential exposure of proprietary algorithms and data.
Impact at a Glance
Affected Business Functions
- AI Model Development
- API Services
- Intellectual Property Management
Estimated downtime: N/A
Estimated loss: N/A
Potential exposure of proprietary AI model architectures and training data.
Recommended Actions
Key Takeaways & Next Steps
- Implement strict access controls and authentication mechanisms for all machine learning model APIs to prevent unauthorized access.
- Monitor and limit the rate of API queries to detect and mitigate potential model extraction attempts.
- Utilize output perturbation techniques to reduce the information disclosed in model responses, thereby hindering adversarial learning.
- Regularly audit and monitor API usage patterns to identify and respond to anomalous behaviors indicative of model extraction.
- Apply data loss prevention (DLP) measures to detect and prevent unauthorized exfiltration of sensitive data through model APIs.
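Two of the defenses above, per-client rate limiting and output perturbation, can be sketched in a few lines. The token-bucket parameters, noise scale, and rounding precision are illustrative defaults, not recommendations.

```python
# Sketch: per-client token-bucket rate limiting plus output perturbation.
import random
import time
from collections import defaultdict

class TokenBucket:
    """Allow roughly `rate` queries/second per client, with bursts up to `capacity`."""
    def __init__(self, rate: float, capacity: float):
        self.rate, self.capacity = rate, capacity
        self.tokens = defaultdict(lambda: capacity)   # current budget per client
        self.last = defaultdict(time.monotonic)       # last refill time per client

    def allow(self, client: str) -> bool:
        now = time.monotonic()
        # Refill tokens proportionally to elapsed time, capped at capacity.
        self.tokens[client] = min(
            self.capacity,
            self.tokens[client] + (now - self.last[client]) * self.rate)
        self.last[client] = now
        if self.tokens[client] >= 1:
            self.tokens[client] -= 1
            return True
        return False

def perturb(scores, noise=0.01, decimals=2):
    """Round and jitter class probabilities so each response leaks less
    about the exact decision boundary."""
    noisy = [max(0.0, s + random.gauss(0, noise)) for s in scores]
    total = sum(noisy) or 1.0
    return [round(s / total, decimals) for s in noisy]
```

Rate limiting slows the query loop an extractor depends on, while perturbation degrades the fidelity of each collected input-output pair; together they raise the cost of replicating the model without materially affecting legitimate clients.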

