Executive Summary
In February 2026, Anthropic, a U.S.-based AI startup, reported that three Chinese AI laboratories—DeepSeek, Moonshot, and MiniMax—conducted large-scale 'distillation' attacks to extract capabilities from Anthropic's Claude model. These labs utilized 24,000 fraudulent accounts to send approximately 16 million requests to Claude, aiming to enhance their own AI models. This unauthorized extraction of intellectual property not only violated Anthropic's terms of service but also posed significant national security risks by potentially enabling offensive cyber operations and mass surveillance. (cyberscoop.com)
This incident underscores the growing threat of AI model distillation as a method for intellectual property theft. The scale and sophistication of these attacks highlight the urgent need for robust security measures and regulatory frameworks to protect proprietary AI technologies from unauthorized exploitation.
Why This Matters Now
The Anthropic incident highlights the escalating risk of AI model distillation attacks, emphasizing the need for immediate action to safeguard intellectual property and national security in the rapidly evolving AI landscape.
Attack Path Analysis
Chinese AI labs DeepSeek, Moonshot, and MiniMax initiated large-scale distillation attacks by creating fraudulent accounts to interact with Anthropic's Claude chatbot. Through these interactions, they extracted advanced capabilities, potentially bypassing safety measures. The labs then integrated these capabilities into their own AI models, enhancing their functionalities. This unauthorized data extraction could lead to the development of AI systems lacking essential safeguards, posing national security risks. The impact includes potential misuse of AI for offensive cyber operations, disinformation campaigns, and mass surveillance.
Kill Chain Progression
Initial Compromise
Description
Chinese AI labs created approximately 24,000 fraudulent accounts to interact with Anthropic's Claude chatbot, initiating unauthorized access.
MITRE ATT&CK® Techniques
Obtain Capabilities: Artificial Intelligence
Valid Accounts
Application Layer Protocol
Automated Exfiltration
Taint Shared Content
Brute Force
Potential Compliance Exposure
Mapping incident impact across multiple compliance frameworks.
PCI DSS 4.0 – Ensure that security policies and operational procedures for managing system and software vulnerabilities are documented, in use, and known to all affected parties.
Control ID: 6.4.3
NYDFS 23 NYCRR 500 – Cybersecurity Policy
Control ID: 500.03
DORA – ICT Risk Management Framework
Control ID: Article 5
CISA ZTMM 2.0 – Implement robust identity and access management controls.
Control ID: Identity and Access Management
NIS2 Directive – Cybersecurity Risk Management Measures
Control ID: Article 21
Sector Implications
Industry-specific impact of the vulnerabilities, including operational, regulatory, and cloud security risks.
Computer Software/Engineering
AI model distillation attacks directly threaten intellectual property and proprietary algorithms, requiring enhanced egress security and threat detection capabilities to prevent unauthorized capability extraction.
Defense/Space
Nation-state AI capability theft poses critical national security risks as stolen models could enable offensive cyber operations, surveillance systems, and military intelligence applications without safeguards.
Government Administration
Fraudulent account creation and proxy service abuse highlight need for zero trust segmentation and multicloud visibility to prevent unauthorized access to sensitive government AI systems.
Computer/Network Security
Industrial-scale distillation campaigns demonstrate advanced persistent threats requiring enhanced anomaly detection, encrypted traffic analysis, and runtime security fabric deployment for AI model protection.
Sources
- Anthropic accuses Chinese labs of trying to illicitly take Claude’s capabilitieshttps://cyberscoop.com/anthropic-accuses-chinese-labs-ai-distillation-cyber-risk/Verified
- Anthropic accuses Chinese AI labs of mining Claude as US debates AI chip exportshttps://techcrunch.com/2026/02/23/anthropic-accuses-chinese-ai-labs-of-mining-claude-as-us-debates-ai-chip-exports/Verified
- US AI giant accuses Chinese rivals of mass data thefthttps://www.theguardian.com/technology/2026/feb/23/us-ai-anthropic-chinaVerified
- Anthropic accuses Chinese AI labs of stealing data from Claudehttps://www.investing.com/news/company-news/anthropic-accuses-chinese-ai-labs-of-stealing-data-from-claude-4519588Verified
Frequently Asked Questions
Cloud Native Security Fabric Mitigations and ControlsCNSF
Aviatrix Zero Trust CNSF is pertinent to this incident as it could have constrained the unauthorized data extraction by implementing strict segmentation and identity-aware routing, thereby reducing the attacker's ability to exploit and exfiltrate sensitive AI capabilities.
Control: Cloud Native Security Fabric (CNSF)
Mitigation: The creation and utilization of fraudulent accounts may have been limited, reducing unauthorized access attempts.
Control: Zero Trust Segmentation
Mitigation: Elevated access to advanced capabilities could have been constrained, reducing unauthorized privilege escalation.
Control: East-West Traffic Security
Mitigation: Systematic extraction of functionalities may have been restricted, reducing unauthorized lateral interactions.
Control: Multicloud Visibility & Control
Mitigation: Centralized coordination of data extraction could have been hindered, reducing the effectiveness of command structures.
Control: Egress Security & Policy Enforcement
Mitigation: Data transfer to external systems may have been blocked, reducing unauthorized data exfiltration.
The development of AI models lacking safeguards could have been limited, reducing potential misuse in cyber operations.
Impact at a Glance
Affected Business Functions
- AI Model Development
- Intellectual Property Management
- Cybersecurity Operations
Estimated downtime: N/A
Estimated loss: N/A
Potential exposure of proprietary AI model outputs and capabilities.
Recommended Actions
Key Takeaways & Next Steps
- • Implement Zero Trust Segmentation to restrict unauthorized access and lateral movement within AI systems.
- • Enhance Threat Detection & Anomaly Response mechanisms to identify and mitigate large-scale fraudulent activities.
- • Enforce Egress Security & Policy Enforcement to monitor and control data exfiltration attempts.
- • Strengthen Multicloud Visibility & Control to detect and respond to unauthorized interactions across platforms.
- • Apply Inline IPS (Suricata) to inspect and prevent malicious traffic patterns targeting AI models.



