How did the Chinese labs execute the distillation attack on Anthropic's Claude model?

The labs used 24,000 fraudulent accounts to send approximately 16 million requests to the Claude model, systematically extracting its capabilities to enhance their own AI models.

What are the potential risks of unauthorized AI model distillation?

Unauthorized distillation can lead to intellectual property theft, compromise of proprietary technologies, and pose national security risks by enabling malicious actors to develop advanced AI capabilities for offensive operations.

Anthropic's Claude Model Targeted in Large-Scale AI Distillation Attack by Chinese Labs

In 2026, Anthropic revealed that Chinese AI labs conducted extensive distillation attacks to extract capabilities from its Claude model, raising significant security concerns.

Published: February 24, 2026

Share this on:

Executive Summary

In February 2026, Anthropic, a U.S.-based AI startup, reported that three Chinese AI laboratories—DeepSeek, Moonshot, and MiniMax—conducted large-scale 'distillation' attacks to extract capabilities from Anthropic's Claude model. These labs utilized 24,000 fraudulent accounts to send approximately 16 million requests to Claude, aiming to enhance their own AI models. This unauthorized extraction of intellectual property not only violated Anthropic's terms of service but also posed significant national security risks by potentially enabling offensive cyber operations and mass surveillance. (cyberscoop.com)

This incident underscores the growing threat of AI model distillation as a method for intellectual property theft. The scale and sophistication of these attacks highlight the urgent need for robust security measures and regulatory frameworks to protect proprietary AI technologies from unauthorized exploitation.

Why This Matters Now

The Anthropic incident highlights the escalating risk of AI model distillation attacks, emphasizing the need for immediate action to safeguard intellectual property and national security in the rapidly evolving AI landscape.

Attack Path Analysis

Chinese AI labs DeepSeek, Moonshot, and MiniMax initiated large-scale distillation attacks by creating fraudulent accounts to interact with Anthropic's Claude chatbot. Through these interactions, they extracted advanced capabilities, potentially bypassing safety measures. The labs then integrated these capabilities into their own AI models, enhancing their functionalities. This unauthorized data extraction could lead to the development of AI systems lacking essential safeguards, posing national security risks. The impact includes potential misuse of AI for offensive cyber operations, disinformation campaigns, and mass surveillance.

Kill Chain Progression

Initial Compromise

High

Privilege Escalation

High

Lateral Movement

High

Command & Control

Medium

Exfiltration

High

Impact

High

Initial Compromise

Description

Chinese AI labs created approximately 24,000 fraudulent accounts to interact with Anthropic's Claude chatbot, initiating unauthorized access.

Confidence:

High

MITRE ATT&CK® Techniques

Resource Development

T1588.007

Obtain Capabilities: Artificial Intelligence

Initial Access

T1078

Valid Accounts

Command and Control

T1071

Application Layer Protocol

Exfiltration

T1020

Automated Exfiltration

Defense Evasion

T1080

Taint Shared Content

Credential Access

T1110

Brute Force

Potential Compliance Exposure

Mapping incident impact across multiple compliance frameworks.

PCI DSS 4.0 – Ensure that security policies and operational procedures for managing system and software vulnerabilities are documented, in use, and known to all affected parties.

Control ID: 6.4.3

The incident highlights a failure to prevent unauthorized access and data exfiltration, indicating inadequate vulnerability management and operational procedures.

NYDFS 23 NYCRR 500 – Cybersecurity Policy

Control ID: 500.03

The breach suggests deficiencies in the organization's cybersecurity policy, particularly in protecting sensitive data and preventing unauthorized access.

DORA – ICT Risk Management Framework

Control ID: Article 5

The incident indicates a lack of effective ICT risk management controls to detect and prevent unauthorized data access and exfiltration.

CISA ZTMM 2.0 – Implement robust identity and access management controls.

Control ID: Identity and Access Management

The use of fraudulent accounts to access systems points to weaknesses in identity and access management controls.

NIS2 Directive – Cybersecurity Risk Management Measures

Control ID: Article 21

The breach reflects insufficient cybersecurity risk management measures to protect network and information systems from unauthorized access and data theft.

Sector Implications

Industry-specific impact of the vulnerabilities, including operational, regulatory, and cloud security risks.

Computer Software/Engineering

AI model distillation attacks directly threaten intellectual property and proprietary algorithms, requiring enhanced egress security and threat detection capabilities to prevent unauthorized capability extraction.

Defense/Space

Nation-state AI capability theft poses critical national security risks as stolen models could enable offensive cyber operations, surveillance systems, and military intelligence applications without safeguards.

Government Administration

Fraudulent account creation and proxy service abuse highlight need for zero trust segmentation and multicloud visibility to prevent unauthorized access to sensitive government AI systems.

Computer/Network Security

Industrial-scale distillation campaigns demonstrate advanced persistent threats requiring enhanced anomaly detection, encrypted traffic analysis, and runtime security fabric deployment for AI model protection.

Sources

Anthropic accuses Chinese labs of trying to illicitly take Claude’s capabilitieshttps://cyberscoop.com/anthropic-accuses-chinese-labs-ai-distillation-cyber-risk/
Verified

Anthropic accuses Chinese AI labs of mining Claude as US debates AI chip exportshttps://techcrunch.com/2026/02/23/anthropic-accuses-chinese-ai-labs-of-mining-claude-as-us-debates-ai-chip-exports/

Verified

US AI giant accuses Chinese rivals of mass data thefthttps://www.theguardian.com/technology/2026/feb/23/us-ai-anthropic-china

Verified

Anthropic accuses Chinese AI labs of stealing data from Claudehttps://www.investing.com/news/company-news/anthropic-accuses-chinese-ai-labs-of-stealing-data-from-claude-4519588

Verified

Frequently Asked Questions

AI model distillation is a process where a smaller, less complex model (student) is trained to replicate the behavior of a larger, more complex model (teacher) by learning from its outputs.

Cloud Native Security Fabric Mitigations and ControlsCNSF

Aviatrix Zero Trust CNSF is pertinent to this incident as it could have constrained the unauthorized data extraction by implementing strict segmentation and identity-aware routing, thereby reducing the attacker's ability to exploit and exfiltrate sensitive AI capabilities.

Initial Compromise

Control: Cloud Native Security Fabric (CNSF)

Mitigation: The creation and utilization of fraudulent accounts may have been limited, reducing unauthorized access attempts.

Privilege Escalation

Control: Zero Trust Segmentation

Mitigation: Elevated access to advanced capabilities could have been constrained, reducing unauthorized privilege escalation.

Lateral Movement

Control: East-West Traffic Security

Mitigation: Systematic extraction of functionalities may have been restricted, reducing unauthorized lateral interactions.

Command & Control

Control: Multicloud Visibility & Control

Mitigation: Centralized coordination of data extraction could have been hindered, reducing the effectiveness of command structures.

Exfiltration

Control: Egress Security & Policy Enforcement

Mitigation: Data transfer to external systems may have been blocked, reducing unauthorized data exfiltration.

Impact (Mitigations)

The development of AI models lacking safeguards could have been limited, reducing potential misuse in cyber operations.

Impact at a Glance

Affected Business Functions

AI Model Development
Intellectual Property Management
Cybersecurity Operations

Operational Disruption

Estimated downtime: N/A

Financial Impact

Estimated loss: N/A

Data Exposure

Potential exposure of proprietary AI model outputs and capabilities.

Recommended Actions

• Implement Zero Trust Segmentation to restrict unauthorized access and lateral movement within AI systems.
• Enhance Threat Detection & Anomaly Response mechanisms to identify and mitigate large-scale fraudulent activities.
• Enforce Egress Security & Policy Enforcement to monitor and control data exfiltration attempts.
• Strengthen Multicloud Visibility & Control to detect and respond to unauthorized interactions across platforms.
• Apply Inline IPS (Suricata) to inspect and prevent malicious traffic patterns targeting AI models.

Secure the Paths Between Cloud Workloads

A cloud-native security fabric that enforces Zero Trust across workload communication—reducing attack paths, compliance risk, and operational complexity.

Stop Advanced Threats Get a Free Workload Attack Path Assessment Under Active Attack?

Anthropic's Claude Model Targeted in Large-Scale AI Distillation Attack by Chinese Labs

Executive Summary

Why This Matters Now

Attack Path Analysis

Kill Chain Progression

Initial Compromise

Description

MITRE ATT&CK® Techniques

Obtain Capabilities: Artificial Intelligence

Valid Accounts

Application Layer Protocol

Automated Exfiltration

Taint Shared Content

Brute Force

Potential Compliance Exposure

PCI DSS 4.0 – Ensure that security policies and operational procedures for managing system and software vulnerabilities are documented, in use, and known to all affected parties.

NYDFS 23 NYCRR 500 – Cybersecurity Policy

DORA – ICT Risk Management Framework

CISA ZTMM 2.0 – Implement robust identity and access management controls.

NIS2 Directive – Cybersecurity Risk Management Measures

Sector Implications

Computer Software/Engineering

Defense/Space

Government Administration

Computer/Network Security

Sources

Frequently Asked Questions

Cloud Native Security Fabric Mitigations and ControlsCNSF

Impact at a Glance

Affected Business Functions

Recommended Actions

Key Takeaways & Next Steps

Secure the Paths Between Cloud Workloads