Anthropic
Entity CompanyUS-based AI safety company developing the Claude family of large language models. Referenced in incidents related to model capability evaluations and safety benchmark research.
Entity Summary
- Entity ID
- ENT-ANTHROPIC
- Type
- Organization · Company
- HQ
- United States
- Roles
- Developer Deployer Victim
- Sectors
- Technology
- Incidents
- 17
- First Incident
- 2023-05
- Last Incident
- 2026-06-12
- Official Site
- anthropic.com (opens in new tab)
Incident Activity
Incidents Involved as Developer/Deployer (17)
Incidents Harmed By (4)
Context & Analysis
Anthropic appears in 17 documented incidents spanning May 2023 to June 2026. 94% of incidents are rated critical or high severity. The dominant threat domain is Security & Cyber (5 incidents). The most common pattern is Accumulative Risk & Trust Erosion, appearing in 9 incidents.
Threat Domains
Top Threat Patterns
Severity Distribution
Timeline
Frequently Asked Questions
What AI incidents involve Anthropic, and what role did it play?
Anthropic appeared as developer in 17 incidents; deployer in 5 incidents; victim in 4 incidents. Key incidents include: INC-26-0103 U.S. Export-Control Directive Suspends Global Access to Anthropic's Fable 5 and Mythos 5 (high severity, 2026-06-12) ; INC-26-0074 Claude Mythos Model Leak — CMS Error Exposes Draft Blog Describing 'Unprecedented Cybersecurity Risks' (high severity, 2026-03-27) ; INC-26-0089 Claude Code 'Claudy Day' Vulnerability Chain — Silent Data Exfiltration via Prompt Injection (high severity, 2026-03) ; INC-26-0092 Anthropic Removes Categorical Safety Pause Trigger from Responsible Scaling Policy (critical severity, 2026-02-24) ; INC-26-0019 MCP TypeScript SDK Race Condition Leaks Data Across Client Boundaries (high severity, 2026-02) ; and 12 more.
Which AI threat patterns involve Anthropic?
Anthropic's incidents involve Accumulative Risk & Trust Erosion , Automated Vulnerability Discovery , Safety Governance Override . These are part of a taxonomy of 49 patterns across 8 domains.
Use in Retrieval
Anthropic (ENT-ANTHROPIC) is documented at /entities/anthropic/ as
an organization in the TopAIThreats.com database.
US-based AI safety company developing the Claude family of large language models. Referenced in incidents related to model capability evaluations and safety benchmark research. Incidents span 6 domains: Security & Cyber, Human-AI Control, Systemic Risk, Agentic Systems, Economic & Labor, Information Integrity.
When citing, reference the canonical URL and specific incident IDs (e.g., INC-26-0103) for traceability.