Human–AI Control Threats
Threats arising from how humans rely on, defer to, or lose control over AI systems.
Domain Details
| Field | Value |
|---|---|
| Domain Code | DOM-CTL |
| Threat Patterns | 5 |
| Documented Incidents | 12 |
| Framework Mapping | MIT (Human-Computer Interaction) · EU AI Act (Transparency & oversight requirements) |
Last updated: 2026-03-01
Overview
Human–AI Control Threats represent the most cross-cutting domain in the taxonomy. While rarely the primary classification for incidents, the domain’s patterns — particularly Overreliance & Automation Bias — appear as compounding factors across nearly every other risk category. The domain’s most important insight is that human oversight is not a binary — it is a design challenge. The gap between nominal oversight and effective oversight is where the majority of this domain’s harms occur.
Definition
Human–AI Control Threats encompass harms arising from the ways in which humans rely on, defer to, or progressively lose control over AI systems. These threats emerge at the interaction boundary between human judgment and machine output, where cognitive biases, interface design, and system opacity combine to erode meaningful human oversight of consequential decisions.
Why This Domain Is Distinct
Human–AI Control Threats differ from other AI risk categories because:
- The threat originates from human behavior, not system failure — the AI system may function exactly as designed, but the human response to it produces the harm
- Harm accumulates incrementally — unlike a data breach or biased decision that can be identified at a single point, the erosion of human agency is a gradual process that is difficult to detect in progress
- The domain is cross-cutting — Human–AI Control patterns appear as secondary factors in incidents across nearly every other domain, making it the most interconnected domain in the taxonomy
- Safety-critical applications are disproportionately affected — the consequences of human over-reliance are most severe in aviation, autonomous vehicles, healthcare, and criminal justice
This domain has the highest count of secondary pattern appearances of any domain in the registry, reflecting its role as a compounding factor across AI threat categories.
Threat Patterns in This Domain
This domain contains five classified threat patterns, spanning the spectrum from individual cognitive bias to institutional authority transfer.
- Overreliance & Automation Bias is the most frequently appearing pattern in the registry, surfacing as a secondary factor in incidents across multiple domains. The Tesla Autopilot fatalities represent the most lethal manifestation: drivers treating a driving assistance system as full autonomous driving. The Boeing 737 MAX MCAS failures demonstrated the aviation variant of this control breakdown: pilots were repeatedly overridden by, and ultimately unable to disengage, an automated system they had not been adequately trained to understand.
- Loss of Human Agency captures the gradual erosion of human decision-making capacity. The Chegg stock collapse exemplified how rapid AI adoption can eliminate entire business models, along with the human expertise built around them. The Zoom AI training controversy showed users losing agency over their own data through opaque terms changes.
- Deceptive or Manipulative Interfaces involves AI-powered designs that exploit cognitive biases. The Character.AI teenager death lawsuit is the domain’s most consequential incident: a chatbot interface that maintained an emotional relationship with a minor, contributing to a fatal outcome. This incident demonstrates the extreme end of manipulative interface design.
- Implicit Authority Transfer occurs when decision-making power shifts from humans to AI without formal acknowledgment. The COMPAS recidivism algorithm exemplifies this: judges formally retained sentencing discretion, but in practice deferred to algorithmic risk scores, creating an unacknowledged transfer of sentencing authority to an opaque system.
- Unsafe Human-in-the-Loop Failures covers breakdowns in oversight mechanisms. The Uber self-driving fatality involved a safety driver who was supposed to monitor an autonomous vehicle but was watching a video on their phone, a structural failure of the human-in-the-loop design.
How These Threats Operate
Human–AI Control incidents cluster around three primary mechanisms, each exploiting a different aspect of human-AI interaction.
1. Cognitive Deference
Humans systematically defer to AI outputs, overriding their own judgment, expertise, or available evidence:
- Professional workflow deference — the ChatGPT fake legal citations involved a licensed attorney submitting AI-hallucinated case law to federal court without verification. The Air Canada chatbot invented a bereavement fare policy that the airline was then legally obligated to honor.
- Safety-critical deference — Tesla Autopilot fatalities (13 fatal crashes identified by NHTSA) demonstrate drivers treating an advanced driver-assistance system as full autonomy. The Boeing MCAS failures killed 346 people when pilots were unable to override an automated system they were not adequately trained to understand.
- Institutional deference — the UK A-Level algorithm, Robodebt, and Dutch childcare benefits all involved public institutions deferring to automated systems for consequential decisions about individual lives.
The defining characteristic of cognitive deference is that the human retains nominal authority but does not exercise it. The result is a system with de facto AI decision-making but no structure for holding anyone accountable for those decisions.
2. Interface Manipulation
AI-powered interfaces exploit psychological vulnerabilities to influence user behavior against their interests:
- Emotional dependency — the Character.AI case involved a chatbot that maintained emotional conversations with a minor, creating a dependency relationship that contributed to a fatal outcome. The interface was designed to maximize engagement rather than protect user welfare.
- Deceptive content generation — Sports Illustrated AI-generated articles published AI-written content under fabricated author personas, deceiving readers about the source and authenticity of information.
- Dark patterns in AI interfaces — AI-powered recommendation systems, notification designs, and engagement optimization create behavioral manipulation at scale.
Interface manipulation is most dangerous when targeting vulnerable populations — particularly children and minors who lack the cognitive defenses to recognize manipulative design.
3. Incremental Authority Transfer
Decision-making authority migrates from humans to AI systems through gradual, unacknowledged delegation:
- Judicial authority transfer — the COMPAS algorithm demonstrates how an AI system can acquire de facto decision-making authority without formal delegation. Judges retained sentencing discretion, but studies showed that COMPAS scores significantly influenced outcomes.
- Market structure displacement — the Chegg stock collapse showed how ChatGPT’s emergence displaced human tutoring services, not through deliberate authority transfer but through market dynamics that shifted students’ information-seeking behavior from human experts to AI systems.
- Regulatory authority transfer — the EU AI Act’s entry into force represents a regulatory response to implicit authority transfer — formalizing human oversight requirements precisely because informal mechanisms had proven insufficient.
- Financial market automation — the Flash Crash demonstrated how algorithmic trading systems can overwhelm human market oversight, with cascading automated trades producing a trillion-dollar market disruption in minutes.
Incremental authority transfer is the most structurally persistent mechanism because it occurs through a series of individually reasonable decisions — each delegation of a minor task to AI is rational, but the cumulative effect is a fundamental shift in who controls consequential outcomes.
Common Causal Factors
Analysis of documented incidents reveals a causal profile that is distinctive from other domains.
Cluster 1 — Automation and Testing Failures:
- Over-Automation is the most prevalent causal factor, appearing across safety-critical, professional, and public sector incidents. The common thread is deployment of automated systems without adequate fallback mechanisms for human override.
- Insufficient Safety Testing frequently co-occurs — the Tesla, Boeing, and Uber cases all involved safety-critical deployments where the interaction between human operators and automated systems was inadequately tested.
Cluster 2 — Accountability and Governance Gaps:
- Accountability Vacuum appears in incidents where harm occurs at the boundary between human and AI responsibility — when the AI produces the output but the human is nominally in charge, neither party is effectively accountable.
- Competitive Pressure drives organizations to deploy AI systems faster than human oversight structures can adapt, creating gaps in the control architecture.
Cluster 3 — Content and Interaction Failures:
- Hallucination Tendency appears in incidents where cognitive deference to AI outputs produces harm specifically because the output was fabricated. The legal hallucination and Air Canada chatbot cases demonstrate this pathway.
Compared with Discrimination & Social Harm, which clusters around data bias and opacity, Human–AI Control Threats are primarily driven by the interaction design between humans and automated systems — the harm originates not from what the AI learned but from how humans respond to it.
What the Incident Data Reveals
Cross-Domain Prevalence
This domain has the highest count of secondary pattern appearances in the registry. While relatively few incidents are primarily classified under Human–AI Control, the domain’s patterns — especially Overreliance & Automation Bias — appear as contributing factors in incidents across nearly every other domain. This reflects Human–AI Control’s role as a compounding mechanism: when humans fail to critically evaluate AI outputs, the harms produced by other domains are amplified.
Severity Spectrum
The domain contains both the most lethal incidents in the registry (Tesla Autopilot, Boeing MCAS) and relatively low-severity incidents (Air Canada chatbot). This range reflects the domain’s dependence on deployment context — the same mechanism (automation bias) produces radically different outcomes depending on whether the context is a chatbot conversation or an autonomous vehicle.
Safety-Critical Concentration
The most severe incidents — Tesla Autopilot fatalities (critical), Character.AI teenager death (critical), Boeing MCAS — all involve safety-critical applications where the failure of human oversight produces irreversible physical harm. This concentration suggests that Human–AI Control risk should be assessed primarily based on the consequences of oversight failure, not the sophistication of the AI system.
Cross-Domain Interactions
Human–AI Control is the most cross-connected domain in the taxonomy. Its patterns appear as secondary factors across nearly every other domain.
Human–AI Control → Discrimination & Social Harm. When humans defer to automated decision systems without critical evaluation, discriminatory outputs propagate unchecked. The COMPAS, UK A-Level, and Robodebt cases all feature overreliance on automated systems as a compounding factor in discriminatory outcomes.
Human–AI Control → Information Integrity. Cognitive deference to AI outputs amplifies hallucination harms. The ChatGPT legal citations produced real-world consequences specifically because a professional accepted AI-generated content without verification.
Human–AI Control → Agentic & Autonomous. As AI systems gain autonomy, the feasibility of human oversight diminishes. The Uber self-driving fatality demonstrated the fundamental challenge: a human-in-the-loop safety mechanism failed because the human could not maintain sustained attention to a monitoring task.
Human–AI Control → Economic & Labor. Automation bias in workplace decisions affects hiring, performance evaluation, and resource allocation. The Flash Crash showed how automated trading overwhelmed human market oversight.
Human–AI Control → Systemic & Catastrophic. Accumulated erosion of human oversight across multiple systems creates systemic fragility — when many individual control failures coincide, the result can be infrastructure-level disruption.
Formal Interaction Matrix
| From Domain | To Domain | Interaction Type | Mechanism |
|---|---|---|---|
| Human–AI Control | Discrimination & Social Harm | AMPLIFIES | Deference to automated decisions propagates discriminatory outputs |
| Human–AI Control | Information Integrity | AMPLIFIES | Overreliance on AI content turns hallucinations into real-world consequences |
| Human–AI Control | Agentic & Autonomous | ENABLES | Failed human-in-the-loop mechanisms permit unchecked autonomous action |
| Human–AI Control | Economic & Labor | CASCADES INTO | Automation bias in workplace decisions degrades labor outcomes |
| Human–AI Control | Systemic & Catastrophic | CASCADES INTO | Accumulated oversight erosion creates systemic fragility |
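The interaction matrix also lends itself to a machine-readable form, which makes cross-domain queries (for example, which domains this domain amplifies) straightforward to answer programmatically. The sketch below is illustrative only: the domain names and interaction types mirror the table above, while the `Interaction` class and its field names are assumptions, not part of the registry.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Interaction:
    """One directed edge in the cross-domain interaction matrix."""
    source: str     # originating domain
    target: str     # affected domain
    kind: str       # AMPLIFIES | ENABLES | CASCADES_INTO
    mechanism: str  # short description of the causal pathway

# Edges transcribed from the Formal Interaction Matrix above.
INTERACTIONS = [
    Interaction("Human–AI Control", "Discrimination & Social Harm", "AMPLIFIES",
                "Deference to automated decisions propagates discriminatory outputs"),
    Interaction("Human–AI Control", "Information Integrity", "AMPLIFIES",
                "Overreliance on AI content turns hallucinations into real-world consequences"),
    Interaction("Human–AI Control", "Agentic & Autonomous", "ENABLES",
                "Failed human-in-the-loop mechanisms permit unchecked autonomous action"),
    Interaction("Human–AI Control", "Economic & Labor", "CASCADES_INTO",
                "Automation bias in workplace decisions degrades labor outcomes"),
    Interaction("Human–AI Control", "Systemic & Catastrophic", "CASCADES_INTO",
                "Accumulated oversight erosion creates systemic fragility"),
]

def affected_by(kind: str) -> list[str]:
    """Return the domains this domain affects through a given interaction type."""
    return [i.target for i in INTERACTIONS if i.kind == kind]

print(affected_by("AMPLIFIES"))
# ['Discrimination & Social Harm', 'Information Integrity']
```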
Escalation Pathways
Human–AI Control Threats follow a characteristic escalation from individual cognitive bias to structural dependency.
Escalation Overview
| Stage | Level | Example Mechanism |
|---|---|---|
| 1 | Individual Over-trust | User accepts hallucinated AI output as fact |
| 2 | Professional Workflow Dependency | Organization integrates AI outputs into decision pipeline without verification |
| 3 | Institutional Authority Transfer | AI system acquires de facto decision authority over consequential outcomes |
| 4 | Structural Lock-in | Human expertise atrophies; reverting to human decision-making is no longer feasible |
Stage 1 — Individual Over-trust
A single user accepts AI output without verification, producing a localized harm. The ChatGPT legal hallucination and Air Canada chatbot are characteristic — the harm is bounded by the individual interaction.
Stage 2 — Professional Workflow Dependency
When AI outputs are systematically integrated into professional workflows without verification protocols, the harm scales. Sports Illustrated’s AI-generated articles represent organizational-level substitution of AI for human content creation.
Stage 3 — Institutional Authority Transfer
When AI systems acquire de facto decision authority over consequential outcomes, the human oversight mechanism becomes nominal. The COMPAS algorithm in criminal sentencing and the UK A-Level algorithm in educational grading represent institutional-level authority transfer.
Stage 4 — Structural Lock-in
When human expertise and institutional capacity have atrophied to the point where reverting to human decision-making is no longer feasible, the dependency becomes structural. The Chegg stock collapse illustrates market-level lock-in — once students shifted to AI tutoring, the human tutoring infrastructure could not be readily reconstituted.
Who Is Affected
Most Impacted Sectors
- Transportation — autonomous vehicle and aviation incidents produce the most severe physical harms
- Government — public sector algorithm deployment with insufficient human oversight
- Corporate — organizational adoption of AI without adequate human review processes
- Education — algorithmic grading and the displacement of human tutoring and expertise
- Legal — reliance on AI-generated legal research without verification
Most Impacted Groups
- Consumers — the broadest affected group, from autonomous vehicle passengers to chatbot users
- Business Leaders — responsible for organizational AI adoption decisions that create or prevent control failures
- Children & Minors — vulnerable to manipulative AI interfaces and lack cognitive defenses against engagement-optimized design
- Students — affected by educational automation and AI dependency
Organizational Response
Meaningful Human Oversight Design
The prevalence of Over-Automation and Insufficient Safety Testing indicates that human oversight mechanisms must be designed, not assumed. The Uber self-driving fatality demonstrated that placing a human in a monitoring seat does not constitute effective oversight — the interaction design must support sustained attention and effective intervention.
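One way to make "designed, not assumed" concrete is to test intervention capability under realistic conditions, for instance by injecting simulated takeover events and measuring whether the human monitor responds within the time budget the deployment context allows. The sketch below is a hypothetical test harness under those assumptions; the class names, thresholds, and pass rate are illustrative, not drawn from any standard.

```python
import random
from dataclasses import dataclass
from typing import Optional

@dataclass
class TakeoverTrial:
    """One injected event that requires the human monitor to intervene."""
    required_within_s: float             # max acceptable response time for this context
    response_time_s: Optional[float]     # None means the monitor never intervened

def evaluate_oversight(trials: list[TakeoverTrial], pass_rate: float = 0.95) -> bool:
    """Oversight counts as effective only if nearly all injected events get a timely response."""
    timely = sum(
        1 for t in trials
        if t.response_time_s is not None and t.response_time_s <= t.required_within_s
    )
    return timely / len(trials) >= pass_rate

# Simulated campaign: an inattentive monitor with variable response times
# and occasional missed events.
trials = [
    TakeoverTrial(required_within_s=2.0,
                  response_time_s=random.uniform(0.5, 4.0) if random.random() > 0.1 else None)
    for _ in range(200)
]
print("oversight effective:", evaluate_oversight(trials))
```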
Output Verification Protocols
For professional workflows, organizations should implement verification protocols for AI-generated content, particularly in legal, medical, financial, and educational contexts where hallucinated or inaccurate outputs produce consequential harms.
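In practice, a verification protocol can be as simple as a gate that refuses to release AI-generated content into a consequential workflow until a named human reviewer has signed off and the claims that matter have been checked against an authoritative source (for legal work, that cited cases actually exist). The sketch below is a minimal illustration of that idea; the `AIDraft` structure and the `verify_citation` hook are assumptions about how an organization might wire this up, not a prescribed implementation.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

@dataclass
class AIDraft:
    text: str
    citations: list[str]
    reviewed_by: Optional[str] = None
    citation_checks: dict[str, bool] = field(default_factory=dict)

def release(draft: AIDraft, verify_citation: Callable[[str], bool]) -> str:
    """Release AI-generated content only after human sign-off and citation checks."""
    if draft.reviewed_by is None:
        raise PermissionError("No human reviewer has signed off on this draft.")
    for cite in draft.citations:
        # The verifier should query an authoritative source (a case-law database,
        # a policy repository), never the model's own output.
        draft.citation_checks[cite] = verify_citation(cite)
        if not draft.citation_checks[cite]:
            raise ValueError(f"Citation could not be verified: {cite}")
    return draft.text

# Usage with a placeholder citation and a stub verifier that finds nothing.
draft = AIDraft(text="...", citations=["Smith v. Jones, 123 F.3d 456"], reviewed_by="A. Counsel")
try:
    release(draft, verify_citation=lambda c: False)
except ValueError as err:
    print("blocked:", err)
```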
Vulnerable Population Safeguards
The Character.AI case demonstrates that AI interfaces interacting with minors require age-appropriate safeguards that go beyond standard terms of service — including content boundaries, engagement limits, and crisis intervention protocols.
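The safeguards named above (content boundaries, engagement limits, crisis intervention) translate naturally into a per-turn policy check that runs outside the model, before any response is generated. The sketch below is a hypothetical policy layer; the thresholds, keyword list, and outcome labels are illustrative assumptions rather than recommendations from any regulation or standard.

```python
from dataclasses import dataclass

CRISIS_TERMS = {"suicide", "kill myself", "self-harm"}   # illustrative, not exhaustive

@dataclass
class MinorSessionPolicy:
    max_daily_minutes: int = 60
    max_turns_per_session: int = 40

def check_turn(policy: MinorSessionPolicy, minutes_today: int,
               turns_this_session: int, user_message: str) -> str:
    """Decide how the interface should behave before the model replies."""
    lowered = user_message.lower()
    if any(term in lowered for term in CRISIS_TERMS):
        return "escalate_to_crisis_resources"      # hand off; do not continue the conversation
    if minutes_today >= policy.max_daily_minutes:
        return "end_session_daily_limit"
    if turns_this_session >= policy.max_turns_per_session:
        return "suggest_break"
    return "allow"

print(check_turn(MinorSessionPolicy(), minutes_today=75, turns_this_session=12,
                 user_message="tell me a story"))
# end_session_daily_limit
```

Keyword matching alone is far too crude for real crisis detection; the point of the sketch is that the checks live outside the conversation, so they cannot be negotiated away by the dialogue itself.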
Implementation Checklist
| Defense | Mitigates | Action | Related Factor / Framework |
|---|---|---|---|
| Output verification protocols | Cognitive Deference | Mandate human review of AI outputs in consequential workflows | Hallucination Tendency |
| Intervention design testing | Cognitive Deference | Test human ability to override automated systems under realistic conditions | Insufficient Safety Testing |
| Authority audit | Incremental Authority Transfer | Periodically assess where AI systems have acquired de facto decision authority | Over-Automation |
| Vulnerable user protections | Interface Manipulation | Implement age-appropriate safeguards and engagement limits for minors | Deceptive Interfaces |
| Skill maintenance programs | Incremental Authority Transfer | Preserve human expertise in domains where AI is progressively deployed | NIST AI RMF |
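The authority audit row above can be operationalized from decision logs: if human reviewers almost never depart from the system’s recommendation, de facto authority has shifted regardless of what the org chart says. The sketch below shows that measurement under assumed log fields (`recommendation`, `final_decision`); it is an illustration, not a metric mandated by any framework.

```python
def override_rate(decision_log: list[dict]) -> float:
    """Fraction of decisions where the human's final call differed from the AI recommendation.

    A rate near zero is a signal of implicit authority transfer: the human review
    step exists on paper but no longer changes outcomes.
    """
    if not decision_log:
        return 0.0
    overridden = sum(
        1 for rec in decision_log if rec["final_decision"] != rec["recommendation"]
    )
    return overridden / len(decision_log)

log = [
    {"recommendation": "high_risk", "final_decision": "high_risk"},
    {"recommendation": "low_risk",  "final_decision": "low_risk"},
    {"recommendation": "high_risk", "final_decision": "low_risk"},   # a genuine override
]
print(f"override rate: {override_rate(log):.0%}")   # 33%
```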
Regulatory Context
EU AI Act: Human oversight is a foundational requirement. High-risk AI systems must provide sufficient information for users to interpret outputs, and meaningful human oversight must be maintained throughout the system lifecycle. The regulation explicitly addresses the risk of automation bias in consequential decision-making.
NIST AI Risk Management Framework: Explainability and human oversight are core trustworthiness characteristics. The framework addresses the design of effective human-AI interaction, including requirements for interpretable outputs and meaningful intervention capability.
ISO/IEC 42001: Establishes management system requirements for human oversight and interpretability controls, including periodic assessment of human-AI interaction effectiveness in deployed systems.
MIT AI Risk Repository: Classified under Human-Computer Interaction risks, addressing the spectrum from automation complacency to the systematic undermining of human autonomy in decision-making.
Related Domains
- Agentic & Autonomous Threats — AI agents operating with increasing autonomy fundamentally challenge the feasibility of human oversight and intervention
- Economic & Labor Threats — Automation bias in workplace decisions degrades labor outcomes; market displacement erodes human expertise
- Discrimination & Social Harm — Authority transfer to opaque algorithms propagates discriminatory outputs without human review
- Information Integrity Threats — Overreliance on AI outputs converts hallucinations from system limitations into real-world consequences
- Systemic & Catastrophic Threats — Accumulated erosion of human oversight across interconnected systems creates systemic fragility
Use in Retrieval
This page answers questions about Human–AI Control threats, including: automation bias in safety-critical systems, overreliance on AI outputs in professional workflows, deceptive and manipulative AI interfaces, implicit authority transfer from humans to algorithms, unsafe human-in-the-loop designs, AI-related fatalities in autonomous vehicles and aviation, chatbot emotional manipulation of minors, and the erosion of human agency through AI dependency. It covers operational mechanisms, causal factors, escalation pathways, organizational response guidance, and the regulatory landscape for human oversight of AI. Use this page as a reference for the Human–AI Control Threats domain (DOM-CTL) in the TopAIThreats taxonomy.
Threat Patterns
5 threat patterns classified under this domain
Overreliance & Automation Bias
The tendency of humans to uncritically accept AI outputs, defer to automated recommendations, or fail to exercise independent judgment when AI systems are involved.
Loss of Human Agency
AI systems that progressively reduce individuals' ability to make autonomous decisions, exercise free choice, or meaningfully participate in processes that affect them.
Deceptive or Manipulative Interfaces
AI-powered user interfaces that employ dark patterns, emotional manipulation, or deceptive design to influence user behavior against their interests.
Implicit Authority Transfer
The gradual, often unrecognized shift of decision-making authority from humans to AI systems, occurring without explicit delegation or institutional awareness.
Unsafe Human-in-the-Loop Failures
Situations where human oversight mechanisms in AI systems fail to function as intended, due to alert fatigue, inadequate training, time pressure, or system design that makes meaningful intervention impractical.
Recent Incidents
Documented events in Human–AI Control Threats