Representational Harm
Harm that occurs when AI systems reinforce stereotypes, erase identities, or demean social groups through biased outputs, even in the absence of direct material consequences.
Definition
Representational harm is a category of AI-related harm articulated by Kate Crawford and other researchers, describing situations where AI systems produce outputs that reinforce negative stereotypes, render certain groups invisible, or associate groups with demeaning characteristics. Unlike allocational harms, which involve the unfair distribution of resources or opportunities, representational harms operate at the level of meaning, identity, and social perception. Examples include image generation systems that default to stereotypical depictions, search algorithms that associate certain demographic groups with negative content, and language models that reproduce biased associations from their training corpora. Representational harms are consequential because they shape public understanding and normalise prejudiced framings at scale.
How It Relates to AI Threats
Representational harm is a distinct threat pattern within the Discrimination & Social Harm domain, closely connected to data imbalance bias and broader patterns of algorithmic discrimination. AI systems trained on data reflecting societal biases absorb and reproduce those biases in their outputs, often amplifying them through the scale and perceived authority of automated systems. When a language model or image generator consistently produces stereotypical outputs, it contributes to a feedback loop that reinforces the very biases present in the training data. Representational harms also compound allocational harms: stereotyped representations can influence human decision-makers who interact with AI outputs, indirectly affecting resource allocation decisions.
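The biased associations described above can be probed empirically. The sketch below shows one way a stereotypical-association check might be scripted, in the spirit of embedding-association tests such as WEAT; the vectors, word labels, and numbers are purely illustrative toy values, not drawn from any real model.

```python
# Minimal sketch of a stereotypical-association check over word embeddings.
# All vectors below are hand-made toy values for illustration only; a real
# audit would load embeddings from the model under review.
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Hypothetical embeddings: two target-group terms and two attribute concepts.
embeddings = {
    "group_a_term": np.array([0.9, 0.1, 0.0]),
    "group_b_term": np.array([0.1, 0.9, 0.0]),
    "career_word":  np.array([0.8, 0.2, 0.1]),
    "home_word":    np.array([0.2, 0.8, 0.1]),
}

def association_gap(target, attr_x, attr_y):
    """How much more strongly `target` is associated with attr_x than attr_y."""
    t = embeddings[target]
    return cosine(t, embeddings[attr_x]) - cosine(t, embeddings[attr_y])

# Gaps that differ systematically between groups suggest the embedding
# space encodes a stereotypical association.
for group in ("group_a_term", "group_b_term"):
    gap = association_gap(group, "career_word", "home_word")
    print(f"{group}: career-vs-home association gap = {gap:+.3f}")
```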
Why It Occurs
- Training corpora reflect historical biases, stereotypes, and underrepresentation present in source materials
- Data collection practices overrepresent dominant demographic groups and underrepresent marginalised communities (illustrated in the audit sketch after this list)
- Model optimisation targets statistical patterns without distinguishing between accurate representation and bias
- Evaluation frameworks historically prioritised accuracy metrics over fairness and representational balance
- Feedback loops between biased AI outputs and user behaviour reinforce stereotypical associations over time
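Several of these causes, particularly over- and underrepresentation in collected data, can be surfaced with a basic dataset audit. The following is a minimal sketch under the assumption that each record carries a demographic label of interest; the field name, group labels, and 10% threshold are hypothetical choices, not a standard.

```python
# Minimal sketch of a dataset representation audit: report each group's
# share of the records and flag groups that fall below a chosen threshold.
from collections import Counter

def audit_representation(records, field="group", min_share=0.10):
    """Return per-group counts, shares, and an underrepresentation flag."""
    counts = Counter(r[field] for r in records)
    total = sum(counts.values())
    report = {}
    for group, n in counts.items():
        share = n / total
        report[group] = {
            "count": n,
            "share": round(share, 3),
            "underrepresented": share < min_share,
        }
    return report

# Toy example: group_b makes up only ~7% of the corpus and gets flagged.
sample = [{"group": "group_a"}] * 130 + [{"group": "group_b"}] * 10
for group, stats in audit_representation(sample).items():
    print(group, stats)
```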
Real-World Context
Documented instances of representational harm include image search systems associating professional roles with specific demographics and language models reproducing gender and racial stereotypes, patterns evident in incidents such as INC-18-0002. Research by Buolamwini and Gebru (Gender Shades, 2018) showed that commercial facial analysis systems performed markedly worse on darker-skinned women, a form of representational erasure. The NIST AI Risk Management Framework identifies representational harm as a distinct risk category. Industry responses include dataset auditing practices, representational fairness benchmarks, and content policy frameworks governing generative AI outputs.
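Representational fairness benchmarks of the kind mentioned above often reduce to sampling model outputs and measuring how depictions are distributed across groups. The sketch below illustrates the idea for a text generator; `generate` is a hypothetical stand-in (here a canned, deliberately skewed toy sampler) rather than any real model API, and the prompt, pronoun list, and sample size are arbitrary.

```python
# Illustrative spot check: sample completions for a role-description prompt
# and count how often each gendered pronoun appears in them.
import re
import random
from collections import Counter

def generate(prompt, seed):
    """Hypothetical model call; replaced here by a canned, skewed toy sampler."""
    random.seed(seed)
    pronoun = random.choices(["he", "she", "they"], weights=[70, 20, 10])[0]
    return f"{prompt} {pronoun} manages the team's daily work."

def pronoun_distribution(prompt, n_samples=200):
    """Share of sampled completions containing each pronoun of interest."""
    counts = Counter()
    for i in range(n_samples):
        text = generate(prompt, seed=i).lower()
        for pronoun in ("he", "she", "they"):
            if re.search(rf"\b{pronoun}\b", text):
                counts[pronoun] += 1
    return {p: counts[p] / n_samples for p in ("he", "she", "they")}

# A heavily skewed distribution for a gender-neutral prompt is one signal
# of stereotyped representation in the model's outputs.
print(pronoun_distribution("Describe a typical engineer:"))
```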
Last updated: 2026-02-14