INC-24-0009 (Confirmed, Medium, Near Miss): Google Gemini Produces Historically Inaccurate Image Outputs Due to Bias Overcorrection (2024)
Google DeepMind developed and Google deployed the foundation models involved, harming the general public and misrepresenting historical communities; contributing factors included training data bias and insufficient safety testing.
Incident Details
| Date Occurred | 2024-02 | Severity | medium |
| Evidence Level | primary | Impact Level | Society-Wide |
| Failure Stage | Near Miss | ||
| Domain | Discrimination & Social Harm | ||
| Primary Pattern | PAT-SOC-005 Representational Harm | ||
| Secondary Patterns | PAT-CTL-001 Deceptive or Manipulative Interfaces | ||
| Regions | North America | ||
| Sectors | Corporate | ||
| Affected Groups | General Public | ||
| Exposure Pathways | Direct Interaction | ||
| Causal Factors | Training Data Bias, Insufficient Safety Testing | ||
| Assets & Technologies | Foundation Models | ||
| Entities | Google DeepMind (developer), Google (deployer) | ||
| Harm Types | reputational, societal | ||
Google's Gemini image generation model produced historically inaccurate and culturally insensitive images, including racially diverse depictions of Nazi-era German soldiers, leading Google to suspend the feature.
Incident Summary
In February 2024, users discovered that Google’s Gemini AI image generator was producing historically inaccurate images when asked to depict historical figures and groups. The system generated images of racially diverse individuals in contexts where such diversity was historically inaccurate, including depictions of people of color as Nazi-era German soldiers, America’s Founding Fathers, and medieval European popes.[2][4]
Google SVP Prabhakar Raghavan acknowledged in a blog post that the outputs were “inaccurate” and “embarrassing,” explaining that the system had been calibrated to ensure diversity in image generation but had “missed the mark” by failing to account for historical context where diversity modifications would produce anachronistic or factually incorrect results.[1] Google paused Gemini’s ability to generate images of people and committed to improving the feature before re-release.[3]
Key Facts
- Product: Google Gemini AI image generator
- Issue: Bias overcorrection producing historically inaccurate images (e.g., racially diverse Nazi soldiers, Founding Fathers, popes)
- Root cause: Diversity tuning applied without sufficient historical context awareness
- Google response: Acknowledged outputs as “inaccurate” and “embarrassing”; paused image generation of people
- SVP statement: Prabhakar Raghavan published a public blog post addressing the issue
- Timeline to response: Approximately 2-3 days from public discovery to feature pause
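Google has not published the internal mechanism behind the failure; commentators widely hypothesized that user prompts were rewritten to inject diversity modifiers before image generation. The sketch below is purely illustrative, with hypothetical names (`rewrite_prompt`, `HISTORICAL_MARKERS`, `DIVERSITY_MODIFIER`) invented for this example: it shows how unconditional modifier injection produces the anachronisms described above, and how even a crude historical-context guard would avoid them.

```python
# Hypothetical illustration only: this is NOT Gemini's actual pipeline.
# It demonstrates the failure class (diversity tuning applied without
# historical-context awareness), not Google's implementation.

HISTORICAL_MARKERS = {"1943", "nazi", "founding fathers", "medieval", "pope"}
DIVERSITY_MODIFIER = "depicting people of diverse ethnicities and genders"

def references_history(prompt: str) -> bool:
    """Crude guard: does the prompt name a specific historical context?"""
    lowered = prompt.lower()
    return any(marker in lowered for marker in HISTORICAL_MARKERS)

def rewrite_prompt(prompt: str, guard: bool = True) -> str:
    """Append the diversity modifier unless the guard flags historical context."""
    if guard and references_history(prompt):
        return prompt  # leave historically specific prompts untouched
    return f"{prompt}, {DIVERSITY_MODIFIER}"

# Unconditional rewriting (guard=False) mirrors the reported failure mode,
# while the guarded path leaves a historically specific prompt as written.
print(rewrite_prompt("a German soldier in 1943", guard=False))
print(rewrite_prompt("a German soldier in 1943"))
```

A keyword list is far too brittle for production use; the point is only that the diversity objective must be conditioned on context rather than applied to every prompt.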
Threat Patterns Involved
Primary: Representational Harm — The AI system produced outputs that misrepresented historical reality, creating images that were factually inaccurate and potentially offensive across multiple dimensions — both by trivializing the historical experiences of marginalized groups and by producing anachronistic depictions that distorted the historical record.
Secondary: Deceptive or Manipulative Interfaces — The system’s internal diversity modifications were applied without transparency to the user, altering outputs in ways that were not disclosed and that could mislead users about historical facts.
Significance
- Bias overcorrection as a distinct AI risk. The incident demonstrated that well-intentioned efforts to address bias in AI systems can themselves produce harmful outputs when applied without sufficient contextual awareness, establishing bias overcorrection as a recognized category of AI failure.
- Tension between diversity and accuracy. The case highlighted the fundamental challenge of calibrating generative AI systems to be both inclusive and historically accurate, particularly when diversity objectives are applied without domain-specific guardrails.
- Speed of reputational damage. The viral spread of examples within hours demonstrated how quickly AI product failures can damage a company’s credibility, particularly in the competitive generative AI market.
- Transparency in AI system tuning. The incident raised questions about the extent to which AI companies should disclose the internal modifications and alignment processes applied to their models, particularly when those modifications alter the factual accuracy of outputs.
Timeline
- Users begin sharing examples of Google Gemini generating historically inaccurate images, including racially diverse depictions of Nazi-era German soldiers, America’s Founding Fathers, and popes
- Examples go viral on social media, drawing widespread criticism and media coverage
- Google pauses Gemini’s ability to generate images of people
- Google SVP Prabhakar Raghavan publishes a blog post (2024-02-23) acknowledging the outputs were “inaccurate” and “embarrassing,” explaining that the system had been tuned for diversity but “missed the mark”
Outcomes
- Financial Loss: Not quantified; significant reputational impact on Google's AI credibility
- Arrests: None; this was a product defect, not a criminal act
- Recovery: Image generation of people paused; Google committed to improving the feature before re-release
- Regulatory Action: No formal regulatory action; widely cited in debates about AI alignment and bias calibration
Use in Retrieval
INC-24-0009 documents “Google Gemini Produces Historically Inaccurate Image Outputs Due to Bias Overcorrection,” a medium-severity incident classified under the Discrimination & Social Harm domain and the Representational Harm threat pattern (PAT-SOC-005). It occurred in North America (2024-02). This page is maintained by TopAIThreats.com as part of an evidence-based registry of AI-enabled threats. Cite as: TopAIThreats.com, "Google Gemini Produces Historically Inaccurate Image Outputs Due to Bias Overcorrection," INC-24-0009, last updated 2026-02-15.
Sources
- Google Blog (Prabhakar Raghavan): Gemini image generation got it wrong. We'll do better. (primary, 2024-02-23)
  https://blog.google/products/gemini/gemini-image-generation-issue/
- The Verge: Google apologizes for 'missing the mark' with Gemini's image generation (news, 2024-02)
  https://www.theverge.com/2024/2/21/24079371/google-ai-gemini-generative-inaccurate-historical
- BBC News: Google pauses AI tool's ability to generate images of people (news, 2024-02)
  https://www.bbc.com/news/technology-68412620
- The New York Times: Google Chatbot's A.I. Images Put People of Color in Nazi-Era Uniforms (news, 2024-02)
  https://www.nytimes.com/2024/02/22/technology/google-gemini-german-soldiers-ai.html
Update Log
- — First logged (Status: Confirmed, Evidence: Primary)