Navigated to Humanity Centered Data | UN Refugee & IDP Tracker
    All AI articles
    Ethics

    Generative AI in Humanitarian Reporting: Risks, Workflows, and Editorial Standards

    By the Humanity Centered Data Editorial Team
    June 19, 202610 min read

    Why this is suddenly an editorial question

    By mid-2026, generative AI is in the workflow of nearly every major newsroom covering humanitarian crises. The Reuters Institute Digital News Report 2025 found that more than half of surveyed publishers were using generative tools in production. In humanitarian reporting the stakes are higher than in most beats: a wrong casualty figure travels into briefings, a hallucinated quote can be attributed to a vulnerable source, and machine-translated testimony can subtly distort meaning. Editorial standards have not kept pace, and several international outlets have publicly retracted AI-assisted humanitarian stories since 2023.

    What the workflow looks like when it works

    The defensible pattern shared across The Associated Press, The Guardian, and the BBC is convergent: AI is used for bounded tasks — first-pass summarisation of a primary source the reporter has already read, translation of a document the reporter will have a human translator verify, structuring of a dataset the reporter will spot-check, drafting of explanatory boilerplate that the reporter will rewrite. AI is not used for primary fact generation, for sourcing quotes, or for any task where a hallucination cannot be detected by a human reader of average attention.

    The hallucination failure modes that keep recurring

    Three patterns dominate post-mortems of AI-assisted humanitarian errors: plausible casualty figures with no source (the model interpolates a number consistent with the prose around it); synthesised quotes attributed to named officials, especially in translation tasks; and fabricated incident dates that match the narrative arc the model has been asked to produce. All three are detectable only by checking against the primary source. The model will not flag them; it is confident by design.

    Translation is the highest-risk surface

    Machine translation of testimony from Arabic, Tigrinya, Pashto, Dari, Ukrainian, and Burmese is where most humanitarian newsrooms see the biggest productivity gain and the biggest risk. LLM-based translation outperforms older statistical systems on fluency but introduces subtle meaning shifts that a non-speaker editor cannot catch. The defensible standard is: never publish translated direct quotes from a primary source without a human speaker verifying the translation; treat machine-translated background documents as leads, not as evidence.

    A red-line checklist newsrooms can adopt today

    • No AI-generated quotes, full stop.
    • No AI-generated casualty, displacement, or financial figures without a cited primary source the editor has read.
    • No AI-translated direct quotes without human verification by a speaker of the source language.
    • No AI-generated imagery of identifiable people in humanitarian contexts.
    • Every AI-assisted piece carries an internal log of which tool was used for which task; the log is auditable.
    • Public-facing disclosure where the AI contribution is material to the published output, consistent with the Partnership on AI responsible practices.

    The harder editorial debates that are still unresolved

    Three debates are unsettled. Synthetic voiceover of refugee testimony for radio and podcast accessibility — defensible if the speaker consented in writing to the specific synthesis, otherwise not. AI-assisted source aggregation in OSINT investigations, where the model proposes correlations the reporter then verifies; mostly defensible if the verification chain is documented. AI-generated images of past atrocities as illustration; consensus across major outlets in 2026 is do not do this.

    Further reading and primary sources

    • Reuters Institute Digital News Report: https://reutersinstitute.politics.ox.ac.uk/digital-news-report
    • BBC Editorial Guidelines: https://www.bbc.co.uk/editorialguidelines/
    • AP standards on AI: https://www.ap.org/
    • The Guardian on AI: https://www.theguardian.com/info/series/guardian-and-ai
    • Partnership on AI: https://partnershiponai.org/
    • IFRC Code of Conduct: https://www.ifrc.org/document/code-conduct
    Models

    Can Large Language Models Understand Humanitarian Data? We Tested It

    A structured stress test of leading LLMs against UNHCR, OCHA, IDMC, and ACLED data. Where they genuinely help, and where they confidently mislead.

    12 min read
    Ethics

    The Risks of AI in Humanitarian Work: Bias, Privacy, and Accountability (2026)

    AI tools are now woven through humanitarian operations. The benefits are real and so are the risks. A frank look at the bias, privacy, and accountability gaps shaping the sector in 2026.

    10 min read
    Ethics

    Who Is Responsible When AI Gets It Wrong in a Refugee Crisis? (2026)

    When a model misclassifies a protection case or a biometric system locks a refugee out of food assistance, accountability is rarely clear. A close look at how responsibility is distributed in 2026 and where the gaps sit.

    8 min read
    Ethics

    Can AI Be Neutral? The Problem of Bias in Humanitarian Data (2026)

    AI systems learn from data that reflects who was easy to count and easy to reach. In humanitarian work, those gaps map directly onto vulnerability. A practical look at why AI cannot be neutral and what to do about it in 2026.

    7 min read
    Ethics

    Data Privacy for Displaced People: What AI Systems Are Collecting in 2026

    Biometrics, location, family ties, medical records, and protection flags all flow through AI systems used by humanitarian agencies. A clear look at what is being collected, who can see it, and what the risks are for displaced people in 2026.

    9 min read
    Ethics

    AI and the Do No Harm Principle: Can They Coexist? (2026)

    Do No Harm is the foundational principle of humanitarian action. AI systems introduce new categories of risk that the original framing did not anticipate. A practical assessment of whether the two can be reconciled in 2026.

    6 min read
    Advertisement
    Advertisement