Hallucinated Citations 12x in Biomedical Papers

Columbia University-led researchers audited 2.47 million biomedical papers and 97.1 million references, finding 4,046 fabricated citations across 2,810 papers.
Fabricated-citation rate held at ~4 per 10,000 papers through 2023 but climbed to 51.3/10K by end-2025 and 56.9/10K in early 2026 — over 12x the baseline.
The study, led by Maxim Topaz and published in The Lancet, attributes the spike to widespread LLM use (with caveats for paper mills and other causes).
Researchers call for automated reference checks before publication; arXiv has already introduced one-year author bans.

What Happened

The rate of fabricated references in biomedical papers has increased more than twelvefold since 2023, according to a Columbia-led study published in The Lancet. The audit covered 2.47 million biomedical papers and 97.1 million references in PubMed Central, January 2023 through February 2026. The team was led by Maxim Topaz.

Why It Matters

The fabricated citations are particularly risky in review articles that shape clinical guidelines. Practicing physicians and guideline committees rely on the cited literature to update standards of care; when the underlying citations don’t exist, the chain breaks. Beyond the immediate clinical risk, the finding is among the cleanest empirical demonstrations of how generative AI is degrading scholarly publishing as a knowledge infrastructure.

The result lands alongside arXiv’s recently introduced one-year author ban for submissions with unchecked LLM-generated content — described by computer-science section chair Thomas Dietterich as a “one-strike” policy.

Technical Details

Out of 97.1 million references checked, 4,046 were flagged as fabricated, spread across 2,810 papers. A reference counted as fabricated if its listed title could not be found in any of four major databases: PubMed, Crossref, OpenAlex, and Google Scholar. Throughout 2023, the rate held steady at about 4 fabricated references per 10,000 papers. Starting in mid-2024, the rate climbed sharply — 51.3 per 10,000 by end of 2025 and 56.9 per 10,000 in the first seven weeks of 2026.

The authors suspect an obvious link to language models like ChatGPT, which took off in late 2022. Since biomedical papers typically take 100-200 days from submission to publication, AI-generated text would not show up in PubMed Central at scale until mid-2024 — exactly when the rate began climbing.

Who’s Affected

Clinicians using literature reviews to inform care decisions face a polluted information base. Editors at biomedical journals must implement detection workflows. Authors using LLM assistance face higher responsibility for verification. Publishers — Elsevier, Springer Nature, Wiley, BMJ — face procedural-process redesign. Patient-advocacy groups, regulators (FDA, EMA, MHRA), and clinical-guideline-issuing bodies face the deepest downstream risk if fabricated citations have shaped issued guidelines.

What’s Next

The Columbia team calls for automated reference checks before publication and retroactive screening of already-published papers. Major publishers and preprint servers will need to adopt these workflows. Industry tools for automated reference verification exist (Scite, Inciteful, Crossref Similarity Check) but are not universally deployed. Expect policy responses from major medical journals through the second half of 2026.

AI-Hallucinated Citations 12x More Common in Biomedical Papers

What Happened

Why It Matters

Technical Details

Who’s Affected

What’s Next

Related Reading

Enjoyed this story?

AI-Hallucinated Citations 12x More Common in Biomedical Papers

What Happened

Why It Matters

Technical Details

Who’s Affected

What’s Next

Related Reading

Enjoyed this story?

Alberta Government Scanned 466M Lines of Code With Claude in 20 Hours

OpenAI’s Head of Safety Departs Amid a Broader Safety-Team Exodus

Researcher Used Claude to Uncover a Ticketing Flaw at Most US Festivals