CiteOwl

Hallucinated Citations Are Now Getting Papers Retracted

Hallucinated citations and research misconduct used to be a problem of student drafts, where a fabricated reference cost a few marks. In 2026 they reached the published record: journals are retracting and correcting papers over invented references, peer review is letting them through, and publishers are starting to treat AI fake citations that get retracted as misconduct rather than an honest slip. For a student or early-career researcher, that changes the stakes. Checking your sources is no longer a nice habit, it is the thing standing between you and a correction notice with your name on it.

We have written before about why AI makes up citations and how common it has become. This piece is about what happens next: the consequences once a fabricated reference is in a finished paper, and why that should change how you work long before you ever submit to a journal.

The line that used to protect the literature has moved

The published scientific record was supposed to be the safe layer. Editors, peer reviewers, and copy editors all stood between a draft and print, and the assumption was that fabricated references would not survive all of them. That assumption is failing. Invented citations are now turning up in papers that cleared review and were published, which means the screening layer most people trusted is no longer catching them reliably.

The shift is measurable. An analysis covered by Retraction Watch in May 2026 found fabricated references in roughly one in 277 PubMed-indexed papers published in early 2026, up from about one in 2,828 in 2023 and one in 458 in 2025. The same audit, which examined nearly 2.5 million papers, flagged 4,406 fabricated references across 2,810 of them. These are not chatbot transcripts. They are entries in the formal literature that other researchers will read, cite, and build on.

Fabricated references appeared in about one in 277 PubMed-indexed papers in early 2026, up from one in 2,828 in 2023. An audit of nearly 2.5 million papers flagged 4,406 fabricated references across 2,810 of them.

Nature and STAT reported the same pattern: a steep rise in fraudulent citations slipping past peer review, with generative AI the most likely cause. The trend line is the story. A reference style that produced a handful of fakes a few years ago is now seeding them into the record at scale, and the rate is climbing year on year.

Peer review is not catching them

It is tempting to assume expert reviewers would notice an invented source. In practice they often do not, and the reason is structural. A peer reviewer reads for the argument, the method, the statistics, and whether the conclusions follow. Nobody hands them a list of every DOI to resolve one by one. A well-formed citation, with a plausible author, a real-sounding journal, and a correctly formatted DOI, reads as routine and gets waved through with the rest of the reference list.

This has already reached the top of the field. An examination of one major AI conference found fabricated references in dozens of accepted papers, each having passed review by several expert reviewers. If the people closest to how these models work miss invented citations in their own venue, the lesson for a student is blunt: do not assume anyone downstream will catch a fake for you. The check has to happen before the reference is in the document, not after.

There is a quieter point here too. When fabricated references survive review, they enter the citation graph. A later author finds the fake entry, trusts that it cleared peer review, and cites it again. One hallucinated source can propagate into several real papers before anyone notices it never existed, which is exactly why publishers are now treating the problem as urgent rather than cosmetic.

From honest error to misconduct

The framing is hardening, and this is the part that matters most for your own work. A fabricated reference used to be filed under sloppiness: an honest mistake, a citation manager gone wrong, a draft that needed another pass. Increasingly it is being treated as a misconduct problem instead.

The logic is simple once you state it. A citation is a factual claim. It asserts that a specific paper exists, that named authors wrote it, and that it says something relevant to the sentence in front of it. A hallucinated citation is false on every count, and it puts that false claim into the permanent record under the author's name. Whether a model produced it or a person typed it changes nothing about what is on the page. Institutions and publishers tend to judge the work by what it contains, not by which tool helped write it, so "the AI generated that reference" carries about as much weight as "the citation manager generated it." The author signed the paper.

That is why the response from journals is not just a correction. A single mangled reference might get fixed. A pattern of fabricated ones, or a fake that materially props up a result, is the kind of thing that leads to retraction and, increasingly, to a misconduct inquiry. A retraction follows a paper permanently. It is searchable, it attaches to your name, and it does not come off when you explain that you trusted a tool.

What this means for a student or early-career researcher

You might think this is a problem for published authors and large labs, not for someone writing a coursework essay or a first thesis chapter. The opposite is true. You are the most exposed, for three reasons.

First, you are likely leaning on general-purpose AI more heavily than a senior researcher with a decade of read papers in their head. The references you cannot personally vouch for are precisely the ones a model is most likely to have invented, because a fabrication thrives where you have the least ability to check it.

Second, the people grading you are stretched. If expert peer reviewers miss fabricated citations, a marker working through forty essays in a weekend will too, right up until the one time they decide to check a reference that looked too convenient. The fake does not have to be caught every time to ruin a submission. Once is enough.

Third, the consequence lands on you alone. A failed assignment, a delayed thesis, an academic-integrity meeting, or a note in your record does not care that the tool was confident. As we cover in why AI makes up citations, the model has no way to know whether the paper it just described is real, and adding "only use real sources" to your prompt does not give it one. The verification is yours to do, and now the cost of skipping it has moved from a few marks toward the same category of trouble that ends published papers.

Verifying is no longer optional

The practical takeaway is not complicated, it is just non-negotiable now. Every reference in your bibliography has to be one you could find again on your own, and every claim you keep has to trace back to a source you have actually opened.

At minimum, that means a fast check on each reference before it stays in your draft: search the exact title, confirm the DOI resolves, and make sure the source actually says what you cited it for. None of it is hard, and it takes seconds per entry. We lay out a repeatable routine in why you can't find the citation ChatGPT gave you, including the catch that trips most people up, which is that a fake reference can have a perfectly formatted DOI that resolves to nothing.

Verifying after the fact works, but it is the weaker half of the fix. It puts you in the position of an unpaid reviewer for your own chatbot, hunting for the fabrications it left behind and hoping you find all of them. The fabrications you miss are the ones that hurt you, and you only find out which those were when someone else does.

The fix that removes the failure at the source

The durable answer is to change the order of operations so there is nothing to fabricate. A general chatbot writes first and produces references to match, which is why it can invent one: the citation is generated text like any other. Flip that around. Find and read the real papers first, then write only the claims you can attach to one of them, and a hallucinated citation has nowhere to enter. When the source comes before the sentence, there is no gap for a fake to fill.

That is the design principle behind a tool built for this rather than a general assistant pointed at the task. It is also what separates the options when you compare AI tools for academic writing: not how fluent the prose is, but whether the system retrieves and reads a real paper before it cites one.

CiteOwl works that way on purpose. It searches for real papers, reads them, and only cites what it actually found, then shows you the quote behind each reference so you can see the basis for the claim. Every citation is a real source you can check, because none of them were guessed. That is verify-first, and in a year when a hallucinated citation can pull a paper out of the literature, it is the difference between speed you can trust and speed that signs your name to something that was never there.

CiteOwl reads the paper before it cites it

It searches real sources, reads them, and cites only what it found, with the quote behind each one, so there is no hallucinated reference to catch later.

Start writing

Things worth knowing.

Can a hallucinated citation really get a paper retracted?

Yes. Journals are now correcting and retracting papers when fabricated references are found, and the number of papers with invented citations in the published record is rising sharply. A reference that points to nothing real is a factual error in the paper, and a pattern of them can trigger a misconduct review.

Doesn't peer review catch fake references?

Often not. Reviewers read for argument and method, not by resolving every DOI, so fabricated citations have survived review even at major venues. One examination of a leading AI conference found invented references in dozens of accepted papers that several expert reviewers had passed.

Is a fabricated reference treated as misconduct or an honest mistake?

Increasingly as misconduct. Institutions and publishers tend to judge the work you submit by what is in it, not by which tool produced it. Intent matters less than the false claim on the page, so a fabricated reference can be handled the same way as one you invented by hand.

How do I avoid this in my own work?

Verify every reference before it lands in your bibliography, and write from sources you have actually opened. The most reliable path is to find and read the paper first, then write the claim, so there is never an unverified citation to catch later.

Read next.