RETURN_TO_BLOG
Updated: AI & SEO 11 min

How to Get Cited in ChatGPT — an LLM Citation Strategy

Paweł Wiszniewski
Paweł Wiszniewski
SEO & GEO Specialist · AI Engineer

More and more people no longer type questions into Google — they ask ChatGPT. And when ChatGPT answers, it often cites specific sources. The question is: is your site among them? This guide explains how ChatGPT selects and cites sources, what actually increases your chances of being cited (based on research, not wishful thinking), and how to prepare your content and brand step by step so the model reaches for you.

When ChatGPT answers, it cites specific sources — is yours among them? I explain how ChatGPT selects and cites sources (two knowledge layers + live retrieval), what actually increases citability per research, and how to prepare your content and brand step by step.

This is GEO (Generative Engine Optimization) in practice, narrowed to the single most important engine today. If you're just starting, begin with SEO vs GEO — Google vs visibility in AI; here we go deeper, specifically into ChatGPT.

How ChatGPT "knows" — two layers of knowledge

To optimize deliberately, you need to understand where ChatGPT's answers come from. There are two layers:

  • Training knowledge (parametric). What the model "learned" up to its cutoff date. Static, no citations, no fresh data. What matters here is whether your brand and the facts about it appeared in the training data at all (i.e. whether you exist on the web, are mentioned, described).
  • Live retrieval (search). When a question needs current information, ChatGPT reaches out to the web, fetches a few pages and synthesizes an answer with citations (usually a few linked sources). This is where the real fight for citation happens.

It's estimated that a portion of conversations trigger a search (on the order of low double-digit percent — third-party data, treat as indicative). Practical takeaway: you want to be visible in both layers — as a recognizable entity in the training data and as a citable source in live results.

The hard dependency: first you must be "retrievable"

The most common mistake: someone wants to "optimize for ChatGPT" while their site isn't even accessible to AI bots. Live retrieval is retrieval + grounding: the model pulls candidates from an index, reads passages and cites the most-used ones. If you're not in that set — you won't be cited.

  • Allow AI bots in `robots.txt`: GPTBot, OAI-SearchBot, ChatGPT-User.
  • Take care of classic indexability — ChatGPT search has historically relied on the Bing index, so being indexed in Bing (and Google) is a prerequisite. Don't neglect Bing Webmaster Tools.
  • Make sure content is available without a JavaScript wall and without login.

The same technical groundwork you do for SEO (Core Web Vitals, structured data, crawlability) is the foundation of AI visibility.

What actually increases your chances of being cited

Here's where the data comes in. The GEO-bench study (the academic paper "GEO: Generative Engine Optimization", KDD 2024) tested content modifications and measured their impact on visibility in generative answers. The three most effective edits were:

  • adding source citations,
  • adding quotations,
  • adding statistics.

They can raise a source's visibility by up to ~40% (in the benchmark). Crucially, keyword stuffing did not help. Models reward substantive, evidence-backed, quotable content. (Caveat: these are figures from a controlled benchmark, not a guarantee on live ChatGPT — treat as a strong direction.)

On top of that come selection signals confirmed by practice:

SignalWhy it works
Content extractabilityDirect answers, Q&A headings, passages that stand alone — the model lifts them easily
Statistics and quotesConcrete numbers and quotable sentences get woven into the answer more readily
Authority / E-E-A-TModels prefer to ground answers in credible sources
Entity consistencyConsistent brand data + schema help the model recognize "who you are"
Third-party corroborationMentions/citations on other credible sites (a large share of AI citations come from third parties)
FreshnessCurrent content is cited more often for questions that need recency

A citation strategy — step by step

  1. 1.Open the paragraph with the answer. The first sentence should answer the heading's question — no run-up. That's the sentence the model can lift and cite.
  2. 2.Write self-contained passages. Each paragraph should make sense out of context (the model rarely cites a whole page — it cites a passage).
  3. 3.Add numbers and sources. A concrete statistic with a source beats a generality. It's the highest lever per the study.
  4. 4.Structure for extraction. Descriptive H2/H3, FAQ sections, lists, tables.
  5. 5.Build the entity. Consistent brand, Schema.org and an entity graph, an author with credentials, a profile linked to Wikidata — so the model knows you're a credible, recognizable source.
  6. 6.Earn external mentions. Presence on authoritative third-party sites (industry media, Reddit, Wikipedia if you qualify) genuinely increases citability.
  7. 7.Keep it fresh. Refresh key content and dates — recency can be a factor.

A word on llms.txt: it's an emerging idea for a file pointing models to your content, but OpenAI has not confirmed using it as a signal. You can add it at low cost — don't treat it as a ranking factor.

How to measure whether ChatGPT cites you

You won't measure this in Search Console. You need AI Share of Voice — the share of answers (across a defined set of questions) in which the model mentions or cites your brand relative to competitors. Tools like Profound or Otterly serve this. I break down a practical approach in How to measure Share of Voice in AI.

Common mistakes

  • Site blocked for GPTBot/OAI-SearchBot or absent from the Bing index.
  • Fluffy content with no specifics — nothing to cite.
  • A JavaScript wall or content behind login.
  • No consistent entity (the model can't tell it's the same brand).
  • Counting on llms.txt as a magic switch.

---

I help brands get into AI answers — from an LLM citation strategy to full optimization for AI (GEO). I teach it all from the ground up in the SEO & GEO course. Get in touch — I'll start by auditing whether ChatGPT already cites you and what to change so it does.

Worth reading next:

Paweł Wiszniewski – SEO & GEO Specialist & AI Engineer
About the authorPaweł Wiszniewski

SEO & GEO specialist and AI engineer from Białystok. 10 years building search visibility for recognized brands and 3 years delivering AI — agents, automation and LLM integrations (Next.js, React, Node.js).

/// AUTHOR
Paweł Wiszniewski – AI & Web Engineer

Paweł Wiszniewski

SEO & GEO Specialist & AI Engineer

SEO/GEO specialist (10 years) and AI engineer (3 years). I build search visibility, AI systems and automations that reduce costs and improve operational efficiency.

Signal received?

Terminate
Silence

Initiate protocol. Establish connection. Let's build something loud.

> WAITING_FOR_INPUT...