[ Knowledge Base ]

Why ChatGPT Does Not Mention Your Site

A direct explanation of why ChatGPT, Perplexity, and other AI answer systems do not cite or mention your website, with specific technical causes and how to fix each one.

10 min read|Updated May 2026
Step-by-step illustration of the ChatGPT retrieval process showing the signals that determine which sites are selected and cited
Step-by-step illustration of the ChatGPT retrieval process showing the signals that determine which sites are selected and cited

Why ChatGPT does not cite your site

Being invisible to ChatGPT is a specific technical problem, not a content quality problem. ChatGPT and other AI systems have explicit, diagnosable criteria for selecting sources. Sites that do not meet these criteria are not cited regardless of how good their content is.

The path to getting cited is to understand those criteria and systematically address the gaps. This guide covers the most common reasons and what to do about each one.

How ChatGPT retrieves information

ChatGPT operates in two modes that have different implications for site optimization. The base model uses training data with a fixed knowledge cutoff. It cites sources it encountered during training. For your site to be in this pool, it must have been crawlable, indexed, and have had sufficient content quality and authority signals at the time of training.

ChatGPT with Browse, Perplexity, and Google AI Overviews use real-time web retrieval. They crawl the live web, retrieve relevant pages for the current query, and cite those pages in their responses. For these systems, fixes can have effect within days.

Top reasons ChatGPT skips your site

  • --Your content is rendered client-side and invisible to crawlers. The most common cause for AI-coded sites.
  • --GPTBot is blocked in your robots.txt. Check for User-agent: GPTBot with Disallow: / entries.
  • --You have no structured data. No Organization schema means no entity identity. No FAQPage schema means no extractable Q&A pairs.
  • --Your site has no verifiable third-party mentions. ChatGPT weights sources that appear in reliable external references.
  • --Your content is not in an answer format. Narrative-only content is harder to extract than FAQ blocks, numbered lists, and definition sections.
  • --Your domain is too new or has no authority signals. Brand-new sites with no external mentions have low training-data weight.
  • --Your site has critical crawl errors. 404 pages, redirect loops, and malformed sitemaps reduce crawl coverage.

Checking if GPTBot is blocked

Navigate to yourdomain.com/robots.txt in your browser. Look for any of the following patterns that would block OpenAI's crawler:

  • --User-agent: GPTBot followed by Disallow: /
  • --User-agent: * followed by Disallow: / (blocks all bots)
  • --User-agent: GPTBot with no Allow rules

To allow GPTBot while keeping other restrictions, add: User-agent: GPTBot followed by Allow: / before any wildcard disallow rules.

Adding the minimum required structured data

The minimum structured data set for AI citation readiness includes three schema types: Organization (establishes entity identity), WebSite (provides site-level context), and either FAQPage (for FAQ content) or Article (for editorial content).

All three should be placed in the page head as script tags with type="application/ld+json". They should be present in the server-rendered HTML, not injected after JavaScript execution.

Building verifiable authority

Authority for AI systems is primarily built through external mentions from credible sources. Product Hunt launches, press coverage, directory listings, and case studies on credible platforms all create the third-party signal trail that AI systems use to verify a brand's legitimacy.

Internal signals (good content, clear entity definitions, consistent structured data) are necessary but not sufficient. External signals are required to establish the trust weight that makes a source worth citing.

What to expect on the timeline

For real-time retrieval systems like Perplexity, technical fixes have effect within days to weeks as the crawler re-indexes your site. For training-data-based citation in base ChatGPT, the timeline depends on model update cycles, which vary.

The correct mental model is cumulative probability. Each AEO improvement raises the probability that your site is selected as a citation source. The goal is not to get cited once. It is to build the technical and authority foundation that makes citation consistently likely.

[ FAQ ]

Common questions

[ Free audit ]

See How Visible Your Site Is to AI Systems

AudFlo runs a 32-layer diagnostic across crawlability, structured data, entity signals, and authority. Free. No signup required.