Question 1

What is GPTBot and what does it do?

Accepted Answer

GPTBot is OpenAI's web crawler that collects web content for AI training data and real-time retrieval for ChatGPT Search. Sites that allow GPTBot can have their content indexed and cited by ChatGPT. Blocking GPTBot prevents your content from appearing in ChatGPT responses.

Question 2

How do I allow or block GPTBot?

Accepted Answer

Control GPTBot access through your robots.txt file. Add "User-agent: GPTBot" followed by "Allow: /" to permit full access, or "Disallow: /" to block it entirely. You can also allow or block specific paths. Without an explicit rule, the User-agent: * rules apply.