Operated by Google
Googlebot-Discovery is used by Google to crawl content specifically for the Google Discover feed on mobile devices.
Googlebot-Discovery is one of Google's specialised crawlers, distinct from the general Googlebot. Each specialised crawler serves a specific Google product (Images, Video, News, Discover, etc.); this one identifies itself with the user-agent Googlebot-Discovery. Selectively blocking a specialised crawler disables the corresponding Google feature for your site (e.g., blocking Googlebot-Image removes your images from Google Image Search). Always verify which Google product is affected before blocking.
<code>User-agent: Googlebot-Discovery</code> — Matching is case-insensitive. Robots.txt is fetched from the root of each subdomain separately.
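Because rules are fetched and applied per host, a quick way to see how a given robots.txt file treats Googlebot-Discovery is Python's standard-library parser. This is a minimal sketch; the example.com URLs are placeholders for your own hosts:

<code>
from urllib.robotparser import RobotFileParser

# robots.txt lives at the root of each host, so each subdomain
# (www.example.com vs. blog.example.com) is governed by its own file.
parser = RobotFileParser("https://www.example.com/robots.txt")
parser.read()  # fetch and parse the live file

# Would Googlebot-Discovery be allowed to fetch this URL?
# User-agent matching is case-insensitive, as in robots.txt itself.
print(parser.can_fetch("Googlebot-Discovery",
                       "https://www.example.com/private/page.html"))
</code>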
Googlebot-Discovery is verifiable via reverse-DNS lookup on the crawling IP addresses. You can safely allow it unless you have a specific reason to block it (e.g., an AI training opt-out or limiting SEO tool visibility); understanding the bot's purpose helps you decide which way to go.
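For a programmatic version of that check, here is a minimal sketch using forward-confirmed reverse DNS. It assumes Googlebot-Discovery resolves like Google's other documented crawlers (a PTR record under googlebot.com or google.com whose A record points back to the same IP); the IP shown is only an illustration:

<code>
import socket

def is_google_crawler(ip: str) -> bool:
    """Forward-confirmed reverse DNS check for a Google crawler IP."""
    try:
        # Reverse lookup: IP -> hostname (PTR record)
        hostname, _, _ = socket.gethostbyaddr(ip)
    except OSError:
        return False
    if not hostname.endswith((".googlebot.com", ".google.com")):
        return False
    try:
        # Forward lookup: hostname -> IPs (A records)
        forward_ips = socket.gethostbyname_ex(hostname)[2]
    except OSError:
        return False
    # Legitimate crawlers resolve back to the original IP
    return ip in forward_ips

# Example: check an IP taken from your access log
print(is_google_crawler("66.249.66.1"))
</code>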
The exact user-agent string is Googlebot-Discovery. This is the string to use in robots.txt, Nginx, Apache, or Cloudflare firewall rules to target this bot. User-agent matching in robots.txt is case-insensitive, but the string must be spelled correctly. You can confirm that a request genuinely comes from Googlebot-Discovery with the reverse-DNS check above; legitimate bots resolve back to their operator's domain. To block Googlebot-Discovery entirely, add the following to your /robots.txt file:
<code>
User-agent: Googlebot-Discovery
Disallow: /
</code>

This instructs Googlebot-Discovery not to crawl any path on your site. The Disallow: / directive covers the entire domain, including subfolders. To block only specific sections, replace / with the path (e.g., Disallow: /blog/). Note: robots.txt is publicly readable; any bot or human can inspect it at yourdomain.com/robots.txt.

To confirm whether the bot is visiting your site, search your server access logs for Googlebot-Discovery (case-insensitive grep: <code>grep -i "Googlebot-Discovery" /var/log/nginx/access.log</code>). You can also check the Crawl Stats report in Google Search Console for Googlebot variants, or filter by user-agent in your log analysis tool (GoAccess, AWStats, etc.); a log-parsing sketch follows the path-restriction example below.

Instead of a blanket Disallow: /, you can restrict Googlebot-Discovery to specific paths:
<code>
User-agent: Googlebot-Discovery
Disallow: /private/
Disallow: /staging/
Allow: /
</code>

This allows Googlebot-Discovery everywhere except the listed paths. Path matching in robots.txt uses prefix matching: Disallow: /private/ blocks /private/page.html but NOT /public/private/.
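And here is the log-parsing sketch mentioned above, for seeing what the bot actually requests. It assumes nginx's default "combined" log format and the /var/log/nginx/access.log path from the grep example; adjust both for your server:

<code>
import re
from collections import Counter

LOG_PATH = "/var/log/nginx/access.log"   # adjust to your server's log location
BOT = "googlebot-discovery"              # matched case-insensitively

hits = Counter()
with open(LOG_PATH, encoding="utf-8", errors="replace") as log:
    for line in log:
        if BOT in line.lower():
            # In nginx's "combined" format the request path is the second
            # token inside the quoted request line, e.g. "GET /page HTTP/1.1"
            match = re.search(r'"[A-Z]+ (\S+) HTTP/[^"]+"', line)
            if match:
                hits[match.group(1)] += 1

# Most-requested paths, to see what the bot is crawling
for path, count in hits.most_common(10):
    print(f"{count:6d}  {path}")
</code>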