Operated by Google
The web crawler for Google Cloud's Vertex AI platform, used for data ingestion and model testing.
The web crawler for Google Cloud's Vertex AI platform, used for data ingestion and model testing.
Google-CloudVertexBot is operated by Google as part of their cloud infrastructure stack. It may perform security scanning, CDN pre-warming, or threat intelligence collection. It uses the user-agent Google-CloudVertexBot. Evaluate whether your site uses Google services before blocking, as this crawler may be required for service functionality.
<code>User-agent: Google-CloudVertexBot</code> — Matching is case-insensitive. Robots.txt is fetched from the root of each subdomain separately.
Understanding Google-CloudVertexBot's purpose helps you decide whether to allow or block it.
Google-CloudVertexBot. This is the exact string you must use in robots.txt, Nginx, Apache, or Cloudflare firewall rules to target this bot. User-agent matching in robots.txt is case-insensitive, but the string must be spelled correctly. You can verify that a request genuinely comes from Google-CloudVertexBot by performing a reverse-DNS lookup on the source IP — legitimate bots resolve back to their operator's domain./robots.txt file:
User-agent: Google-CloudVertexBot Disallow: /This instructs Google-CloudVertexBot not to crawl any path on your site. The Disallow: / directive covers the entire domain including subfolders. To only block specific sections, replace / with the path (e.g.,
Disallow: /blog/). Note: robots.txt is publicly readable — any bot or human can inspect it at yourdomain.com/robots.txt.Google-CloudVertexBot (case-insensitive grep: grep -i "Google-CloudVertexBot" /var/log/nginx/access.log). You can also check Google Search Console → Coverage → Crawl Stats for Googlebot variants. For Google-CloudVertexBot specifically, filter by user-agent in your log analysis tool (GoAccess, AWStats, etc.).Disallow: / you can restrict Google-CloudVertexBot to specific paths:
User-agent: Google-CloudVertexBot Disallow: /private/ Disallow: /staging/ Allow: /This allows Google-CloudVertexBot everywhere except the listed paths. Path matching in robots.txt uses prefix matching —
Disallow: /private/ blocks /private/page.html but NOT /public/private/.Check instantly with our free AI Bot Checker
Check Your Website