I want to block Googlebot from reaching my .gz sitemap. What do I have to do?

# Google
User-agent: googlebot
Disallow: /sitemap.xml.gz/

Or:

User-agent: Googlebot-sitemap.xml.gz
Disallow: /*sitemap.xml.gz

Or:

User-agent: Googlebot
Disallow: /sitemap.xml.gz

Or can I use something like this (the first group allows Googlebot to visit my whole website)?

# Google
User-agent: googlebot
Disallow:

User-agent: Googlebot-sitemap.xml.gz
Disallow: /sitemap.xml.gz

May I use both of them together? Will this affect Googlebot's overall crawling of the whole website? I would appreciate a reply as soon as possible.
I want to block Googlebot from reaching my .gz sitemap. What do I have to do?

# Google
User-agent: googlebot
Disallow:

# Google
User-agent: googlebot
Disallow: /*sitemap.xml.gz

I found the answer in the Google Webmaster robots.txt analyzer. Now the question is: should I block the .gz sitemap for MSN as well?
In the Google Webmaster analyzer:

www.example.com/sitemap.xml: Allowed by robots.txt
www.example.com/sitemap.xml.gz: Blocked by robots.txt

Do you get my point now?
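The analyzer result above can be reproduced roughly in a few lines of Python. This is only a sketch, assuming Google-style pattern matching, where `*` matches any run of characters, a trailing `$` anchors the end, and a rule matches from the start of the URL path. The helper name `rule_matches` is made up for illustration and is not part of any Google tool.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Hypothetical helper: does a robots.txt path pattern match a URL path?

    Assumes Google-style wildcards: '*' matches any sequence of characters,
    a trailing '$' anchors the end of the path, and matching always starts
    at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.match(regex, path) is not None


rule = "/*sitemap.xml.gz"
blocked_gz = rule_matches(rule, "/sitemap.xml.gz")   # the .gz sitemap
blocked_xml = rule_matches(rule, "/sitemap.xml")     # the plain sitemap
print(blocked_gz, blocked_xml)
```

Under these assumptions the script prints `True False`: the rule catches /sitemap.xml.gz but leaves /sitemap.xml alone, which agrees with what the analyzer reported.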
So why not just block your /sitemap.xml.gz file in your robots.txt and remove it from the index using the URL removal tool in your webmaster tools?
First, you can have both versions of the sitemap and not get busted, so blocking them is silly. But if you still want to, then use:

User-agent: googlebot
Disallow: /sitemap.xml.gz
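If you want to sanity-check that rule before uploading it, here is a minimal sketch using Python's standard `urllib.robotparser`. The host www.example.com is the placeholder from this thread, and `msnbot` is only an assumed user-agent name for the MSN crawler. Note that Python's parser does plain prefix matching and does not understand `*` wildcards, so real crawlers may behave differently on patterns like /*sitemap.xml.gz.

```python
from urllib.robotparser import RobotFileParser

# The rule suggested above, exactly as it would appear in robots.txt.
ROBOTS_TXT = """\
# Google
User-agent: googlebot
Disallow: /sitemap.xml.gz
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

base = "http://www.example.com"  # placeholder host from this thread

# Is Googlebot allowed to fetch each URL?
gz_allowed = parser.can_fetch("Googlebot", base + "/sitemap.xml.gz")
xml_allowed = parser.can_fetch("Googlebot", base + "/sitemap.xml")
home_allowed = parser.can_fetch("Googlebot", base + "/")

# A crawler with no matching group (assumed name) is not affected by the rule.
other_bot_gz = parser.can_fetch("msnbot", base + "/sitemap.xml.gz")

print(gz_allowed, xml_allowed, home_allowed, other_bot_gz)
```

With this parser, only the .gz sitemap comes back as blocked for Googlebot; the rest of the site stays crawlable, and the group does not apply to other bots unless you add a group for them too.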