is i open this then see this result http://www.google.com.com/robots.txt User-agent: * Disallow: /click Disallow: /reference/Astro_(satellite_TV) Disallow: /reference/Astro_(Satellite_TV) Disallow: /reference/Satellite_TV Disallow: /reference/Satellite_tv any url i type at the and i add .com extra then i see this is only i see this or you all also see this
That robot.txt it's to tell spiders what they are not allowed to crawl. In this case, yahoo/msn/google, is "restricted" to crawl into those "sub"directories(sub-pages)