Hi, Am I correct? I want to put a robots.txt in my site www.arghcade.com I put the following as below. 1. sitemap location 2. User-agent:* (actually I dont' know what it is for, but someone advise me to do that) 3. I want to ban the SE to crawl my directly /funny 4. I wan to ban the SE to crwal my "add-on domain" /allfunny So, I put the following line in my robots.txt Please help to comment if I am OK. Thanks. --- robots.txt -- Sitemap: http://www.arghcade.com/sitemap.xml User-agent: * Disallow: /funny Disallow: /allfunny
User-agent: * means all robots will follow the instructions if u write User-agent: Googlebot then only googlebot ll follow the instructions Disallow: /dir will instruct bot not to crawl 'dir' yes.
User-agent: * is ok will invite all the robots to crawl your site. Disallow: /dir will instruct not to crawl this directory and for all subdirectories to the robots.