Hi All! I have set up a new site at my webhost. The problem is that the site is accessabel from the adress mainsite.com/newsite.com I want prevent that adress and all subsites on the new site to be indexed by robots, just to have them index the new site´s domain name. How can I fix this with robots.txt? Thanks in advance!
+1... also you can write as many subsites you want.... like the below example.. User-agent: * Disallow: /newsite.com/ Disallow: /newsite1.com/ Disallow: /newsite2.com/ Disallow: /newsite3.com/ Code (markup): Peace!
Just tro throw this into the mix, if you DIDN't want the waybackwhen machine indexing your site just add User-agent: ia_archiver Disallow: / Code (markup):