Prohibit SE to crawl subdomain

Discussion in 'robots.txt' started by kie, Jan 23, 2008.

  1. #1
    Hi frens, I want to know how to block Search Engine to crawl our subdomain. I ever read that is something to do with the robot.txt
    So what I want to know, what exactly should I write and where shoul I put them? Example my domain is: www.example.com and my subdomain is very.example.com
    Thanks :D
     
    kie, Jan 23, 2008 IP
  2. Pwner

    Pwner Peon

    Messages:
    176
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #2
    Write and save this as robots.txt, then upload it in your root folder of the subdomain:
    User-agent: *
    Disallow: /
     
    Pwner, Jan 23, 2008 IP
  3. kie

    kie Peon

    Messages:
    330
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #3
    hi thanks, so if I do that then no way for any SE to crawl my site rite? =) is there anything we should do again to completely don't let SE to crawl our site? Thanks =)
     
    kie, Jan 23, 2008 IP
  4. Pwner

    Pwner Peon

    Messages:
    176
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #4
    For more "security" you can put this in the code of the site, somewhere after <head>:

    <meta name="robots" content="noindex, nofollow" />
    <meta name="robots" content="noarchive" />
     
    Pwner, Jan 24, 2008 IP
  5. kie

    kie Peon

    Messages:
    330
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Thanks Thanks for the info Pwner, God Bless you :)
     
    kie, Jan 24, 2008 IP
  6. catanich

    catanich Peon

    Messages:
    1,921
    Likes Received:
    40
    Best Answers:
    0
    Trophy Points:
    0
    #6
    It should be noted that the SEs will index (read) everything they can unless told not to. You must use the robots.txt file to tell the SEs what not to index.

    User-agent: *
    Disallow: /sub-domain/
    Disallow: /sub-domain/index.htm

    You should go to www.robotstxt.org/ for the full syntax that is used.
     
    catanich, Feb 2, 2008 IP
  7. Pwner

    Pwner Peon

    Messages:
    176
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #7
    If you put Disallow: / there is no need to indicate the subfolders because you are blocking the root folder.
     
    Pwner, Feb 2, 2008 IP
  8. manish.chauhan

    manish.chauhan Well-Known Member

    Messages:
    1,682
    Likes Received:
    35
    Best Answers:
    0
    Trophy Points:
    110
    #8
    Dear if you want to block your subdomain very.example.com to crawl then you need to put these lines in your robots.txt:

    User-Agent: *
    Disallow: /

    and then put this robots.txt file into your subdomian root folder..
    I hope it'll help you..:)
     
    manish.chauhan, Apr 7, 2008 IP