quick question about robots.txt

Discussion in 'robots.txt' started by abcdefGARY, Feb 9, 2008.

  1. #1
    hello!

    I plan to use a robots.txt file to block spiders from crawling certain pages. Just a quick question, since I'm new to this:

    User-agent: *
    Disallow: /register
    Disallow: /login

    From the code above, I just want to clarify: will it block only those two URLs from being crawled, or will it block spiders from crawling the register and login directories on my server?

    thanks.
     
    abcdefGARY, Feb 9, 2008
  2. shivam
    #2
    Yes, that is fine. Search engines will not crawl those two paths or any pages beneath them.
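
    One way to check this yourself is with Python's standard urllib.robotparser module. The sketch below feeds it the two rules from the first post and asks whether a few paths may be fetched. The sample paths are made up for illustration, and the result reflects how this particular parser matches rules (by URL prefix); individual crawlers may differ in the details.

```python
from urllib.robotparser import RobotFileParser

# The rules from the first post.
ROBOTS_TXT = """\
User-agent: *
Disallow: /register
Disallow: /login
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, user_agent="*"):
    """Return True if this parser would let user_agent fetch path."""
    return parser.can_fetch(user_agent, path)


# Hypothetical paths, chosen only to illustrate prefix matching.
for path in ["/register", "/register/step2", "/registered-users", "/login", "/about"]:
    print(path, "allowed" if is_allowed(path) else "blocked")
```

    Because matching is by prefix, "Disallow: /register" also covers /register/step2 and even an unrelated page such as /registered-users. Writing "Disallow: /register/" with a trailing slash would limit the rule to the directory, but would then leave the bare /register URL crawlable.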
     
    shivam, Feb 12, 2008
  3. dansgalaxy
    #3
    Well, it should stop bots from crawling. Bear in mind that some bad bots do not respect robots.txt.
     
    dansgalaxy, Feb 21, 2008
  4. manish.chauhan
    #4
    Then just block those bad bots' IP addresses with .htaccess.
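
    As a rough illustration, here is a small Python sketch that turns a list of addresses into .htaccess rules. It assumes Apache 2.4, where blocking is written with "Require not ip" inside a RequireAll block; older Apache 2.2 setups use "Order Allow,Deny" with "Deny from" lines instead. The function name and the sample addresses (taken from the reserved documentation ranges) are hypothetical.

```python
import ipaddress


def htaccess_deny_rules(addresses):
    """Build Apache 2.4 'Require not ip' lines for the given addresses."""
    lines = ["<RequireAll>", "    Require all granted"]
    for raw in addresses:
        # ip_network validates the value and also accepts CIDR ranges;
        # a single address becomes a /32 (IPv4) or /128 (IPv6) network.
        network = ipaddress.ip_network(raw, strict=False)
        lines.append(f"    Require not ip {network.with_prefixlen}")
    lines.append("</RequireAll>")
    return "\n".join(lines)


# Hypothetical addresses from the documentation-only ranges.
print(htaccess_deny_rules(["192.0.2.10", "198.51.100.0/24"]))
```

    Validating each entry first means a typo in the list raises a ValueError here rather than producing a broken .htaccess file.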
     
    manish.chauhan, Apr 6, 2008
  5. alpaslan11
    #5
    How can I find all the good bots for my robots.txt? Thanks, alp
     
    alpaslan11, Apr 10, 2008
  6. manish.chauhan
    #6
    I have a list of some spammy bots. You can block only those; the rest would be good bots.
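
    A minimal sketch of that approach in Python: given a list of unwanted user-agent names, write a robots.txt that shuts those bots out entirely and applies the normal path rules to everyone else. The bot names below are placeholders, not real crawlers, and the function name is hypothetical. This only has an effect on bots that actually read and obey robots.txt.

```python
def build_robots_txt(blocked_bots, disallowed_paths):
    """Return robots.txt text: named bots get 'Disallow: /', all others get the path rules."""
    blocks = [f"User-agent: {name}\nDisallow: /" for name in blocked_bots]
    # An empty Disallow line means nothing is disallowed.
    rules = [f"Disallow: {path}" for path in disallowed_paths] or ["Disallow:"]
    blocks.append("\n".join(["User-agent: *"] + rules))
    return "\n\n".join(blocks) + "\n"


# Placeholder bot names; substitute the entries from your own list.
print(build_robots_txt(["ExampleSpamBot", "AnotherBadBot"], ["/register", "/login"]))
```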
     
    manish.chauhan, Apr 10, 2008
  7. alpaslan11
    #7
    Thanks very much.
     
    alpaslan11, Apr 10, 2008
  8. manish.chauhan
    #8
    You're welcome. :)
     
    manish.chauhan, Apr 10, 2008
  9. alpaslan11
    #9
    I searched for information about spammy bots. People say you can't stop them this way because they skip robots.txt.
     
    alpaslan11, Apr 16, 2008
  10. manish.chauhan
    #10
    Yes, you are right. Some spammy bots don't follow the robots.txt instructions. In that case you can track their IP addresses and block them using .htaccess. :)
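
    One possible way to do the tracking step, sketched in Python: scan the server access log for requests to the paths that robots.txt disallows and count them per IP address. This assumes Apache's common or combined log format (client IP first, request line in double quotes); the function name, the threshold, and the sample log lines are made up for illustration. Real visitors also open /login and /register, so a hit on its own proves nothing; treat the output as a list of candidates to review, not a ready-made blocklist.

```python
import re
from collections import Counter

# Paths disallowed in the robots.txt from the first post.
DISALLOWED_PREFIXES = ("/register", "/login")

# Assumed log layout: IP, two fields, [timestamp], "METHOD PATH PROTOCOL", ...
LOG_LINE = re.compile(r'^(\S+) \S+ \S+ \[[^\]]*\] "(?:\S+) (\S+)[^"]*"')


def candidate_offenders(log_lines, min_hits=3):
    """Return sorted IPs that requested disallowed paths at least min_hits times."""
    hits = Counter()
    for line in log_lines:
        match = LOG_LINE.match(line)
        if match and match.group(2).startswith(DISALLOWED_PREFIXES):
            hits[match.group(1)] += 1
    return sorted(ip for ip, count in hits.items() if count >= min_hits)


# Hypothetical log lines using documentation-only addresses.
sample = [
    '192.0.2.10 - - [10/Apr/2008:10:00:00 +0000] "GET /login HTTP/1.1" 200 512',
    '192.0.2.10 - - [10/Apr/2008:10:00:01 +0000] "GET /register HTTP/1.1" 200 640',
    '192.0.2.10 - - [10/Apr/2008:10:00:02 +0000] "GET /login HTTP/1.1" 200 512',
    '198.51.100.5 - - [10/Apr/2008:10:05:00 +0000] "GET /login HTTP/1.1" 200 512',
    '203.0.113.9 - - [10/Apr/2008:10:06:00 +0000] "GET /index.html HTTP/1.1" 200 2048',
]
print(candidate_offenders(sample))
```

    Checking the User-Agent field as well, where the log records it, helps separate crawlers from people before anything is added to .htaccess.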
     
    manish.chauhan, Apr 20, 2008