How i can stop unidentified bots?

Discussion in 'Site & Server Administration' started by skseofleet, Mar 30, 2012.

  1. #1
    Hi all,

    Please assist me to control on unidentified bots. Some bad unidentified bots relentlessly accessing bandwidth of my website. I have disallow all robots by robots.txt except google, yahoo and some others. I also set the 403 error for most known bad bots by .htaccess but still here is no improvement in bandwidth usage prevention. I got tied by doing so many methods to stop them but my all efforts are inefficient. Please suggest me how i can deal with this situation....
     
    skseofleet, Mar 30, 2012 IP
  2. janinjackson

    janinjackson Peon

    Messages:
    136
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #2
    robots.txt has no effect on bad robots, because they probably don't read it and definitely don't obey it.

    Blocking robots by htaccess will not prevent them from trying to get in. You will still see them in your logs. But all they take is a few hundred bytes for a 403, instead of the multiple Ks or MBs they would get if they reached the real page.
     
    janinjackson, Apr 1, 2012 IP
  3. tiffanywilliams12i2

    tiffanywilliams12i2 Peon

    Messages:
    164
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #3
    you can always use mod_Sec how my host helps me/
     
    tiffanywilliams12i2, Apr 5, 2012 IP
  4. henrywilliams

    henrywilliams Peon

    Messages:
    119
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #4
    If you know what bot it is then why don't you block it with robots.txt?
    If you don't know it's name then you can allow a number of bots and keep rest block for your website. Add the following instruction to your robots.txt:

    User Agent (Name of allowed bots like Google Bot)
    Disallow

    User Agent *
    Disallow /
     
    henrywilliams, Apr 6, 2012 IP
  5. Michael Brown

    Michael Brown Peon

    Messages:
    49
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #5
    If some bots are not taking care of robots.txt, there is no easy way to handle them.
     
    Michael Brown, Apr 11, 2012 IP