1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

bad robot list for robots.txt

Discussion in 'Apache' started by debunked, Jun 7, 2004.

  1. zzb

    zzb Peon

    Messages:
    82
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    0
    #21
    Here is a technique that has been know to work pretty well. Takes a bit to set up but it TRAPS bots that do not respect or check for the robots.txt file.

    If this is done properly you should not need to have a ridiculously long robots.txt file.

    http://danielwebb.us/software/bot-trap/


    Good post folks.... there is nothing more frustrating to see a robot on your site sucking up bandwidth and find out it was created in some college computer science course as a tutorial !!

    -- ZZ
     
    zzb, Sep 3, 2007 IP
  2. casitecenter

    casitecenter Peon

    Messages:
    4
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #22
    yes bot-trap is interesting one and i liked the Idea

    But only the las FAQ is a little bad

    I'm going to install it and let you know if it work fine or not.But DID any one test it, already?
     
    casitecenter, May 15, 2008 IP
  3. Will.Spencer

    Will.Spencer NetBuilder

    Messages:
    14,789
    Likes Received:
    1,040
    Best Answers:
    0
    Trophy Points:
    375
    #23
    I am using bot-trap. It works well, but you have to be careful with it.

    Add this near the top of your .htaccess file:
    
    Allow from 127.0.0.1
    Allow from 65.55                # MSN
    Allow from 66.249               # Google
    Allow from 67.195               # Yahoo!
    Allow from 72.30                # Yahoo!
    Allow from 74.6                 # Yahoo!
    Allow from 122.152.129.15       # Baidu
    
    Code (markup):
    This will keep you from banning IP addresses that you really don't want to ban.
     
    Will.Spencer, May 16, 2008 IP