denial-of-service attack by MSNBot! :-) Help!

Discussion in 'Bing' started by bonzay, Nov 28, 2007.

  1. #1
    In the last few days our website literally was flooded with thousands and thousands of page views by MSNBot! This crazy little bot indexes the home page every few seconds though it never changed in the last few days!

    All the other spiders crawl the site as they should. MSNBot is the only one indexing like crazy.

    At MSN Live Search Help I read, that the little biest can actually be tamed by setting the crawl-delay in the robots.txt. Like:
    ...which sets the bot's crawl speed to a single page every 180 seconds.
    Did anyone ever try that? Does it work? What else can we do in order to prevent MSNBot from running amok?
     
    bonzay, Nov 28, 2007 IP
    Anonymously likes this.
  2. iaria

    iaria Peon

    Messages:
    202
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #2
    are you sure it was the official msnbot and not someone with fake useragent?
     
    iaria, Dec 1, 2007 IP
    Anonymously likes this.
  3. proprod

    proprod Active Member

    Messages:
    216
    Likes Received:
    11
    Best Answers:
    0
    Trophy Points:
    90
    #3
    Yes, crawl-delay works ... googlebot of course, ignores it. Why should google conform, right?

    180 seems a bit high, try 30 or 60 and see how that works.
     
    proprod, Dec 1, 2007 IP
    Anonymously likes this.
  4. Snout

    Snout Peon

    Messages:
    238
    Likes Received:
    10
    Best Answers:
    0
    Trophy Points:
    0
    #4
    MSNBot will require a lot of resources, its unlikely their admins would allow such a load on their own systems. Did you verify the IP address? User agent means nothing, it can be (and being) faked.
     
    Snout, Dec 1, 2007 IP
    Anonymously likes this.
  5. bonzay

    bonzay Peon

    Messages:
    54
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    0
    #5
    No, I didn't verify the IPs.
    Looks like msnbot follows the crawl-delay rule. Now we have a new customer: the baiduspider... :) *sigh
     
    bonzay, Dec 2, 2007 IP