MSNBot being too aggressive?

Discussion in 'Bing' started by digitalpoint, Jul 31, 2009.

  1. webhost.uk.net

    webhost.uk.net Well-Known Member

    #21
    Yes, MSNBot really is crawling very fast.
     
    webhost.uk.net, Aug 6, 2009
  2. vagrant

    vagrant Peon

    #22
    When I had similar problems with Yahoo, it turned out the Crawl-delay was being applied per bot, and from looking at my server logs it looks like msnbot works the same way. That is fine when they use only one bot, but not much good when they use 20+ at once.

    You may find you need to multiply the Crawl-delay by the number of bots to get the desired effect.
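
    A rough sketch of that arithmetic, assuming each bot honours Crawl-delay on its own (which is what the logs suggest, not something MSN has confirmed). The function names are made up for illustration, and the 2-second target and 20 bots are only example numbers.

```python
import math


def aggregate_rate(crawl_delay_s: float, bot_count: int) -> float:
    """Pages per second the site sees if every bot waits crawl_delay_s independently."""
    return bot_count / crawl_delay_s


def compensated_delay(target_gap_s: float, bot_count: int) -> int:
    """Crawl-delay to set so the bots as a group average one hit per target_gap_s."""
    return math.ceil(target_gap_s * bot_count)


def robots_stanza(user_agent: str, delay_s: int) -> str:
    """Render the robots.txt lines for one crawler (hypothetical helper)."""
    return f"User-agent: {user_agent}\nCrawl-delay: {delay_s}\n"


if __name__ == "__main__":
    # Example numbers only: 20 bots, and we want roughly one hit every 2 seconds overall.
    print(aggregate_rate(2, 20))
    print(robots_stanza("msnbot", compensated_delay(2, 20)))
```

    The catch is that you have to guess the number of bots, and that number can change from day to day.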
     
    vagrant, Aug 6, 2009
  3. Uncle Sam

    Uncle Sam Peon

    #23
    Yeah, you are right. Their crawlers are badly designed. The worst part is that they do not follow the instructions in robots.txt or the meta robots tag. They simply disobey those instructions, and MS says that they know about this bug and are trying to fix it ASAP. Look here ---> http://www.provenseo.com/2009/08/bing-indexing-noindex-nofollow-content/
     
    Uncle Sam, Aug 7, 2009
  4. digitalpoint

    digitalpoint Overlord of no one Staff

    #24
    Wow... that would be completely moronic if that's the case. So they want to use multiple computers to act as a single process (a search engine spider), yet they haven't figured out how to get their own machines to communicate with each other. {rolls eyes}

    How exactly do they think they are going to make a better search engine than Google? Throwing money at advertising doesn't seem to be fixing any of their core problems so far. Maybe a few billion more in advertising will make the spider work right. Then maybe they can brainstorm about how to make the web pages they have indexed searchable by relevancy. A novel idea. :)
     
    digitalpoint, Aug 8, 2009
  5. vagrant

    vagrant Peon

    #25
    I totally agree it's totally moronic, and should not happen.

    However, when I last looked at the Latest Visitors page in cPanel to see what they were doing (it groups the pages viewed by each IP), I could see msnbot (and Yahoo's bot too) obeying the Crawl-delay within each bot. But with 20 bots on at once, a 2 second delay means that as a group they were visiting 10 pages per second.

    Being on shared hosting, I ended up putting a 60 second Crawl-delay on one of my forums :eek:

    It may well be a bug that they will sort out, but it should be easy enough to see/test if that is what is also happening here.
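
    If you would rather check the raw access log than cPanel, something like this should do it. It is only a sketch under assumptions: Apache combined log format, "msnbot" somewhere in the user-agent string, and one bot per IP. The name bot_gaps is made up.

```python
import re
from collections import defaultdict
from datetime import datetime

# Assumed Apache "combined" format: ip ident user [time] "request" status bytes "referer" "agent"
LINE_RE = re.compile(r'^(\S+) \S+ \S+ \[([^\]]+)\] "[^"]*" \d{3} \S+ "[^"]*" "([^"]*)"')


def bot_gaps(lines, agent_substring="msnbot"):
    """Return (smallest gap in seconds per IP, smallest gap across all matching IPs)."""
    hits = defaultdict(list)
    for line in lines:
        match = LINE_RE.match(line)
        if not match:
            continue  # skip lines that are not in the assumed format
        ip, stamp, agent = match.groups()
        if agent_substring in agent.lower():
            hits[ip].append(datetime.strptime(stamp, "%d/%b/%Y:%H:%M:%S %z"))

    per_ip = {}
    for ip, times in hits.items():
        times.sort()
        gaps = [(b - a).total_seconds() for a, b in zip(times, times[1:])]
        per_ip[ip] = min(gaps) if gaps else None

    # Merge every bot's hits to see how fast the group as a whole is crawling.
    merged = sorted(t for times in hits.values() for t in times)
    overall = [(b - a).total_seconds() for a, b in zip(merged, merged[1:])]
    return per_ip, (min(overall) if overall else None)
```

    If every per-IP gap is at or above your Crawl-delay but the overall gap is much smaller, the delay is being applied per bot rather than across the whole group.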

    vagrant
     
    vagrant, Aug 8, 2009
  6. stylosoft

    stylosoft Peon

    #26
    Yes, we may just have to ignore msnbot. Or maybe msnbot is crawling this much because Bing has just launched and Bing is the latest technology.
     
    stylosoft, Aug 8, 2009
  7. DoDo Me

    DoDo Me Peon

    #27
    Many people want Bing to challenge Google, but they do not want MSNBot crawling their sites :(
     
    DoDo Me, Aug 8, 2009