Yahoo's spider crawls unlinked directories

Discussion in 'Yahoo' started by noises, Feb 6, 2007.

  1. #1
    Hello, this is my first post. I hope this is the right section.

    I've got a problem with an Italian website: Yahoo sends me only about 5% of the traffic that Google sends. No good.

    So, analyzing the logs, I found that Yahoo Slurp tries to crawl directories even if they're not linked. For example, given a page /somedir/somepage, the spider tries to crawl /somedir/ even though there are no links to /somedir/.
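
    To check the same thing in your own logs, a Python sketch along these lines could list the directory-style URLs that Slurp requested. It assumes Apache's combined log format and that the crawler's user-agent string contains "Slurp"; the function name, IP addresses and sample lines are made up for illustration.

```python
import re
from collections import Counter

# Assumed combined log format:
# host - - [date] "METHOD path PROTO" status size "referer" "user-agent"
LOG_LINE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)

SLURP_UA = "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"
GOOGLE_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def slurp_directory_hits(lines):
    """Count Slurp requests whose path ends in '/' (the site root is ignored)."""
    hits = Counter()
    for line in lines:
        m = LOG_LINE.search(line)
        if not m:
            continue  # line does not match the assumed format
        path = m.group("path")
        if "slurp" in m.group("agent").lower() and path.endswith("/") and path != "/":
            hits[(path, m.group("status"))] += 1
    return hits


# Hypothetical sample lines, not taken from a real log.
sample = [
    f'192.0.2.1 - - [06/Feb/2007:10:00:00 +0100] "GET /somedir/ HTTP/1.0" 200 512 "-" "{SLURP_UA}"',
    f'192.0.2.1 - - [06/Feb/2007:10:00:05 +0100] "GET /somedir/somepage HTTP/1.0" 200 9000 "-" "{SLURP_UA}"',
    f'192.0.2.2 - - [06/Feb/2007:10:01:00 +0100] "GET /somedir/ HTTP/1.1" 200 512 "-" "{GOOGLE_UA}"',
]

for (path, status), count in slurp_directory_hits(sample).items():
    print(path, status, count)
```

    The status column is the useful part: a 200 on /somedir/ means the spider received the empty page rather than an error or a redirect.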

    Somedir doesn't actually exist because I use .htaccess, so the spider gets a wrong page with no content and <meta name="robots" content="noindex,nofollow">.

    Can this be the reason why I get poor rankings on Yahoo?

    Thanks to everyone who shares their knowledge ;)
     
    noises, Feb 6, 2007
  2. Cryogenius

    #2
    Yes, I've noticed Slurp doing that on my website too. I suggest you set up some RewriteRules in your .htaccess to serve a 404 or a 301 in those cases.
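
    For what it's worth, the rules could look roughly like the lines this small Python helper prints. It is only a sketch: htaccess_rules is a made-up name, the paths are the placeholder ones from the first post, and it assumes mod_rewrite is enabled and Apache 2.2 or later (older versions only accept 3xx codes in the R flag).

```python
import re


def htaccess_rules(directory, redirect_to=None):
    """Build candidate .htaccess lines for a bare directory URL such as /somedir/.

    With redirect_to set, the rule answers with a 301 to that page;
    otherwise it answers with a 404.
    """
    # In .htaccess context the pattern is matched without the leading slash.
    name = re.escape(directory.strip("/"))
    pattern = f"^{name}/?$"
    if redirect_to:
        rule = f"RewriteRule {pattern} {redirect_to} [R=301,L]"
    else:
        # "-" means no substitution; only the status code is sent.
        rule = f"RewriteRule {pattern} - [R=404,L]"
    return ["RewriteEngine On", rule]


print("\n".join(htaccess_rules("/somedir/", "/somedir/somepage")))
print("\n".join(htaccess_rules("/somedir/")))
```

    Whichever form you pick, it would probably need to sit above the existing rules that handle /somedir/somepage, so that the bare directory is caught first.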

    I don't think this affects your rankings on Yahoo, though it is possible that they are looking for a good directory structure. You could experiment and add a valid page for /somedir/ so that the bot finds something to parse...

    Cryo.
     
    Cryogenius, Feb 6, 2007
  3. noises

    #3
    Thank you, you've helped me. I was wondering whether there was some unwanted link to /somedir/ on my website, but if you've noticed Slurp doing that on your website too, then it's fine.

    Now I'm trying to 301-redirect /somedir/ to some valid page so that the bot finds something to parse ;)
     
    noises, Feb 8, 2007
  4. hhheng

    #4
    Yahoo will sometimes ignore the nofollow directive, so you'd better use robots.txt if you don't want Yahoo's spider to crawl and index a page.
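
    A quick way to sanity-check such a robots.txt before uploading it is Python's standard urllib.robotparser. The sketch below uses a hypothetical two-line file and the placeholder paths from the first post; it assumes the crawler identifies itself with a token containing "Slurp". Note that Disallow matches by prefix, so blocking /somedir/ also blocks every page underneath it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keep Slurp out of /somedir/, leave other bots alone.
ROBOTS_TXT = """\
User-agent: Slurp
Disallow: /somedir/
"""


def can_fetch(agent, path, robots_txt=ROBOTS_TXT):
    """Return True if the given user agent may fetch path under robots_txt."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


for agent, path in [
    ("Slurp", "/somedir/"),
    ("Slurp", "/somedir/somepage"),  # also blocked: Disallow is a prefix match
    ("Googlebot", "/somedir/"),
]:
    print(agent, path, can_fetch(agent, path))
```

    This only tells you how a standards-following parser reads the file; it says nothing about how Slurp actually behaves.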
     
    hhheng, Feb 9, 2007
  5. oseymour

    #5
    For the last few months Slurp has been wreaking havoc on my site... I just let it be.
     
    oseymour, Feb 10, 2007