Remove weird session ids appended to urls in Yahoo index

Discussion in 'Yahoo' started by orcn, Nov 23, 2007.

  1. #1
    Hi all,
    From Yahoo Site Explorer, I can see that Yahoo has been appending a strange session variable of its own to all our URLs, and is therefore indexing the same content multiple times, each time with a new session ID in the URL. As a result, the page count in the index grows by 1-200 pages at every Yahoo re-index, although no new pages have actually been added.

    The URLs appear in the Yahoo index wrapped in double parentheses, like this:

    www.mydomain.com/(S(djc4ih45uvt2xb2aswlnicm4))
    www.mydomain.com/(S(vs4vwr45ttlffbm4ift5xh45))/contact.aspx etc.
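
To gauge how much of the index is duplicated, the session segment can be stripped from a list of indexed URLs before counting them. The following is a minimal Python sketch, not a definitive tool: the helper names `strip_session_segment` and `count_unique_pages` are hypothetical, and the pattern assumes the ID is purely alphanumeric, as in the two URLs quoted above. The `(S(...))` shape resembles the path segment ASP.NET emits for cookieless session state; whether that is the cause here is an assumption that nothing in this thread confirms.

```python
import re

# One path segment of the form /(S(<id>)), as in the URLs quoted above.
# Assumption: the id is alphanumeric; widen the character class if needed.
SESSION_SEGMENT = re.compile(r"/\(S\([A-Za-z0-9]+\)\)")


def strip_session_segment(url):
    """Return the URL with any /(S(...)) path segment removed."""
    return SESSION_SEGMENT.sub("", url)


def count_unique_pages(urls):
    """Count distinct pages once session segments are ignored."""
    return len({strip_session_segment(u) for u in urls})


if __name__ == "__main__":
    indexed = [
        "www.mydomain.com/(S(djc4ih45uvt2xb2aswlnicm4))",
        "www.mydomain.com/(S(vs4vwr45ttlffbm4ift5xh45))/contact.aspx",
        "www.mydomain.com/contact.aspx",
    ]
    for url in indexed:
        print(strip_session_segment(url))
    print(count_unique_pages(indexed), "unique pages")
```

Comparing the raw URL count with the deduplicated count gives a rough idea of how many index entries are session duplicates.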

    I don't understand where this extra variable in parentheses is coming from, as I am not using any session IDs, tracking IDs, etc. for this site, and the pages are not dynamically generated: they are static pages, although they use the .aspx file type.

    Google has indexed the site correctly, without these sessions in the URLs, so the problem only occurs in Yahoo.

    I have added these rules to robots.txt to see if they work:
    User-Agent: Yahoo! Slurp
    Disallow: /*?sessionid
    Disallow: /*))/
    Disallow: /*))
    Disallow: /*(S(
    Disallow: /*?
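
Before waiting for the next re-index, it can help to check which paths these rules would actually block. The sketch below is a rough Python approximation, assuming the crawler treats `*` as "any run of characters" and a trailing `$` as "end of URL". Those wildcards are an extension to the original robots.txt convention, so the real crawler's behaviour may differ. The function names `rule_to_regex` and `blocking_rules` are hypothetical.

```python
import re

# The Disallow values from the robots.txt rules above.
DISALLOW_RULES = ["/*?sessionid", "/*))/", "/*))", "/*(S(", "/*?"]


def rule_to_regex(rule):
    """Translate a Disallow value with * and $ wildcards into a regex.

    Assumption: * matches any run of characters, a trailing $ anchors
    the end of the path, and everything else is matched literally
    from the start of the path.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


def blocking_rules(path, rules=DISALLOW_RULES):
    """Return the rules that would block the given path."""
    return [r for r in rules if rule_to_regex(r).match(path)]


if __name__ == "__main__":
    for path in [
        "/(S(vs4vwr45ttlffbm4ift5xh45))/contact.aspx",
        "/contact.aspx",
        "/search.aspx?page=2",
    ]:
        print(path, "->", blocking_rules(path))
```

Under these assumptions the session URLs are caught by the `))` and `(S(` rules, the clean `/contact.aspx` is not blocked, and `Disallow: /*?` also blocks every URL that carries a query string, which may be broader than intended. The path `/search.aspx?page=2` is a made-up example.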

    If anyone could give any other advice I'd really appreciate it, as I am not sure what else to try!

    Thanks all!
     
    orcn, Nov 23, 2007
  2. rehash

    #2
    Hmm, maybe someone added a fishy backlink to your site, with a session ID in it?
     
    rehash, Nov 28, 2007
  3. charlesm3

    #3
    I also noticed this on some other sites. Hopefully someone figures this out.
     
    charlesm3, Nov 28, 2007
  4. cache

    #4
    I noticed this as well, but I don't know why it happens.
     
    cache, Nov 28, 2007