Yahoo has NOT indexed 8 billion pages...

Discussion in 'Yahoo' started by clickbuild, Sep 20, 2005.

  1. #1
    They actually indexed 2 billion and then made up the urls for the rest of them!

    Why am I saying this? Because I am having the worst time with Yahoo's bots, Slurp and CafeKelsa.

    I had to officially request that CafeKelsa stop hitting my site every 2 seconds with bogus URLs (15k requests in 4 days, and they ignored robots.txt). I am currently "re-educating" Slurp with 301 redirects that point to where my actual site content is, or at least trying to, as I am now finding that Slurp also drops trailing slashes when it feels like it... more frickin bad links.

    Example: I made these up for ease of understanding

    Directory 1: /books/usa/
    Directory 2: /cars/
    Directory 3: /music/

    Slurp searches for /books/carsbooks/usacars/music/carsbooks/usa/cars/music/books/

    I kid you not. 9 concatenated urls to 20 odd levels deep.
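
For anyone curious, the "re-educating" logic can be sketched roughly like this. It is only an illustration under assumptions: `KNOWN_DIRS` and `resolve` are hypothetical names, the directory list is the made-up example above, and the real site may well do this in server config rather than code.

```python
# Hypothetical set of real directories, taken from the made-up example above.
KNOWN_DIRS = {"/books/usa/", "/cars/", "/music/"}


def resolve(path):
    """Return (status, location) for a requested path.

    200: the path is a real directory.
    301: the path is a real directory with its trailing slash dropped.
    404: anything else, such as a concatenated bogus path.
    """
    if path in KNOWN_DIRS:
        return 200, path
    # The crawler sometimes drops the trailing slash: point it at the canonical URL.
    if not path.endswith("/") and path + "/" in KNOWN_DIRS:
        return 301, path + "/"
    # Not real content, so do not redirect it anywhere.
    return 404, None
```

With this, `resolve("/cars")` redirects to `/cars/`, while a concatenated path falls through to a 404 instead of being redirected to something that looks valid.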

    But wait, there's more! I sell ClickBank products - or at least I try to :)
    Now I have discovered that Slurp is making up GET URLs (my best guess so far) :confused:

    Here is an actual URL that Slurp tried to get from my site:
    /cbsearch/link.php?id=2459&link=results&pid=pe-facia

    Explained:
    id: I use this to look up the cb hop from the database and then redirect.
    link: used for tracking, tells me the click came from the results page
    pid: tells me the id of the page.
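
A minimal sketch of that lookup, assuming a plain dictionary in place of the database. The names `HOPS` and `handle_link`, the id `101`, and the example.com target are all hypothetical; the real script is PHP and is not shown here. The point is that an unknown id should get a 404 rather than a redirect.

```python
from urllib.parse import urlsplit, parse_qs

# Hypothetical stand-in for the database table that maps ids to hop links.
HOPS = {"101": "http://example.com/hop-target"}


def handle_link(url, hops=HOPS):
    """Return (status, location) for a link.php style request."""
    query = parse_qs(urlsplit(url).query)
    # parse_qs returns a list per key; take the first value or an empty string.
    link_id = query.get("id", [""])[0]
    target = hops.get(link_id)
    if target is None:
        # Unknown id (for example one a crawler made up): no redirect.
        return 404, None
    return 302, target
```

Calling `handle_link("/cbsearch/link.php?id=2459&link=results&pid=pe-facia")` against this table yields a 404, since 2459 is not a known id.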

    If I look through the results on the page for "pe-facia", NONE of the links have the id 2459. The id 2459 doesn't even exist. Yahoo is making this id number up! :mad:

    So is this me? Have I managed to come up with some ingenious way of making Slurp and the other Yahoo bots magically make up pages and try to crawl them, or is this a known problem and one more reason why Yahoo is slipping in the world of search?

    Thanks - I feel better having vented a little... thoughts, ideas, suggestions welcome.

    Steve
     
    clickbuild, Sep 20, 2005
  2. davesplace1

    #2
    Search engines, you can't live with them and you can't live without them :)
     
    davesplace1, Sep 21, 2005