De-indexed pages - my findings based on one of my sites

Discussion in 'Search Engine Optimization' started by GADOOD, Jun 9, 2006.

  1. #1
    I have gone through my entire pages and pasted each and every .html in to Google Search to verify exactly which pages have been de-indexed, and which have not.

    My conclusion is that only articles off main sections have been de-indexed.

    Example, here is my site structure:

    FRONTPAGE/

    FRONTPAGE/PRIMARY-CATEGORY/

    FRONTPAGE/PRIMARY-CATEGORY/SECONDARY-CATEGORY/

    FRONTPAGE/PRIMARY-CATEGORY/SECONDARY-CATEGORY/INDEX.HTML

    FRONTPAGE/PRIMARY-CATEGORY/SECONDARY-CATEGORY/INDEX-ARTICLE01.HTML
    FRONTPAGE/PRIMARY-CATEGORY/SECONDARY-CATEGORY/INDEX-ARTICLE02.HTML
    FRONTPAGE/PRIMARY-CATEGORY/SECONDARY-CATEGORY/INDEX-ARTICLE03.HTML

    Now if you follow that, it's the INDEX-ARTICLEXX.HTML's that have been de-indexed but NOT the INDEX.HTML pages which LINK to the INDEX-ARTICLE00 pages.

    Note: All 200 pages were indexed up until a few months ago when Google began changing and actively de-indexing these particlar 'outer-pages', but no others before them in my site structure.

    Perhaps my case is different (it sounds it may be a lot different) to what a lot here are experiencing but it may account for some.

    In my case, I believe it is firmly due to a change that came with Big Daddy that either devalued the value of say directory submission links (and similar types) which made up the bulk of my PR etc or/and due to a newer way in which Google crawl now it simply doesn't index any further and will actually de-index content in the Google DB which doesn't have enough PR or weight distributed to it. Perhaps I'm crazy for thinking that.. or am I? Isn't there issues with space.. I did hear the rumours.. why don't Google show how many sites in its index anymore.. coinsidence?

    I know this doesn't account for FULL sites being de-indexed, but it may account for a lot of the de-indexing going on and some of the knock-on effects which would be huge if they are de-indexing pages from their DB that don't measure up in PR/weight.

    My site is down from 200 pages to 140 because of this, and I'm quite sure if I carry on building quality links eventually Google will re-include the pages it has dropped back in to its index.

    Google have not de-indexed any other type of page on my site apart from the ones furthest out of the circle of PR/weight if you get what I mean and it seems pretty solid what's happened on my site what's happened.

    Perhaps some may find this useful if you can understand my crap example of my sites structure. :rolleyes:

    Pete
     
    GADOOD, Jun 9, 2006 IP
    Smyrl likes this.
  2. I. Brian

    I. Brian Business consultant

    Messages:
    810
    Likes Received:
    59
    Best Answers:
    1
    Trophy Points:
    145
    #2
    You're basically talking about deep content - ie, that Google may not have been able to reach in a crawl.

    Matt Cutts mentioned something before about the new crawl prioritising the pages for indexing, so I wouldn;t be too surprised to find your deeper level pages return to the index as Google fixes its crawling.

    2c.
     
    I. Brian, Jun 9, 2006 IP