Indexed pages capacity

Discussion in 'Google' started by seo-ireland, Oct 28, 2005.

  1. #1
    This is something that has been playing on my mind for a while. What determines the indexed pages capacity for your site on Google? I say this because I have a store that has about 16000 product pages but I estimate that only 1/8th of the product pages are actually indexed. I have a XML Sitemap created dynamically for each product category so Google has access to all of the product page URLs along with its traditional spidering methods as well. The site is PR 6 and has many strong backlinks so spider frequency shouldn't be a problem.

    I have identified a lot of pages elsewhere in the site that should not have been indexed and are showing as URL only in SERPS. I have choked the spider's access to them so they should go supplemental and disappear soon. Would this 'free' up indexed pages capacity space?

    Any ideas?
     
    seo-ireland, Oct 28, 2005 IP
  2. digitalpoint

    digitalpoint Overlord of no one Staff

    Messages:
    38,334
    Likes Received:
    2,613
    Best Answers:
    462
    Trophy Points:
    710
    Digital Goods:
    29
    #2
    I think it has to do with how "important" Google thinks your site is. Get more natural links (especially deep links), and I bet Google starts showing more.
     
    digitalpoint, Oct 28, 2005 IP
    elkiwi likes this.
  3. 313i

    313i hummmmmm. No clue

    Messages:
    1,338
    Likes Received:
    73
    Best Answers:
    0
    Trophy Points:
    0
    #3

    "Deep links" being mysite.com/folder/folder/importantpage.htm?
     
    313i, Oct 28, 2005 IP
  4. digitalpoint

    digitalpoint Overlord of no one Staff

    Messages:
    38,334
    Likes Received:
    2,613
    Best Answers:
    462
    Trophy Points:
    710
    Digital Goods:
    29
    #4
    Yep... that's a deep link.
     
    digitalpoint, Oct 28, 2005 IP
  5. elkiwi

    elkiwi Active Member

    Messages:
    536
    Likes Received:
    34
    Best Answers:
    0
    Trophy Points:
    68
    #5
    is having all your pages in the root a bad thing? I wrote my cms system with all the pages in root...no directories...is this bad?
     
    elkiwi, Oct 28, 2005 IP
  6. digitalpoint

    digitalpoint Overlord of no one Staff

    Messages:
    38,334
    Likes Received:
    2,613
    Best Answers:
    462
    Trophy Points:
    710
    Digital Goods:
    29
    #6
    I doubt it would matter.
     
    digitalpoint, Oct 28, 2005 IP
  7. elkiwi

    elkiwi Active Member

    Messages:
    536
    Likes Received:
    34
    Best Answers:
    0
    Trophy Points:
    68
    #7
    I think so too as long as the subject matter is all the same, which for me it is.
     
    elkiwi, Oct 28, 2005 IP
  8. dave487

    dave487 Peon

    Messages:
    701
    Likes Received:
    20
    Best Answers:
    0
    Trophy Points:
    0
    #8
    Make sure your title tags are different for each page, google will then know they are different and will be more likely to index them.
     
    dave487, Oct 29, 2005 IP
  9. markhutch

    markhutch Peon

    Messages:
    357
    Likes Received:
    22
    Best Answers:
    0
    Trophy Points:
    0
    #9
    I think it's better to spend 99% of your time building good, original, content for your website and 1% of your time building links and promoting it.

    Most folks spend 99% of their time promoting their site and 1% building good content. As they say in the movies, "Build it and they will come". That is true in website promotion as well. Build a good site, update it on a regular basis and folks will find your site and link to it. This natural linking is the best way to go. Yes, it will take a long time, but most shortcuts don't work and might do more harm than good.
     
    markhutch, Oct 29, 2005 IP
  10. rehash

    rehash Well-Known Member

    Messages:
    1,502
    Likes Received:
    30
    Best Answers:
    0
    Trophy Points:
    150
    #10
    I think PR6 and orignal content should give you a looot of "capacity"
    But are those 16000 product pages original or they are just amazon/something else feeds?
    If they are not original, is a smart move from google's point of view not to crawl and index them since it only would waste resources.
     
    rehash, Oct 31, 2005 IP
  11. seo-ireland

    seo-ireland Peon

    Messages:
    243
    Likes Received:
    12
    Best Answers:
    0
    Trophy Points:
    0
    #11
    hi rehash,

    The 16000 product pages are from a feed so they in themselves contain unoriginal content. The site however is not just a store, it is a community containing forums and blogs (see my sig), which is a lot of original content. Googlebot swallowed up 2.4GB of my bandwidth in August which was amazing, in September it was down to 500MB. Consequently indexed pages started to become supplemental and new pages where not being indexed.

    I realise now that it is probably not an indexed pages capacity problem I have, more of a spider frequency problem. I still can't put my finger on why Googlebot visits dropped so dramtically from 2.4GB to 500MB. It must have been a penalty of some sorts. Quite possibly it was a datafeed penalty but this peeves me somewhat as the products are relevant to the subject matter (i am biased however :)).

    Any ideas about the drop?
     
    seo-ireland, Oct 31, 2005 IP