What happens to URLs missing from the sitemap?

Discussion in 'Google Sitemaps' started by monosodium, Jun 15, 2007.

  1. #1
    Does anyone have reliable information about how Google handles URLs that are not included in the sitemap?

    Does Google stop crawling them?
    Does Google treat them differently?
    Or is there no change?

    I haven't been able to find any reliable info on whether a sitemap is required to be comprehensive. I want to generate a set of sitemaps covering all 300,000 pages of a site, but the dynamic URLs are a pain to generate and I would like to leave them out, UNLESS that means they will be disregarded or de-prioritized.
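
    For a sense of what "a set of sitemaps" for 300,000 pages involves, here is a minimal sketch in Python. It assumes the sitemaps.org limit of 50,000 URLs per file; the file names, the example.com base URL and the helper names are made up for illustration and are not anything Google requires.

```python
from xml.sax.saxutils import escape

MAX_URLS_PER_SITEMAP = 50_000  # per-file URL limit in the sitemaps.org protocol
NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
XML_HEADER = '<?xml version="1.0" encoding="UTF-8"?>\n'


def chunk(urls, size=MAX_URLS_PER_SITEMAP):
    """Split the URL list into consecutive slices of at most `size` items."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_sitemap(urls):
    """Return the XML text of one sitemap file listing `urls`."""
    # escape() turns the "&" of dynamic URLs into "&amp;", as XML requires.
    entries = "".join(f"  <url><loc>{escape(u)}</loc></url>\n" for u in urls)
    return f'{XML_HEADER}<urlset xmlns="{NS}">\n{entries}</urlset>\n'


def build_index(sitemap_urls):
    """Return the XML text of a sitemap index pointing at each sitemap file."""
    entries = "".join(
        f"  <sitemap><loc>{escape(u)}</loc></sitemap>\n" for u in sitemap_urls
    )
    return f'{XML_HEADER}<sitemapindex xmlns="{NS}">\n{entries}</sitemapindex>\n'


def build_all(urls, base="https://www.example.com/"):
    """Return {filename: xml_text} for every sitemap file plus the index.

    `base` and the file naming scheme are hypothetical.
    """
    files = {}
    for n, part in enumerate(chunk(urls), start=1):
        files[f"sitemap-{n}.xml"] = build_sitemap(part)
    sitemap_names = list(files)  # only the per-chunk files at this point
    files["sitemap-index.xml"] = build_index([base + name for name in sitemap_names])
    return files
```

    With 300,000 URLs this yields six sitemap files and one index file. Dynamic URLs can go in as they are, as long as the "&" is escaped.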
     
    monosodium, Jun 15, 2007 IP
  2. trichnosis

    #2
    Nothing happens. Some pages are missing from one of my sitemaps. Google can still visit and find those pages, but it took some time before Google found them.
     
    trichnosis, Jun 15, 2007 IP
  3. monosodium

    #3
    I see. Thank you for the information.
     
    monosodium, Jun 16, 2007 IP
  4. Snoozy

    #4
    For the dynamic URLs, you could have a look at URL rewriting. Then you will be able to add them to your sitemap.

    PS: 300,000 pages... wow! What is the subject of your site?
     
    Snoozy, Jun 16, 2007 IP
  5. monosodium

    #5
    Yeah... you're right. I did start work on this. It's an engineering nightmare. I guess it's better to do it, though, than to leave URLs out. This is the third time I am re-engineering my database. I hope the sitemaps are worth it!

    Real estate. But my site is actually way smaller than those of my competitors.

    It's not just this sector. Nowadays you have, say, international flower delivery services generating a page, or even several pages, for every single street they deliver to. That's easily millions of pages. As the competition heats up, you'll need a million pages just to stay in the running. It's insane.
     
    monosodium, Jun 17, 2007 IP
  6. Snoozy

    #6
    Good luck for the fourth time you re-engineer your database!
     
    Snoozy, Jun 18, 2007 IP
  7. Lemezo

    #7
    Yeah, it seems the URLs not included in the sitemap get treated as if you hadn't sent any sitemap for them.
     
    Lemezo, Jun 18, 2007 IP
  8. monosodium

    #8
    Well, after a lot of unhealthy food and unsociable behaviour, I managed to complete my sitemaps... Google just told me the status is OK and all URLs have been accepted.

    I left a few URLs out of the sitemap on purpose, and I'll compare how they do in the SERPs.

    I will be posting results here in a few weeks' time.
     
    monosodium, Jun 20, 2007 IP
  9. Snoozy

    #9
    What does SERP mean, please?
     
    Snoozy, Jun 24, 2007 IP
  10. SFOD_D223

    #10
    Search Engine Ranking Pages?
     
    SFOD_D223, Jun 24, 2007 IP
  11. monosodium

    #11
    Search engine results pages. It's a very common term here.
     
    monosodium, Jun 25, 2007 IP
  12. Jaxta

    #12
    Why are you re-engineering your database for the sake of dynamic URLs? Just use ISAPI (if using IIS) or mod_rewrite.

    You simply need to replace "?" with "/" or similar.

    Also make sure you don't use client-side sessions or variable/page names such as PHPSESSID. Anything that makes a spider think the page may use sessions is a MASSIVE turn-off. Spiders obviously can't handle client-side sessions, so any suggestion that the page may use them will result in the spider ignoring the page.
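
    To make the "?" to "/" idea concrete, here is a rough sketch of the mapping in Python. It only illustrates the URL transformation, not an actual ISAPI or mod_rewrite rule; the /listing path, the parameter names and the list of session-style names are assumptions made up for the example.

```python
from urllib.parse import parse_qsl, quote, unquote, urlsplit

# Hypothetical list of session-style parameter names to keep out of public URLs.
SESSION_PARAMS = {"phpsessid", "sessionid", "sid"}


def to_static_path(dynamic_url):
    """Turn '/listing?city=austin&id=42' into '/listing/city/austin/id/42'."""
    parts = urlsplit(dynamic_url)
    segments = [parts.path.rstrip("/")]
    for key, value in parse_qsl(parts.query):
        if key.lower() in SESSION_PARAMS:
            continue  # leave session identifiers out of the crawlable URL
        segments += [quote(key, safe=""), quote(value, safe="")]
    return "/".join(segments)


def to_query(static_path, script="/listing"):
    """Inverse mapping: roughly what a server-side rewrite rule would do."""
    rest = static_path[len(script):].strip("/").split("/")
    pairs = zip(rest[0::2], rest[1::2])  # (key, value), (key, value), ...
    return script + "?" + "&".join(f"{unquote(k)}={unquote(v)}" for k, v in pairs)
```

    The first function produces the URL you would publish and list in the sitemap; the second shows what the server has to undo so the existing dynamic page still receives its parameters. No database change is involved in either direction.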
     
    Jaxta, Jun 26, 2007 IP
  13. Snoozy

    #13
    thank you for the definition. I didn't know it.
     
    Snoozy, Jun 26, 2007 IP
  14. Snoozy

    #14
    Same. Thank you. Now I'll better understand the posts that talk about SERPs.
     
    Snoozy, Jun 26, 2007 IP
  15. monosodium

    #15
    Thank you for your advice! Most of what you say is gibberish to me, but I think you are talking about PHP. I don't use that... Anyway, my site is doing great right now. Spending one week setting up a system for getting everything into sitemaps was definitely worth it. :)

    As for the URLs missing from the sitemap, they are still in the SERPs and in the index. No change yet. Early days still.
     
    monosodium, Jun 27, 2007 IP
  16. Aragorn

    #16
    A Google sitemap helps the Google crawler reach pages that are otherwise inaccessible, meaning stray pages that are not linked from any other page. It can also tell the crawler how often those pages may change. It is just information for the crawler. If you omit a page, Google will treat it like any other page. The only difference is that if the page is not linked from any other page, Google has no means of finding it.
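
    One way to see which pages would depend entirely on the sitemap is to walk the site's internal links from the home page and list whatever is never reached. A minimal sketch in Python, assuming you already have the link graph as a dictionary; the function names and the "/" start page are hypothetical.

```python
from collections import deque


def reachable(start, links):
    """Return every page a crawler could reach by following links from `start`.

    `links` maps each page to the list of pages it links to.
    """
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for target in links.get(page, ()):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return seen


def sitemap_only_pages(all_pages, links, start="/"):
    """Pages that cannot be found by following links, only via a sitemap."""
    return sorted(set(all_pages) - reachable(start, links))
```

    Any page this returns is a stray page in the sense above: leave it out of the sitemap and a crawler following links from the home page never gets to it.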
     
    Aragorn, Jul 2, 2007 IP