indexing large sitemap failure!!??

Discussion in 'Google' started by WRocker, Jul 8, 2009.

  1. #1
    Hi

    few time ago, google has changed the layout of their webmaster tools. there where my problems started happening.

    i have a warez site and a sitemap as big as nearly 5MB.
    before google renewd their webmaster tools page .. i was able to successfully submit my sitemap and google could index it .. now i can't anymore! when i try to resubmit the site map, i get the following error:

    URL timeout: HTTP request timeout
    We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit.

    first i thought it's a problem with my webhosting company ( since i have moved to new one at the same time google renewed their site ) . i told them to verify whether anything is preventing google from accessing the sitemap. they replied me that everything is ok with them and nothing is blocking google.

    at that point i made a help topic on google webmasters forum. and i got an answer from a man called Phil Payne telling me to try to submit a sitemap with only one URL included. i tried what he suggested and the sitemap including this single URL worked fine.

    i got back to him and told him that it worked. his answer was the following :
    well my problem is that the script i use to create the sitemap indexes the whoole website .. it doesn't index only one section or one category and so splitting the sitemap into multiple sitemaps is not an option i have.

    the only solution i see ( from my point of view ) is to find a way to make google reindex the sitemap successfully .. but i don't see how to do that.

    because before google renewed their website ... the sitemap was as large as it is now ... maybe now it got a little bigger. but it doesn't mean google can't index it anymore!!!

    am asking for help and suggestions here. is there anyway i could make google reindex my sitemap successfully again .. or what shall i do ?

    help guys!!
     
    WRocker, Jul 8, 2009 IP
  2. brian65

    brian65 Active Member

    Messages:
    1,172
    Likes Received:
    14
    Best Answers:
    0
    Trophy Points:
    88
    #2
    I had a similar "big" sitemap problem where I had 80,000 pages to index. Luckily all my listings had a unique number in my database. So I generated one sitemap for listings 1 to 40,000 and another for 40,001 upwards. Then generated a sitemap index file. Can you adopt a similar strategy?
     
    brian65, Jul 8, 2009 IP
  3. bermuda

    bermuda Peon

    Messages:
    868
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    0
    #3
    Sometimes the errors could be related to the stability issues of the hosting service providers. You may consider changing your current company with a better one.

    Also, try using the GZ format for the sitemap instead of the XML as sometimes it helps, like when you look at how Blogs are having sitemaps.
     
    bermuda, Jul 8, 2009 IP
  4. WRocker

    WRocker Active Member

    Messages:
    32
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    93
    #4
    Hi

    thanks alot for your answer.

    what i can do is splitting the current sitemap i have into 2 or 3 chunks .. but the problem is with all the links that will be submitted later .. because everytime i generate the sitemap ... it will index the whole site .. and so i will not be able to figure out where i left off .. plus i update my sitemap once or twice a day .. so it will be a big waste of time to split sitemaps everytime i wanna update!
     
    WRocker, Jul 8, 2009 IP
  5. WRocker

    WRocker Active Member

    Messages:
    32
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    93
    #5
    Hi bermuda ... i believe my hosting provider is quite stable .. and how about changing to GZ ? am a newbie actually .. does the the GZ format have the same structure of XML ?
     
    WRocker, Jul 8, 2009 IP
  6. WRocker

    WRocker Active Member

    Messages:
    32
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    93
    #6
    out of a sudden google is back to successfully index my links .. obviously they're encountering some sorta problem with their new indexing system !
     
    WRocker, Jul 16, 2009 IP
  7. Abhik

    Abhik ..:: The ONE ::..

    Messages:
    11,337
    Likes Received:
    606
    Best Answers:
    0
    Trophy Points:
    410
    Digital Goods:
    2
    #7
    Abhik, Jul 16, 2009 IP
  8. Grobbulus

    Grobbulus Peon

    Messages:
    557
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    0
    #8
    Agree 100% with the above and try using the GZ format.
     
    Grobbulus, Jul 16, 2009 IP
  9. onlinewebcreater

    onlinewebcreater Peon

    Messages:
    65
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #9
    There are so may tools available in internet market to develop sitemap but google take up to 40000 webpage url or 10 MB in size in this case you need to create master xml file to integrate these two or more sitemap. Create sitemap manually or by software it doesn't mean.
     
    onlinewebcreater, Jul 16, 2009 IP
  10. ~kev~

    ~kev~ Well-Known Member

    Messages:
    2,866
    Likes Received:
    194
    Best Answers:
    0
    Trophy Points:
    110
    #10
    5mb is not a large sitemap. And its not the "size" of the sitemap, its how many urls is in it. A sitemap can have no more then 50,000 urls. Anymore then that, and the sitemap has to be broken into parts.

    If google can not index the sitemap, try going to the sitemap url in the browser. If the file can not be accessed, something has changed. Such as the htaaccess file, or the folder permissions.
     
    ~kev~, Jul 16, 2009 IP