Found errors in sitemap and how to rectify it

Discussion in 'Google Sitemaps' started by virtualsunil, Aug 4, 2008.

  1. #1
    Hello every one,

    I am doing seo for a web site which is having around 70,000 dynamic pages. I recently found some errors in sitemap. As when i went to Diagnostics under web crawl following errors are coming

    Errors for URLs in Sitemaps 155
    Not found 88
    Unreachable URLs 373

    And i also found that on 30th July my web site home page was giving 404 (Not found) as per google sitemap. But that day full time my web site was running fine. I don’t know why google sitemap have given error. And secondly how can i find the broken links on server as suggested by google sitemap. Because i tried to find the files in server but was unable to find.

    Please guys help me out!!!!!

    Thanks,
    Sunil
     
    virtualsunil, Aug 4, 2008 IP
  2. websitetools

    websitetools Well-Known Member

    Messages:
    1,513
    Likes Received:
    25
    Best Answers:
    4
    Trophy Points:
    170
    #2
    You can try use the 30 days free evaluation of A1 Sitemap Generator. It will show you all URLs and from where they are linked. That makes it easy to find what pages link to e.g. a "404 - not found" URL. See Solve broken links section.
     
    websitetools, Aug 4, 2008 IP
  3. rockarolla

    rockarolla Peon

    Messages:
    14
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #3
    I've got an error:

    I don't have such URL in my web so it seems like a bug in the crawler ...
     
    rockarolla, Aug 9, 2008 IP
  4. webrickco

    webrickco Active Member

    Messages:
    268
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    60
    #4
    Yes, you might not have this URL, but it is probably displayed because the spider is trying to reach a page with a php error. The result of the crawl is the name of the function where the error occurs (mysql-connect in this case).

    The error, result of the dynamic page generation might look like this:
    Warning: mysql_connect() [function.mysql-connect]: Unknown MySQL server host 'localhostdd' (11001) in W:\www\Webrick\buildsitemap_ctd.php on line 119
    Database access denied!

    To make this happened, i intentionnaly misspelled the name of the connection host in the php script. and the sitemap generator will recognize the error generate as an URL to follow, however this URL should not be followed.

    Try another sitemap generator like for instance http://www.webrickco.com/buildsitemap.php. It will display a list of URL from which you can choose before you build the sitemap. This link should appear in yellow and the url generating the error should be identified as well.
     
    webrickco, Aug 10, 2008 IP
  5. samia

    samia Peon

    Messages:
    331
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Problem can be in your server !
     
    samia, Aug 11, 2008 IP
  6. catanich

    catanich Peon

    Messages:
    1,921
    Likes Received:
    40
    Best Answers:
    0
    Trophy Points:
    0
    #6
    We use gSiteCrawler to generate our Sitemaps. It has a section in it for broken links.
     
    catanich, Aug 21, 2008 IP