Hello every one, I am doing seo for a web site which is having around 70,000 dynamic pages. I recently found some errors in sitemap. As when i went to Diagnostics under web crawl following errors are coming Errors for URLs in Sitemaps 155 Not found 88 Unreachable URLs 373 And i also found that on 30th July my web site home page was giving 404 (Not found) as per google sitemap. But that day full time my web site was running fine. I don’t know why google sitemap have given error. And secondly how can i find the broken links on server as suggested by google sitemap. Because i tried to find the files in server but was unable to find. Please guys help me out!!!!! Thanks, Sunil
You can try use the 30 days free evaluation of A1 Sitemap Generator. It will show you all URLs and from where they are linked. That makes it easy to find what pages link to e.g. a "404 - not found" URL. See Solve broken links section.
Yes, you might not have this URL, but it is probably displayed because the spider is trying to reach a page with a php error. The result of the crawl is the name of the function where the error occurs (mysql-connect in this case). The error, result of the dynamic page generation might look like this: Warning: mysql_connect() [function.mysql-connect]: Unknown MySQL server host 'localhostdd' (11001) in W:\www\Webrick\buildsitemap_ctd.php on line 119 Database access denied! To make this happened, i intentionnaly misspelled the name of the connection host in the php script. and the sitemap generator will recognize the error generate as an URL to follow, however this URL should not be followed. Try another sitemap generator like for instance http://www.webrickco.com/buildsitemap.php. It will display a list of URL from which you can choose before you build the sitemap. This link should appear in yellow and the url generating the error should be identified as well.