Its over 6 months and some of the pages still not indexed, what should be done now? Sitemap statistics: Total URLs: 98 Indexed URLs: 82
Can you provide the url? It does take a while to index all the pages. The best thing you could do is make sure you have a good, up to date sitemap AND try to get deep backlinks to the non-indexed pages from sites with relatively high PR (as they tend to be crawled more often). Also, have you made sure the internal linking is works for the spiders? Google doesn't crawl javascript, for instance. Try using a different sitemap generator too. Take a close look at your Google Webmaster Tools for any errors and how the Google spiders sees your site. Lastly, take a look a the Google Sitemap FAQ thread.
Not that I know of. It's a great idea though! Being able to compare the indexed pages with a sitemap would be a great tool.
Just one question. I agree Google doesnot crawl links from Javascript. But if these links are included in Google XML sitemap without javascript code, then also will google not crawl these javascript web pages.
When you try to crawl the url using a sitemap generator, javascript shouldn't be crawled either. Ask on the Google Sitemap FAQ.
I just put together a tool for you, that checks a sitemap against the Google index. After 450 requests in test, they figured me out and blocked my IP/Tool footprint/query/something somehow. Anyway.. I haven't given up just yet. I'll try to mimic a browser even better than just making a httpRequest, but I think it's tricky to fool them. Exspecially if you have to have a high request frequency, like a tool of this needs to have. Will keep you posted. ::m