We have a huge site with over 23 million page and all dynamically driven pages. We are in search of a Google XML sitemap generator which can generate over 23 million pages site map as well automated one. Automated in the sense once new page is updated in our database it should take inside sitemap automatically and once its been removed from our DB the same page link should get removed from sitemap. Hope to get the solution from Webmasters here.
Do you use any standard CMS platform? Otherwise, you will need something custom that can read your database then can then either import into a sitemap generator or generate the sitemap. Alternatively you can have a sitemapper tool crawl all 23 million URLs which will probably be quite a challenge. Not only time-wise (!), but also file/memory-wise if you also want the sitemapper to include all extended information like internal linking, titles etc.
23 million pages is a too big website, I think you dont need to create a XML sitemap if your website is that big.