does google ban people if you have very large sitemap? What is the max url on can have in sitemap? thanks
I think it's something ridiculous like 100,000 links Anyway, they certainly won't ban you if you have a sitemap that is too large.
how about break up the map into seperate pages. or check out google recommendations.. http://www.google.com/support/webmasters/bin/answer.py?answer=35769
Each sitemap file can have 10.000 links. You can have a sitemap index file that refers to 1000 of these... See this article for more information about all kinds of sitemaps.
If you have a log of links then google recommends you to cut it down to multi site maps and have a sitemap index file.
of course , google will not ban you if you have large sitemap. sitemap is only a list of your web pages.
I read this too, but I have some sitemaps that have upwards of 40,000 links in them and they index fine, google reports all urls submitted in the webmaster tool
Also, from the article you linked to: Information about XML sitemaps protocol: * Each XML sitemap file can contain max 50.000 urls and be 10 mb in size. * It is possible to link 1000 XML sitemaps using a sitemap index file. * You can read our article about page priorities in XML sitemaps. * XML sitemap files and sitemap index files have to be stored as UTF-8 documents.
Of course they wont ban you. The max that will happen is that they might not be able to index the whole of it if it has too many links.
Ram2007, as you said people having large number of links should break up sitemap in different files and maintain a sitemap index file, could you please tell how can the people having blog on blogger (blogspot) do this in case they have large number of pages for their blog?
Google, Yahoo and MSN will not ban anyone or any site for having to many pages. Google has stated the following recommendations: * Each XML sitemap file can contain max 50.000 urls and be 10 mb in size. * It is possible to link 1000 XML sitemaps using a sitemap index file. If you use the Site Map tool from http://gsitecrawler.com/ , it will take care of everthing you need.