I an auto site that pull youtube videos.... I do a sitemap generation, and get over 10+K url on the sitemap...... Would there be any penalty or get banned if i submit this sitemap to G?
Regarding the 10k number. You can have up to 50k pages in a single XML sitemap file. You may want to check my website sitemaps comparison article.
A sitemap with that many links in it would be a huge file, the bot may not be allowed to us that much bandwidth. you might want to break it down some just incase.
Hi, You mean, after I generate the sitemap, I edit it and cut the links to 2500, then submit to Google? Is that any sitemap generator that we can set the number of max link to generate? thanks
I split my sitemap after 49k links and gzip them. No problems here Don't forget, if you splitt link to your sitemap index file, otherwise google wont find additional sitemaps.
I have several sitemaps with about 48k links, and no errors or whatever. Just make sure you just use url, and remove dates, or whatever from sitemap. If possible try gzip.
some more info needed: 1. why are mentioned the magic figure 48K or 48K, is that a max number? 2. who to remove dates? 3. for gzip files, is that sitemap.gz? thanks
I guess there are more, but A1 Sitemap Generator can. Anyways, the official page limit per xml sitemap file (see http://www.sitemaps.org) is 50.000 pages. (or 10 mb file size - whatever comes first)
The best one I found is http://gsitecrawler.com/ and it has no upper limit. But watch the file size (10M is the limit) but I don't know of anyone that has tested this.
having too many urls will not be a problem for google. there is only one criteria , you must limit your per sitemap with 10M. if you have more urls , you can create more than one sitemap
How many links should it be on a sitemap as maximum? Sorry for that but I do not know what (k) means? Is it thousend or hundred?
50k = 50 thousand, i.e. each xml sitemap file max 50.000. (Or file size 10 MB) But you can have multiple sitemaps listed/referenced in a sitemap index file.
its alway recommended to keep the sitemap file less than a 1000 links if you want it crawled fully. There is a Time frame for execution of each file and if your file is going to take too much of time to download, the chances are that your files are not going to be crawled. So limit your entries and increase your results.
1000 links seems too small.................. that means only have 1000 pages indexed by Googel at the best, right?
I have tested it on a site with around 750,000 pages. Gsitecrawler handles it just fine. It splits the sitemap for you based on size/number of links.
yes, it can handle unlimited pages, but it generate many "useless" pages... say, my game site only have 2000 games, so I am expecting it generate 2K url, but it goes up to 15K............... most of the "useless pages" are, for example, ......... 10 more url on the same "content" about "rating 1" "rating 2"... or so......... This becuase, each game could be rated by users....... so, every rating option becomes a url in sitemap..... Anyone knows how to get rid of those annoying pages......... ?