Hello friends, l used a sitemap builder software that is created with this company, it took lots of time to creat a sitemap of my article site ( 3 day nonstop ). But when the site sitemap creation finished, l upluoaded all 4 types of sitemap formats to my server and after l added them to google sitemap, it gawe lots of errors, first of all html sitemap fize size limit, its ok l split it less than 10 mb for each, its ok to spilt hml or txt formats into pieces. but the most ennoying thing is xml problem l am living. the softwares xml sitemap creature is like this <url><loc>http://www.articles4.co.cc/article100798.html</loc></url> <url><loc>http://www.articles4.co.cc/index.php?pg=2&page=category&category_id=179</loc></url> Code (markup): in this lines, l learn google doesnt accept links like this <url><loc>http://www.articles4.co.cc/index.php?pg=2&page=category&category_id=179</loc></url> Code (markup): l dont know why. l have a bunch of lines like this around a half of my sitemap (45000 pages) the most ennoying thing is these lines are devided into other links, l should manually find them and delete. my request if there are any coder can help me how to solve this problem? l mean what should l do to find and remove these lines from the file or is there any better sitemap creator ( free ) that converts more than 100 000 pages without any error?
l am greatfull to you for this information, l wouldnt think sitemaps cant imagine this. u buy or download a sitemap generator, it creates 100 000 of indexed pages but than u have to convert some symbols manually urself, its idiot but thanks for informing me.
Many sitemapper tools including A1 Sitemap Generator handle this correctly... So I think you have just been unlucky Note: You may want to report the bug/issue to the author of the sitemapper tool that generated the incorrect sitemaps.
How big are the sitemap files in terms of pages? Is it possibly that the sitemaps themselves are too large and that is what is causing Google to tip over?
Definitely sounds like a sitemap size issue here. Check that out that it perhaps is not causing the submission process to time out by accident
Try to do everything piece by piece, little by little and adding to it, instead of trying to do it all at once. Or try Thomas' suggestion of A1. Hope this helps. Take care.
thomas l also tried ur site map, its very slow and it takes months to index 100 000 pages. l used wonderwebware which is much more faster and indexes from 10 simistinouse connections which is really faster aa yes, google accepts sitemaps less tha 10 mb, so files created more than 10 u have to devide them, this sitemap maker do it as well but it doesnt convert some symbols well, l did pal, at least l used txt ( yahoo based sitemap ) in google cos l couldnt manage to convert all xml files cos they always giwe problem. but a good sitemap creater really needed in sector.
People often tell me A1SG is far faster than anything else. The only exception I have encountered in some cases is with websites/webservers that dislike HEAD requests before GET requests. Most crawlers (and all browsers etc.) just use GET, but A1SG defaults to HEAD followed by GET request. Generally speaking, it is quite possible to achieve around 20.000 URLs / hour with a little optimization and standard computer. (But many factors play in so can not make promises... Can't help it if e.g. website/webserver/DB is overloaded/slow etc... But I have been scanning much higher numbers of URLs/hour on some sites) ... If A1SG (current version) is underperforming compared to any other desktop sitemapper tool, I am very interested in knowing the website address. (Goes for all) If A1SG seems very slow, it fits with the GET settings issue. If so, the fix is: Scan website | Crawler Engine | Advanced Engine Settings: Enable/tick: Default to GET for page requests There's of course also the chance you have discovered a different issue... Can I assume the website is http://www.articles4.co.cc? And can I run a quick test against it?
I use the sitemap generator below, it has some good options for smaller websites up to 500 pages, you can save it in different formats as well, also has a editor so you can quickly remove certain pages or folders that you don't want in your sitemap. You can index more than 500 pages but I didn't look into how you do that. The page is a bit messy as well, but it does the job. Sitemap generator
this is the soft u need to to try: http://www.auditmypc.com/free-sitemap-generator.asp take reference of this thread: http://forums.digitalpoint.com/showthread.php?t=531765