Need help with sitemap

Discussion in 'Google Sitemaps' started by RECEP, Mar 3, 2009.

  1. #1
    Hello friends, I used a sitemap builder program made by this company. It took a long time to create a sitemap of my article site (3 days nonstop). When the sitemap creation finished, I uploaded all 4 sitemap formats to my server, and after I added them to Google Sitemaps, it gave lots of errors. First of all, the HTML sitemap hit the file size limit. That's OK, I split it into pieces of less than 10 MB each; it's fine to split the HTML or TXT formats into pieces. But the most annoying thing is the XML problem I am having. The software's XML sitemap output looks like this:
    <url><loc>http://www.articles4.co.cc/article100798.html</loc></url>
    <url><loc>http://www.articles4.co.cc/index.php?pg=2&page=category&category_id=179</loc></url>
    Of these lines, I learned that Google doesn't accept links like this one:
    <url><loc>http://www.articles4.co.cc/index.php?pg=2&page=category&category_id=179</loc></url>
    I don't know why.
    I have a bunch of lines like this, around half of my sitemap (45,000 pages).
    The most annoying thing is that these lines are scattered among the other links, so I would have to find and delete them manually.

    My request: is there any coder who can help me solve this problem? I mean, what should I do to find and remove these lines from the file? Or is there a better (free) sitemap creator that handles more than 100,000 pages without any errors?
     
    RECEP, Mar 3, 2009 IP
  2. RECEP

    RECEP Well-Known Member

    Messages:
    1,855
    Likes Received:
    23
    Best Answers:
    0
    Trophy Points:
    195
    #2
    :) No one knows?
     
    RECEP, Mar 4, 2009 IP
  3. websitetools

    websitetools Well-Known Member

    Messages:
    1,513
    Likes Received:
    25
    Best Answers:
    4
    Trophy Points:
    170
    #3
    websitetools, Mar 5, 2009 IP
  4. RECEP

    #4
    I am grateful to you for this information. I wouldn't have thought a sitemap generator couldn't handle this. You buy or download a sitemap generator, it creates 100,000 indexed pages, but then you have to convert some symbols manually yourself. It's idiotic :p But thanks for informing me.
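
    The symbol conversion does not have to be done by hand. Below is a minimal sketch, assuming the rejected lines are caused by the bare `&` characters in the URLs: XML requires `&` inside a `<loc>` value to be written as `&amp;`. The function names and file paths are hypothetical, and the script has not been run against the actual sitemap files from this thread.

```python
import re

# Matches an "&" that does not already start an XML entity such as &amp; or &#38;
BARE_AMP = re.compile(r"&(?!(?:amp|lt|gt|quot|apos|#\d+|#x[0-9A-Fa-f]+);)")


def escape_bare_ampersands(line: str) -> str:
    """Replace every unescaped & with &amp;, leaving existing entities alone."""
    return BARE_AMP.sub("&amp;", line)


def fix_sitemap(src_path: str, dst_path: str) -> int:
    """Copy src_path to dst_path with bare ampersands escaped.

    Returns the number of lines that were changed. Both paths are
    hypothetical; point them at your own sitemap files.
    """
    changed = 0
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            fixed = escape_bare_ampersands(line)
            changed += fixed != line
            dst.write(fixed)
    return changed
```

    Calling `fix_sitemap("sitemap.xml", "sitemap_fixed.xml")` would write a corrected copy and leave the original untouched, so nothing has to be deleted from the sitemap.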
     
    RECEP, Mar 6, 2009 IP
  5. websitetools

    #5
    Many sitemapper tools including A1 Sitemap Generator handle this correctly... So I think you have just been unlucky :)

    Note: You may want to report the bug/issue to the author of the sitemapper tool that generated the incorrect sitemaps.
     
    websitetools, Mar 7, 2009 IP
  6. siscopemike

    siscopemike Guest

    Messages:
    27
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #6
    How big are the sitemap files in terms of pages? Is it possible that the sitemaps themselves are too large and that is what is causing Google to tip over?
     
    siscopemike, Mar 8, 2009 IP
  7. zitoitala

    zitoitala Peon

    Messages:
    106
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Definitely sounds like a sitemap size issue here. Check that the file size is not causing the submission process to time out.
     
    zitoitala, Mar 8, 2009 IP
  8. Lpe04

    Lpe04 Peon

    Messages:
    579
    Likes Received:
    15
    Best Answers:
    0
    Trophy Points:
    0
    #8
    Try to do everything piece by piece, little by little and adding to it, instead of trying to do it all at once. Or try Thomas' suggestion of A1.

    Hope this helps.
    Take care.
     
    Lpe04, Mar 8, 2009 IP
  9. RECEP

    #9
    Thomas, I also tried your sitemap generator. It's very slow and would take months to index 100,000 pages. :( I used wonderwebware, which is much faster because it crawls over 10 simultaneous connections.

    Ah yes, Google accepts sitemaps of less than 10 MB, so you have to divide any file larger than 10 MB. This sitemap maker does that as well, but it doesn't convert some symbols.
    Well, I did it, pal. At least, I used the TXT (Yahoo-style) sitemap in Google, because I couldn't manage to convert all the XML files; they always gave problems.

    But a good sitemap creator is really needed in this sector.
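
    For anyone taking the same TXT route, here is a rough sketch of turning an existing XML sitemap into text sitemaps (one URL per line). It assumes the XML is the flat `<url><loc>…</loc></url>` form shown in post #1, and it uses a regular expression instead of an XML parser because a parser would reject the unescaped `&`. The 50,000 URL cap per file comes from the sitemap protocol; the function names and the `sitemap` file prefix are made up for illustration.

```python
import re

# Captures the text between <loc> and </loc>, even if the XML is not well-formed
LOC = re.compile(r"<loc>(.*?)</loc>", re.IGNORECASE)
MAX_URLS = 50_000  # per-file URL limit in the sitemap protocol


def extract_urls(xml_text):
    """Return every <loc> value as a plain URL (any &amp; turned back into &)."""
    return [m.strip().replace("&amp;", "&") for m in LOC.findall(xml_text)]


def chunk(urls, size=MAX_URLS):
    """Yield consecutive slices of at most `size` URLs."""
    for start in range(0, len(urls), size):
        yield urls[start:start + size]


def write_text_sitemaps(urls, prefix="sitemap", size=MAX_URLS):
    """Write sitemap1.txt, sitemap2.txt, ... and return the file names."""
    names = []
    for i, part in enumerate(chunk(urls, size), start=1):
        name = f"{prefix}{i}.txt"
        with open(name, "w", encoding="utf-8") as out:
            out.write("\n".join(part) + "\n")
        names.append(name)
    return names
```

    If the size limit is hit before the URL limit, lowering `size` is the simplest way to get smaller files.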
     
    RECEP, Mar 8, 2009 IP
  10. Lpe04

    #10
    Google allows you to submit a text file
     
    Lpe04, Mar 10, 2009 IP
  11. websitetools

    #11
    People often tell me A1SG is far faster than anything else. The only exception I have encountered is with websites/webservers that dislike HEAD requests before GET requests. Most crawlers (and all browsers etc.) just use GET, but A1SG defaults to a HEAD request followed by a GET request.

    Generally speaking, it is quite possible to achieve around 20,000 URLs/hour with a little optimization and a standard computer. (Many factors play in, so I cannot make promises. I can't help it if e.g. the website/webserver/DB is overloaded or slow, but I have scanned much higher numbers of URLs/hour on some sites.) If A1SG (current version) is underperforming compared to any other desktop sitemapper tool, I am very interested in knowing the website address. (Goes for all.)

    If A1SG seems very slow, it fits with the GET settings issue. If so, the fix is:
    Scan website | Crawler Engine | Advanced Engine Settings:
    Enable/tick: Default to GET for page requests

    There's of course also the chance you have discovered a different issue... Can I assume the website is http://www.articles4.co.cc? And can I run a quick test against it?
     
    websitetools, Mar 11, 2009 IP
  12. RECEP

    #12
    Yes, in the end I did it with TXT, because none of the sitemap creator software works correctly.
     
    RECEP, Mar 12, 2009 IP
  13. gary101

    gary101 Peon

    Messages:
    278
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #13
    I use the sitemap generator below. It has some good options for smaller websites of up to 500 pages, it can save in different formats, and it has an editor so you can quickly remove pages or folders that you don't want in your sitemap.

    You can index more than 500 pages but I didn't look into how you do that. The page is a bit messy as well, but it does the job.

    Sitemap generator
     
    gary101, Mar 12, 2009 IP
  14. RECEP

    #14
    Thanks, I will try it.
     
    RECEP, Mar 20, 2009 IP
  15. coolseo36

    coolseo36 Well-Known Member

    Messages:
    979
    Likes Received:
    92
    Best Answers:
    0
    Trophy Points:
    140
    #15
    coolseo36, Mar 23, 2009 IP