bizarre urls generated in my sitemap file

Discussion in 'Google Sitemaps' started by relixx, May 17, 2006.

  1. #1
    I've just started making sitemaps for my sites, and while testing the numerous scripts and tools out there I noticed a weird occurrence.

    When the sitemap was generated, I kept seeing bizarre URLs, such as http://www.domain.com/https://www.domain.com, and URLs that were incorrect (such as www.domain.com/blah.php instead of www.domain.com/blah/blah.php). I've checked my site with Xenu and it came back clean, so I'm rather stumped. Can anyone shed some light on this?
     
    relixx, May 17, 2006 IP
  2. Xig

    #2
    They might be misinterpreting the different ways of expressing relative and absolute URLs ... are you getting consistent bizarre URLs from the scripts and tools?
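
    To illustrate the kind of misinterpretation I mean, here is a minimal Python sketch. It is only a guess at what a tool might be doing wrong, not the code of any actual sitemap generator; `naive_join` is a hypothetical helper, deliberately buggy, that glues every href onto the site root without checking whether the href is already absolute. `urljoin` from the standard library handles both cases.

```python
from urllib.parse import urljoin


def naive_join(site_root: str, href: str) -> str:
    """Deliberately buggy (hypothetical): always prepend the site root,
    without checking whether href is already an absolute URL."""
    return site_root.rstrip("/") + "/" + href.lstrip("/")


def resolve(page_url: str, href: str) -> str:
    """Resolve href against the page URL using standard URL-reference rules."""
    return urljoin(page_url, href)


root = "http://www.domain.com/"
absolute_href = "https://www.domain.com"  # an already-absolute link on the page

# The naive version produces the kind of URL reported in the first post.
print(naive_join(root, absolute_href))
# urljoin recognises the absolute URL and returns it unchanged.
print(resolve(root, absolute_href))
```

    If the bad URLs in the sitemap look like the first line of output, the tool is probably treating every link as relative.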
     
    Xig, May 17, 2006 IP
  3. websitetools

    #3
    Could we get the URL of your site? I would like to run my sitemap generator over it. Call me curious :)
     
    websitetools, May 17, 2006 IP
  4. hubel

    #4
    hubel, May 17, 2006 IP
  5. MaxPowers

    #5
    Depending on whether the sitemap tool uses an HTML parser or regular expressions to do the job, your results may vary. Strange URLs can be a sign of markup that doesn't validate at the W3C validator or one of the other validator tools.

    If you are truly having an impossible time getting it done, the last resort may be to find a sitemap service like AutoMapIt.com or others who use 'web-bugs' as an alternative. Web-bugs are images that call scripts in order to grab your URLs. This isn't the only way to do it, but for the really stubborn pages, it's sure to do the trick.
     
    MaxPowers, May 17, 2006 IP
  6. relixx

    #6
    yeah, pretty much :(

    yeah, that's when I started noticing the errors :(
     
    relixx, May 17, 2006 IP
  7. Xig

    #7
    Can you PM me the URL of your site? I'm intrigued :) I'll see if I notice anything obvious in the code that might be causing it
     
    Xig, May 18, 2006 IP
  8. relixx

    #8
    Thanks for the offer :) However, I think I've figured it out. The site uses a <base href=""> tag in the header, which means we coded all internal links without the www.domain.com bit, since that gets pulled from the base href tag.
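
    For anyone hitting the same thing, here is a rough Python sketch of how a crawler could honour the base tag. The page URL, the base href value (http://www.domain.com/blah/) and the `LinkCollector` / `extract_links` names are all made up for illustration; I have not checked how any particular sitemap tool actually resolves links.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects <a href> values and remembers the first <base href>, if any."""

    def __init__(self, page_url: str):
        super().__init__()
        self.base_url = page_url  # fallback when the page has no <base> tag
        self.base_seen = False
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        href = dict(attrs).get("href")
        if tag == "base" and href and not self.base_seen:
            # Only the first <base href> on a page counts.
            self.base_url = urljoin(self.base_url, href)
            self.base_seen = True
        elif tag == "a" and href:
            self.hrefs.append(href)

    def links(self):
        # Resolve after parsing so the base applies to every link on the page.
        return [urljoin(self.base_url, h) for h in self.hrefs]


def extract_links(page_url: str, html: str):
    collector = LinkCollector(page_url)
    collector.feed(html)
    collector.close()
    return collector.links()


# Hypothetical page: the base href points at a subdirectory.
PAGE = "http://www.domain.com/index.php"
HTML = (
    '<html><head><base href="http://www.domain.com/blah/"></head>'
    '<body><a href="blah.php">link</a></body></html>'
)

print(extract_links(PAGE, HTML))  # resolved against the <base href>
print(urljoin(PAGE, "blah.php"))  # what a tool ignoring <base> would produce
```

    With these made-up values, a tool that ignores the base tag ends up with www.domain.com/blah.php instead of www.domain.com/blah/blah.php, which matches the wrong URLs I was seeing.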
     
    relixx, May 18, 2006 IP