A little confused about some sitemap+robots.txt stuff

Discussion in 'Search Engine Optimization' started by capitalalchemy, Jun 16, 2009.

  1. #1
    1. Ok, so let's say that I create a sitemap and it's an onsite sitemap. Now let's say that I also create an XML sitemap for Google's eyes only.

    Let's pretend that I have about 30 links in that XML sitemap, and that 3 of those links are listed under Disallow in the robots.txt file.

    I'm starting to think that this confuses Google. Should I leave links that are in robots.txt out of my XML sitemap?

    2. This is a stupid one, I'm afraid :( - Is it common practice to update the XML sitemap every time a new page is added to a site, and resubmit it to Google? That's what I've been doing. Is this the normal way to inform Google of new pages?
     
    capitalalchemy, Jun 16, 2009
  2. Styxbowl20v

    Styxbowl20v Active Member

    #2
    If you want to disallow a page then don't include it in your XML sitemap. Only pages you want crawled should be in your XML sitemap.
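One way to catch this kind of conflict is to run the sitemap's URLs through a robots.txt parser. The following is only a rough sketch using Python's standard `urllib.robotparser`; the example.com URLs, the Disallow rules, and the `blocked_sitemap_urls` helper are made up for illustration and are not taken from the poster's site.

```python
from urllib.robotparser import RobotFileParser


def blocked_sitemap_urls(robots_txt, sitemap_urls, user_agent="Googlebot"):
    """Return the sitemap URLs that robots.txt forbids user_agent from crawling."""
    parser = RobotFileParser()
    # parse() accepts the robots.txt content as a list of lines
    parser.parse(robots_txt.splitlines())
    return [url for url in sitemap_urls if not parser.can_fetch(user_agent, url)]


# Hypothetical robots.txt content (assumption, for illustration only)
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /cart
"""

# Hypothetical URLs as they might appear in an XML sitemap
SITEMAP_URLS = [
    "http://www.example.com/",
    "http://www.example.com/about.html",
    "http://www.example.com/private/notes.html",
    "http://www.example.com/cart",
]

conflicts = blocked_sitemap_urls(ROBOTS_TXT, SITEMAP_URLS)
print(conflicts)
```

Any URL this prints is both listed in the sitemap and blocked by robots.txt, so it is a candidate for removal from the sitemap.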

    You should add any new pages to your XML sitemap, but you don't need to "resubmit" it. If you are hosting your XML sitemap on your server and referencing it from your robots.txt file, then it will be picked up automatically by Google, Yahoo, and MSN.
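For reference, the sitemaps.org protocol has robots.txt point to a sitemap with a `Sitemap:` line that gives the full URL. The sketch below assumes a made-up example.com site and Python 3.8 or later (for `site_maps()`); it parses such a robots.txt and reads the reference back out, roughly the way a crawler would discover it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt (assumption): the Sitemap line carries the full URL
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/

Sitemap: http://www.example.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns the list of Sitemap URLs, or None if there are none
sitemaps = parser.site_maps()
print(sitemaps)
```

The `Sitemap:` line is independent of the `User-agent` groups, so it can sit anywhere in the file.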

    Hope that helps!
     
    Styxbowl20v, Jun 16, 2009
  3. trosquin

    trosquin Active Member

    #3
    Right... the XML sitemap should list only the pages you want Google to crawl and index.

    Once you have a sitemap and tell Google where to find it in your webmaster tools, Google will crawl it every time they come to your site, whether that's once a day, week, or month. So resubmitting is a waste of your time.
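Since the sitemap gets re-read on each visit, the practical job is just keeping the file current. Below is a minimal sketch of regenerating it whenever a page is added, using Python's standard `xml.etree.ElementTree`; the `build_sitemap` helper, the URLs, and the dates are hypothetical, and only the `urlset`, `url`, `loc`, and `lastmod` elements come from the sitemaps.org schema.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(pages):
    """Build sitemap XML from (url, lastmod) pairs, lastmod as 'YYYY-MM-DD'."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for loc, lastmod in pages:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc
        ET.SubElement(url, "lastmod").text = lastmod
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical existing page list (assumption, for illustration)
pages = [("http://www.example.com/", "2009-06-01")]

# A new page goes live: add it and regenerate the file contents
pages.append(("http://www.example.com/new-page.html", "2009-06-16"))
sitemap_xml = build_sitemap(pages)
print(sitemap_xml)
```

Writing `sitemap_xml` to the sitemap file on the server is then enough; no resubmission step is involved.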
     
    trosquin, Jun 16, 2009
  4. capitalalchemy

    capitalalchemy Member

    #4
    Thanks guys - so I should include it in my robots file as well?

    like Allow: sitemap.xml?

    I did not know that, I never even thought about it - I feel so stupid now, oh well thanks for the great advice - everyone's always so helpful :D

    - take care.
     
    capitalalchemy, Jun 16, 2009
  5. Styxbowl20v

    Styxbowl20v Active Member

    #5
    Styxbowl20v, Jun 17, 2009