We were unable to access the URL you provided due to a restriction in robots.txt.

Discussion in 'robots.txt' started by NewComputer, Jun 20, 2005.

  1. #1
    Please make sure the robots.txt file and the Sitemap URL are correct and resubmit your Sitemap.

    Has anyone seen this when submitting a sitemap? I have a new site that has some weird things going on, and I am not sure what the cause is. AdSense ads are not displaying, this sitemap is not working... I don't know what is going on...
     
    NewComputer, Jun 20, 2005 IP
  2. jbw

    jbw Peon

    #2
    So what's in your robots.txt file?
     
    jbw, Jun 21, 2005 IP
  3. wrmineo

    wrmineo Peon

    #3
    First make sure your robots.txt file is accessible, then make sure it's not excluding Google or others that you want crawling your site.

    If that is okay, check to make sure your robots.txt file is not being "protected" via firewall software.
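
The "not excluding Google" check can be done without guessing. Here is a minimal sketch using Python's standard `urllib.robotparser`; the robots.txt content, the bot name `SomeOtherBot`, and the example.com URL are made up for illustration and are not taken from the poster's site.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical file that shuts out Googlebot but nobody else.
blocking = """\
User-agent: Googlebot
Disallow: /

User-agent: *
Disallow:
"""

SITEMAP_URL = "http://www.example.com/sitemap.xml"  # placeholder URL

print(is_allowed(blocking, "Googlebot", SITEMAP_URL))     # False
print(is_allowed(blocking, "SomeOtherBot", SITEMAP_URL))  # True
```

Pasting your real robots.txt into `blocking` shows how this parser reads it. It only covers the rules in the file; it says nothing about whether the file is reachable or whether a firewall is in the way.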
     
    wrmineo, Jun 21, 2005 IP
  4. NewComputer

    NewComputer Well-Known Member

    #4
    It was blank...
     
    NewComputer, Jun 21, 2005 IP
  5. crazyhorse

    crazyhorse Peon

    #5
    There are probably plenty of tools that allow you to validate your text file, but this robots.txt validator works for sure. It's still weird that the text file wasn't accessible. Did you set the wrong CHMOD rights, or doesn't that make any difference?
     
    crazyhorse, Jun 21, 2005 IP
  6. NewComputer

    NewComputer Well-Known Member

    #6
    The robots.txt file validates... My host just said that it may be a problem with my custom 404 page?
     
    NewComputer, Jun 21, 2005 IP
  7. NewComputer

    NewComputer Well-Known Member

    #7
    Now this is getting weird...

    I removed the www from the sitemap submission and it worked. Why would that be? Also, Google ads are not displaying on the www URL, but they are on the non-www URL.
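
When the www and non-www hostnames behave differently, it helps to request the same path on both and compare the status codes. The sketch below is one way to do that with Python's standard library; `host_variants` and `fetch_status` are hypothetical helper names, and example.com stands in for the real domain.

```python
import urllib.error
import urllib.request
from urllib.parse import urlsplit, urlunsplit


def host_variants(url: str) -> tuple[str, str]:
    """Return the (www, non-www) versions of url; the rest of the URL is unchanged."""
    parts = urlsplit(url)
    host = parts.netloc
    # Strip a leading "www." if present so both variants start from the bare host.
    bare = host[4:] if host.lower().startswith("www.") else host
    return (
        urlunsplit(parts._replace(netloc="www." + bare)),
        urlunsplit(parts._replace(netloc=bare)),
    )


def fetch_status(url: str, timeout: float = 10.0) -> int:
    """Return the HTTP status code for url (urllib follows redirects by default)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx and 5xx responses arrive as exceptions; the code is still useful.
        return err.code


www_url, bare_url = host_variants("http://www.example.com/robots.txt")
print(www_url)
print(bare_url)
```

Calling `fetch_status(www_url)` and `fetch_status(bare_url)` against your own domain should show whether one hostname answers differently from the other. A mismatch would point at DNS or virtual-host configuration rather than at the robots.txt file itself.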
     
    NewComputer, Jun 21, 2005 IP
  8. minstrel

    minstrel Illustrious Member

    #8
    If you're going to have a robots.txt file, don't leave it blank.

    Have it contain the following:

    User-agent: *
    Disallow:

    Are you using any rewrites or redirects in the .htaccess file?

    What did you use to upload the files to the server?
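
For readers unsure what the two-line file above means: an empty `Disallow:` blocks nothing, while `Disallow: /` blocks everything. The sketch below shows how Python's standard `urllib.robotparser` reads each case, plus a completely blank file. The helper name `allows` and the path `/index.html` are made up for illustration, and this only demonstrates Python's parser, not how any particular search engine treated a blank file in 2005.

```python
from urllib.robotparser import RobotFileParser

ALLOW_ALL = "User-agent: *\nDisallow:\n"    # the file suggested in this post
BLOCK_ALL = "User-agent: *\nDisallow: /\n"  # one character different, opposite effect
BLANK = ""                                  # an empty robots.txt


def allows(robots_txt: str, path: str = "/index.html", agent: str = "Googlebot") -> bool:
    """Return True if robots_txt permits agent to fetch path."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


print(allows(ALLOW_ALL))  # True
print(allows(BLOCK_ALL))  # False
print(allows(BLANK))      # True
```

This parser treats the blank file as permissive too, so the explicit two-line version mainly removes ambiguity for crawlers and tools that are less forgiving.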
     
    minstrel, Jun 22, 2005 IP
  9. NewComputer

    NewComputer Well-Known Member

    #9
    I upload using WS_FTP, and I am not using any .htaccess rewrites or redirects...
     
    NewComputer, Jun 22, 2005 IP
  10. nevetS

    nevetS Evolving Dragon

    #10
    I wonder if there is some sort of problem with your Apache configuration, maybe at your host, that lets spiders get to one URL but not the other. There's a Firefox plugin to change your user agent. Try accessing your robots.txt file as Googlebot and see whether you end up getting either rejected or redirected.
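
The same test can be run without a browser plugin. Below is a minimal sketch in Python that sends a Googlebot-style `User-Agent` header and reports the status code without following redirects, so a rejection or a redirect is visible directly. The user-agent string is an assumption (it is the commonly published Googlebot form and should be checked against Google's own documentation), and `build_request`, `probe`, and `NoRedirect` are hypothetical names.

```python
import urllib.error
import urllib.request

# Assumed value: the widely published Googlebot string; verify before relying on it.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


class NoRedirect(urllib.request.HTTPRedirectHandler):
    """Redirect handler that refuses to follow 3xx responses."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        # Returning None makes urllib surface the 3xx response as an HTTPError.
        return None


def build_request(url: str, user_agent: str = GOOGLEBOT_UA) -> urllib.request.Request:
    """Build a GET request that carries the given User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def probe(url: str, user_agent: str = GOOGLEBOT_UA, timeout: float = 10.0):
    """Return (status_code, redirect_target_or_None) for url."""
    opener = urllib.request.build_opener(NoRedirect)
    try:
        with opener.open(build_request(url, user_agent), timeout=timeout) as resp:
            return resp.status, None
    except urllib.error.HTTPError as err:
        # Covers 3xx (not followed), 4xx, and 5xx responses.
        return err.code, err.headers.get("Location")


request = build_request("http://www.example.com/robots.txt")  # placeholder URL
print(request.get_header("User-agent"))
```

Running `probe(...)` on both the www and non-www robots.txt URLs, once with the Googlebot string and once with an ordinary browser string, would show whether the server answers differently by user agent. It changes only the header, so it cannot reveal blocking that is based on the crawler's IP address.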
     
    nevetS, Jun 22, 2005 IP
  11. minstrel

    minstrel Illustrious Member

    #11
    minstrel, Jun 23, 2005 IP