How's my robots.txt?

Discussion in 'robots.txt' started by raptrex, Jul 11, 2008.

  1. #1
    User-Agent: *
    Disallow: /lang
    Disallow: /img
    Disallow: /style
    Disallow: /cache
    Allow: /
    Disallow: /admin
    
    

     
    raptrex, Jul 11, 2008 IP
  2. shivakhanal

    #2
    If you are using Google AdSense on your website, every page should be accessible to the AdSense crawler, so add these lines too:

    User-agent: Mediapartners-Google
    Disallow:
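
    A rough way to check what those two lines do is Python's standard urllib.robotparser. This is only a sketch: the combined file below and the "ExampleBot" name are assumptions made for illustration, and real crawlers may interpret rules differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the thread's Disallow rules plus the AdSense group.
ROBOTS_TXT = """\
User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /lang
Disallow: /img
Disallow: /style
Disallow: /cache
Disallow: /admin
"""


def build_parser(text):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# An empty "Disallow:" blocks nothing, so the AdSense crawler may fetch this page.
adsense_ok = parser.can_fetch("Mediapartners-Google", "/cache/page.html")

# Any other robot falls back to the "*" group, where /cache is blocked.
other_ok = parser.can_fetch("ExampleBot", "/cache/page.html")

print(adsense_ok, other_ok)
```

    The named group applies only to Mediapartners-Google; every other robot still falls under the "*" group.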

    Thanks
     
    shivakhanal, Jul 12, 2008 IP
  3. Yousif

    #3
    Everyone should drop the subject of robots.txt. Firstly, robots.txt is extremely stupid. Google may not snoop around the areas you choose to block, but anyone with a browser can enter /robots.txt at the end of your domain and clearly see every directory you listed there. Do yourself a huge favor and simply remove /robots.txt. The countermeasure is to lock or restrict the folders with a password or some other authentication/authorization scheme. Lastly, add an index.html file to each folder so the server does not show a directory listing, and call it quits. That's your best bet for security. As for SEO and other optimization methods, robots.txt might help, but security matters more than functionality ;)
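
    To make the "anyone can read it" point concrete, here is a small sketch that pulls the listed paths out of a robots.txt file. The helper name listed_paths is made up for this example, and the sample text is the file from post #1; nothing here is specific to any real site.

```python
def listed_paths(robots_txt):
    """Return every path named in an Allow or Disallow line, in file order."""
    paths = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = line.split(":", 1)
        if field.strip().lower() in ("allow", "disallow") and value.strip():
            paths.append(value.strip())
    return paths


# The file from post #1, which any visitor could download and read.
SAMPLE = """\
User-Agent: *
Disallow: /lang
Disallow: /img
Disallow: /style
Disallow: /cache
Allow: /
Disallow: /admin
"""

print(listed_paths(SAMPLE))
```

    The output includes /admin, which is exactly the kind of path the file was meant to keep robots away from.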
     
    Yousif, Jul 12, 2008 IP
  4. catanich

    #4
    I would drop the "Allow: /"; it is not needed.

    User-Agent: *
    Disallow: /lang
    Disallow: /img
    Disallow: /style
    Disallow: /cache
    Disallow: /admin
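
    One reason the placement of "Allow: /" matters: parsers differ on which rule wins. The sketch below compares two conventions on the file from post #1: "first match wins" (the original 1996 robots draft) and "longest match wins" (RFC 9309). Both functions are simplified illustrations with made-up names; they use plain prefix matching and ignore wildcards.

```python
# (directive, path) pairs from post #1, in file order.
ORIGINAL = [
    ("disallow", "/lang"),
    ("disallow", "/img"),
    ("disallow", "/style"),
    ("disallow", "/cache"),
    ("allow", "/"),
    ("disallow", "/admin"),
]

# The same rules with "Allow: /" dropped.
WITHOUT_ALLOW = [rule for rule in ORIGINAL if rule[0] != "allow"]


def first_match(rules, path):
    """Older convention: the first rule whose prefix matches decides."""
    for directive, prefix in rules:
        if path.startswith(prefix):
            return directive == "allow"
    return True  # no rule matched, so crawling is allowed


def longest_match(rules, path):
    """RFC 9309 convention: the longest matching prefix decides; Allow wins ties."""
    best = None
    for directive, prefix in rules:
        if path.startswith(prefix):
            key = (len(prefix), directive == "allow")
            if best is None or key > best:
                best = key
    return True if best is None else best[1]


for name, rules in (("original", ORIGINAL), ("without Allow", WITHOUT_ALLOW)):
    print(name, first_match(rules, "/admin/login"), longest_match(rules, "/admin/login"))
```

    With the original file, a first-match parser reaches "Allow: /" before "Disallow: /admin" and treats /admin as allowed. Without the Allow line, both conventions agree that /admin is blocked.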
     
    catanich, Aug 21, 2008 IP
  5. Got-Paid-Forum.com

    #5
    I couldn't agree more! I have never added a robots.txt to any of my sites, for just that reason.
     
    Got-Paid-Forum.com, Aug 22, 2008 IP
  6. doridori

    #6
    Excellent, excellent. I second that.
     
    doridori, Aug 23, 2008 IP
  7. Bridgesan

    #7
    What is the txt file for? Can you get a text file that displays certain ads?
     
    Bridgesan, Aug 25, 2008 IP
  8. mishraasuresh

    #8
    Please suggest a Google AdSense PIN number.
     
    mishraasuresh, Sep 5, 2008 IP
  9. nbs

    #9


    Great security advice.
     
    nbs, Sep 5, 2008 IP
  10. imnocost

    #10
    A question about not using robots.txt: when I don't have one on my site, requests for it show up in my cPanel AWStats report under "HTTP error codes" as 404 - Document not found.

    I checked earlier today and it shows 56 hits for robots.txt, so wouldn't this be a problem?

    Let me know what you think.
    Mike
     
    imnocost, Sep 13, 2008 IP
  11. fireworking

    #11
    lol! 56 hits for robots.txt. Maybe I should name the includes folder "h2kld9wj" and put the other useful stuff in one big folder: "ao47hgaiwuhga9w83hawyt9awyg30awu30t"
     
    fireworking, Sep 14, 2008 IP
  12. keyaa

    #12
    @fireworking: that's just security by obscurity. Don't do it.

    - Your include files should always have a .php extension and start with <?php. Never use .inc, .txt, .htm, etc.
    - Set correct file permissions and use .htaccess to deny access to directories and files.

    Also, not having a robots.txt is a bad thing SEO-wise.
     
    keyaa, Sep 29, 2008 IP
  13. Jean-Luc

    #13
    It is a common misconception that robots.txt can improve security. Still, robots.txt is very useful for passing crawling instructions to polite robots.
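
    A minimal sketch of what "polite" means in practice: the check happens on the robot's side, before it requests a page. The rule list, the URL list, and the "ExampleBot" name are assumptions made for illustration, using Python's standard urllib.robotparser.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules and candidate URLs, for illustration only.
RULES = ["User-agent: *", "Disallow: /admin"]
WANTED = ["/index.html", "/admin/login.php"]


def polite_urls(urls, rules, agent="ExampleBot"):
    """Keep only the URLs a rule-abiding crawler would request."""
    parser = RobotFileParser()
    parser.parse(rules)
    return [url for url in urls if parser.can_fetch(agent, url)]


print(polite_urls(WANTED, RULES))  # what a polite robot requests
print(WANTED)                      # what a robot that skips the check requests
```

    Nothing on the server enforces the first list; a robot that never calls the check simply requests everything in the second one.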

    Jean-Luc
     
    Jean-Luc, Sep 29, 2008 IP