Help with robots.txt file

Discussion in 'robots.txt' started by compindustries, Jun 19, 2006.

  1. #1
    Hello everyone, I have a site www.bignutsracing.com

    Could someone be so generous as to instruct me on the proper use of a robots.txt file.

    Spiders are allowed to crawl everything on the site, therefore, I have placed the file in both the www.bignutsracing.com directory and the www.bignutsracing.com/forums directory.

    All I put in the file was:

    User-agent: *
    Disallow:

    Is this correct?

    Any help is greatly appreciated!
     
    compindustries, Jun 19, 2006 IP
  2. woodside

    woodside Peon

    Messages:
    182
    Likes Received:
    8
    Best Answers:
    0
    Trophy Points:
    0
    #2
    If you want spiders to crawl everything, you don't need a robots.txt file. robots.txt files are used to restrict certain files/directories from robots/crawlers.
     
    woodside, Jun 19, 2006 IP
  3. compindustries

    compindustries Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #3
    Thanks for the response...I thought I read somewhere that spiders automatically look for a robots.txt file and some will not do anything if they do not find one?
     
    compindustries, Jun 19, 2006 IP
  4. debunked

    debunked Prominent Member

    Messages:
    7,298
    Likes Received:
    416
    Best Answers:
    0
    Trophy Points:
    310
    #4
    I would atleast put in a blank page so that the robots find it and it doesn't show up in error logs
     
    debunked, Jun 19, 2006 IP
  5. gemini181

    gemini181 Well-Known Member

    Messages:
    2,883
    Likes Received:
    134
    Best Answers:
    0
    Trophy Points:
    155
    #5
    I could 'google it', but since we already have a thread going please, tell me exactly how to use the robots.txt file to restrict certain files/directories from robots/crawlers?

    Thanks
     
    gemini181, Jun 19, 2006 IP
  6. johnharrytaylor

    johnharrytaylor Peon

    Messages:
    28
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #6
    if i want to restrict certain bots then what should i write in rorots.txt file
     
    johnharrytaylor, Jun 28, 2006 IP
  7. Jean-Luc

    Jean-Luc Peon

    Messages:
    601
    Likes Received:
    30
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Jean-Luc, Jun 28, 2006 IP
  8. joepan

    joepan Peon

    Messages:
    71
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #8
    I added a robots.tct file to my site but am not sure what it does or if it is doing anything but according to google site map beta it looks at it every few days. So it must do something :)
     
    joepan, Jul 1, 2006 IP
  9. sivainternet

    sivainternet Peon

    Messages:
    521
    Likes Received:
    20
    Best Answers:
    0
    Trophy Points:
    0
    #9
    User-agent: BotName
    Disallow: /folder1/
    Disallow: /folder2/

    say, for example, you dont want google to crawl your images directory. The usage would be:

    User-agent: Googlebot
    Disallow: /images/

    If you want to block all search engines:

    User-agent: *
    Disallow: /images/

    You can get detailed tutorials if you search on google
     
    sivainternet, Jul 1, 2006 IP
  10. gdnovey

    gdnovey Peon

    Messages:
    12
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #10
    gdnovey, Jul 5, 2006 IP
  11. tomart

    tomart Peon

    Messages:
    4
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #11
    We had restrict robots.txt for a while time. Is any chance to say googlebot to index our website now? We change robots.txt to allow indexing website, add website via form at Google pages, but googlebot do not indexing yet :(
     
    tomart, Oct 31, 2007 IP
  12. Jean-Luc

    Jean-Luc Peon

    Messages:
    601
    Likes Received:
    30
    Best Answers:
    0
    Trophy Points:
    0
    #12
    Google Webmaster Tools can help you solve this. Create an account if you don't have one, then add the site to the account and ask Google to verify it.

    When this is done, go to "Tools", then "Analyze robots.txt" and Google will read your robots.txt. Then you will have to be patient as the following actions by Google depend on the number of quality links pointing to your web site.

    Jean-Luc
     
    Jean-Luc, Oct 31, 2007 IP
  13. tomart

    tomart Peon

    Messages:
    4
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #13
    OK, Thanks. I try it. We have Google webmaster tools account.
     
    tomart, Oct 31, 2007 IP
  14. muks

    muks Member

    Messages:
    83
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    45
    #14
    That is correct but you don't have to put another robots.txt in the forums directory

    w w w.bignutsracing.com/robots.txt will suffice for all the directories under it
     
    muks, Nov 1, 2007 IP