Do you use Robots.txt File? Here is the Importance.

Discussion in 'Search Engine Optimization' started by Rogi, Mar 14, 2009.

  1. #1
    Hi, Not much of the webmasters are known to the role of Robots.txt. SO here is my try, please also share your views about this one.
    Not many web master take the time to use a robots.txt file for their website. For search engine spiders that use the robots.txt to see what directories to search through.

    The robots.txt file which can be created simply in a notepad can be very helpful in keeping the spiders indexing your actual pages and not other information, such as looking through your stats and the files that you are going to provide the visitors to download after making the payment etc.

    The robots.txt file is useful in keeping your spiders from accessing parts folders and files in your hosting directory that are totally unrelated to your actual web site content, for example you might have also uploaded those files which are in progress and not fully developed and completed. You can choose to have the spiders kept out of areas that contain programming that search engines cannot parse properly, and to keep them out of the web stats portion of your site.
    What are your ideas about this file?

    Feedback Appreciated!!
     
    Rogi, Mar 14, 2009 IP
  2. rootbinbash

    rootbinbash Peon

    Messages:
    2,198
    Likes Received:
    88
    Best Answers:
    0
    Trophy Points:
    0
    #2
    It is very important especially to disallow something,a file or file extension.
     
    rootbinbash, Mar 14, 2009 IP
  3. sfinances

    sfinances Banned

    Messages:
    63
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #3
    this is very important to our sites but i didn't use to our sites yet.
     
    sfinances, Mar 14, 2009 IP
  4. catanich

    catanich Peon

    Messages:
    1,921
    Likes Received:
    40
    Best Answers:
    0
    Trophy Points:
    0
    #4
    On all of our sites, we use the robots.txt file. It keeps the SEs from indexing the junk on your sites.
     
    catanich, Mar 14, 2009 IP
  5. georgedosen

    georgedosen Active Member

    Messages:
    771
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    60
    #5
    Yah but some SE can still index the files robot.txt tells you not to.
    Most of are like hacking spiders tho...lol
     
    georgedosen, Mar 14, 2009 IP
  6. BenGregg

    BenGregg Peon

    Messages:
    619
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    0
    #6
    I don't think I have ever used a Robots.txt file.
     
    BenGregg, Mar 14, 2009 IP
  7. seosapien

    seosapien Peon

    Messages:
    618
    Likes Received:
    12
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Really?!?! I have always considered it a must, it is one of the first steps I take on any on site optimization. It doesn't rank a site by itself but I consider it still is a good thing to have.
     
    seosapien, Mar 14, 2009 IP
  8. thawt

    thawt Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #8
    Google by default caches pages, so meta name="googlebot" content="noarchive"

    * NOINDEX tag tells Google not to index a specific page
    * NOFOLLOW tag tells Google not to follow the links on a specific page
    * NOARCHIVE tag tells Google not to store a cached copy of your page
    * NOSNIPPET tag tells Google not to show a snippet (description) under your Google listing, it will also not show a cached link in the search results

    Although thats not really SEO, most SEO would want google to crawl over everything!

    If a webmaster wishes to restrict the information on their site available to a Googlebot, or another well-behaved spider, they can do so with the appropriate directives in a robots.txt

    Google also rely on the following two robots, deepbot and freshbot.. so you can "deepbot" content="noarchive" etc.
     
    thawt, Mar 14, 2009 IP
  9. stevepmd

    stevepmd Peon

    Messages:
    303
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #9
    Hello,
    Where do I place this code in a wordpress blog?

    Thanks
     
    stevepmd, Mar 14, 2009 IP
  10. gr8webseller

    gr8webseller Peon

    Messages:
    1,097
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #10
    thanks for sharing this info. i am completely new to robots.txt. i will create a robots.txt file for my site.
     
    gr8webseller, Mar 15, 2009 IP
  11. zeekstern

    zeekstern Active Member

    Messages:
    872
    Likes Received:
    20
    Best Answers:
    0
    Trophy Points:
    60
    #11
    NOARCHIVE tag tells Google not to store a cached copy of your page

    When/why would you not want Google to store a cached copy?

    Thanks,
    Zeek
     
    zeekstern, Mar 15, 2009 IP
  12. pdesigns

    pdesigns Peon

    Messages:
    213
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    0
    #12
    You can also direct search engines to your sitemap with a robots.txt file by placing Sitemap: 'sitemap link' in it.
     
    pdesigns, Mar 15, 2009 IP
  13. johnbelly

    johnbelly Peon

    Messages:
    104
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #13
    Thanks buddy for providing such information. Actually, I don't know about robots.txt.
    But, after reading your thread i get idea about robots.txt and its importance.
     
    johnbelly, Mar 16, 2009 IP
  14. Rogi

    Rogi Active Member

    Messages:
    1,103
    Likes Received:
    18
    Best Answers:
    0
    Trophy Points:
    90
    #14
    We need it in that case, when we plan to change the image of our website and its graphic very frequently.
     
    Rogi, Mar 16, 2009 IP
  15. jingwen

    jingwen Peon

    Messages:
    312
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #15
    Of course the robots.txt is very important for us,if you do not want the search engine to crawl your page,you need it.
     
    jingwen, Mar 17, 2009 IP
  16. steven15

    steven15 Peon

    Messages:
    21
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #16
    Hi Rogi
    Can we tell from a site's "view source code" if the file is attached?
    Does this file help in optimising in some way as well?

    Regards
     
    steven15, Mar 17, 2009 IP
  17. manis

    manis Well-Known Member

    Messages:
    530
    Likes Received:
    8
    Best Answers:
    0
    Trophy Points:
    108
    #17
    I was totally unaware about this,thanks for showing importance i will try it!
     
    manis, Mar 17, 2009 IP
  18. affiliate-toolbars

    affiliate-toolbars Peon

    Messages:
    36
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #18
    I agree with everyone, robot.txt file plays an important role in SEO if you want to keep the spiders from finding pages that aren't really related to your site, but to my understanding having one that doesn't disallow anything helps your SE rankings a little too.
     
    affiliate-toolbars, Mar 17, 2009 IP
  19. mikelorentz

    mikelorentz Member

    Messages:
    73
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    43
    #19
    oh my.. i m shocked to see how many people are not aware of robots.txt. I suppose every matured and experienced webmaster do use robots.txt in the times of need. If your website having any data that you don't want to be get listed like user information. Robots.txt help you to prevent SE bot to crawl such pages.
     
    mikelorentz, Mar 17, 2009 IP
  20. 385ashley

    385ashley Peon

    Messages:
    30
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #20
    robots.txt is a file that you use to define your targeted users. you mention those bots as nofollow in this file to whom you don't want to show your site.
     
    385ashley, Mar 17, 2009 IP