Something about robots.txt

Discussion in 'Search Engine Optimization' started by maxbear, Jul 26, 2007.

  1. #1
    I've seen some blogs say that I can stop bots from indexing the WordPress files. Here is what they use:

    Disallow: /wp-

    Or

    Disallow: /wp-*

    Or

    Disallow: /wp-includes/

    Does anyone know the difference between /wp- and /wp-*? Which one should I use if I want to block all the subdirectories and files whose names start with wp-?
     
    maxbear, Jul 26, 2007
  2. The_FoX

    #2
    I don't think there are as many directories starting with wp- as you say. All you have is wp-admin and wp-includes, or maybe one more.

    So please take the trouble to write them out rather than experimenting.

    Cheers!
    Mani
     
    The_FoX, Jul 26, 2007
  3. Dan Schulz

    #3
    wp-admin, wp-content and wp-includes are all there is.

    Those are the directories you want to disallow in your robots.txt file.

    Just bear in mind that if a Web page is password protected, the search engine spider won't be able to get past it. For more information on the robots.txt protocol, go to http://www.robotstxt.org/wc/robots.html
     
    Dan Schulz, Jul 26, 2007
  4. Peobigwig

    #4
    I second that.
     
    Peobigwig, Jul 26, 2007
  5. trichnosis

    #5
    This one blocks everything that starts with wp-.

    It will block:

    wp-
    wp-gjgjgjgj/ (a folder)
    wp-kgjkfggfngfng.php (or .html, .asp, etc.)
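
    For what it's worth, the prefix behaviour described above can be sketched in a few lines of Python. This is only a rough illustration, not any crawler's actual code: rule_matches is a made-up helper, and its handling of * and $ assumes the wildcard extension that the major crawlers support (the original robots.txt convention only defines plain prefix matching). Under that assumption /wp- and /wp-* cover the same paths, because a rule already matches anything that begins with it.

```python
import re

def rule_matches(rule: str, path: str) -> bool:
    """Return True if a Disallow rule covers the given URL path.

    Hypothetical helper: '*' matches any run of characters and a
    trailing '$' anchors the end of the path. Without either, the
    rule is a plain prefix. Empty rules are not handled here.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    if anchored:
        pattern += "$"
    # re.match anchors at the start only, which gives prefix matching.
    return re.match(pattern, path) is not None

paths = [
    "/wp-admin/",
    "/wp-login.php",
    "/wp-content/uploads/a.png",
    "/about/",
    "/blog/wp-notes/",
]
for p in paths:
    # Columns: path, blocked by /wp-, blocked by /wp-*, blocked by /wp-includes/
    print(p, rule_matches("/wp-", p), rule_matches("/wp-*", p),
          rule_matches("/wp-includes/", p))
```

    The first two columns always agree, while /wp-includes/ covers only that one directory. Note that /blog/wp-notes/ is not covered by either rule, since matching starts at the beginning of the path.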
     
    trichnosis, Jul 26, 2007
  6. The_FoX

    #6
    Damn - that confuses me. :D How about this one?

    Disallow: /script

    Doesn't that disallow all the files and folders beneath the folder "script"?

    If I wanted to disallow only the folder "script", then it should be

    Disallow: /script/
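
    One way to check the difference is Python's standard-library urllib.robotparser, which treats Disallow values as plain path prefixes. A small sketch, where is_blocked and the example.com URLs are made up purely for illustration:

```python
from urllib.robotparser import RobotFileParser

def is_blocked(rule: str, path: str) -> bool:
    """Hypothetical helper: does a one-rule robots.txt block this path for all bots?"""
    parser = RobotFileParser()
    # Build a tiny robots.txt in memory instead of fetching one.
    parser.parse(["User-agent: *", f"Disallow: {rule}"])
    return not parser.can_fetch("*", "http://example.com" + path)

test_paths = ["/script", "/script/", "/script/run.js", "/script.html", "/scripts/x.js"]
for path in test_paths:
    print(f"{path:16} /script -> {is_blocked('/script', path)}   "
          f"/script/ -> {is_blocked('/script/', path)}")
```

    With Disallow: /script every path in the list is blocked, including /script.html and /scripts/x.js, because they all begin with /script. With Disallow: /script/ only /script/ and what sits beneath it is blocked, so the trailing slash is what limits the rule to that one folder.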
     
    The_FoX, Jul 27, 2007