Wordpress Robots.txt

Discussion in 'Search Engine Optimization' started by Calbert, Apr 14, 2007.

  1. #1
    What do you guys and girls think of this robot.txt for a wordpress blog?

    User-agent:  *
    # disallow all files in these directories
    Disallow: /cgi-bin/
    Disallow: /z/j/
    Disallow: /z/c/
    Disallow: /stats/
    Disallow: /dh_
    Disallow: /about/
    Disallow: /contact/
    Disallow: /tag/
    Disallow: /wp-admin/
    Disallow: /wp-includes/
    Disallow: /contact
    Disallow: /manual
    Disallow: /manual/*
    Disallow: /phpmanual/
    Disallow: /category/
     
    User-agent: Googlebot
    # disallow all files ending with these extensions
    Disallow: /*.php$
    Disallow: /*.js$
    Disallow: /*.inc$
    Disallow: /*.css$
    Disallow: /*.gz$
    Disallow: /*.wmv$
    Disallow: /*.cgi$
    Disallow: /*.xhtml$
     
    # disallow all files with ? in url
    Disallow: /*?*
     
    # disable duggmirror
    User-agent: duggmirror
    Disallow: /
     
    # allow google image bot to search all images
    User-agent: Googlebot-Image
    Disallow:
    Allow: /*
     
    # allow adsense bot on entire site
    User-agent: Mediapartners-Google*
    Disallow:
    Allow: /*
    
    Sitemap: http://www.example.com/sitemap.xml
    
    Code (markup):
    Would you use it?
     
    Calbert, Apr 14, 2007 IP
  2. mjesales

    mjesales Peon

    Messages:
    326
    Likes Received:
    16
    Best Answers:
    0
    Trophy Points:
    0
    #2
    mine normally would just disallow

    Disallow: /wp-admin/
    Disallow: /wp-includes/

    but you can include the other stuff if you want - some of it looks pretty specific to your blog.

    as for the tag pages - you know- a lot of people say that they are dupe content... but my tag pages often rank in the search engines very high...
     
    mjesales, Apr 14, 2007 IP
  3. Calbert

    Calbert Well-Known Member

    Messages:
    1,072
    Likes Received:
    38
    Best Answers:
    0
    Trophy Points:
    140
    #3
    mmmmm ok

    thanks

    I might give it a try...
     
    Calbert, Apr 14, 2007 IP
  4. nfd2005

    nfd2005 Well-Known Member

    Messages:
    295
    Likes Received:
    20
    Best Answers:
    0
    Trophy Points:
    130
    #4
    I would just be simple about it:

     
    nfd2005, Apr 14, 2007 IP
  5. archard

    archard Peon

    Messages:
    221
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Why not disallow wp-content as well?
     
    archard, Apr 14, 2007 IP
  6. nfd2005

    nfd2005 Well-Known Member

    Messages:
    295
    Likes Received:
    20
    Best Answers:
    0
    Trophy Points:
    130
    #6
    I would definately disallow wp-content.

    by using:

    Disallow: /wp-

    this disallows everything with a wp- prefix.

    It works with Google's robots.txt analysis. (Webmaster Tools)
     
    nfd2005, Apr 14, 2007 IP
  7. dsendecki

    dsendecki Peon

    Messages:
    212
    Likes Received:
    15
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Anybody see anything wrong with this?

     
    dsendecki, Apr 15, 2007 IP