robots.txt question

Discussion in 'Search Engine Optimization' started by rgordon83, Jun 3, 2007.

  1. #1
    google webmaster said that the url on my site http://theguitarresource.com/forums/index.php is restricted by my robots.txt file. is that because it ends in .php? should i remove this from the robots, i'de assume my forum is a good page to be indexed...

    User-agent: *
    Disallow: /wp-content/
    Disallow: /wp-admin/
    Disallow: /wp-includes/
    Disallow: /wp-
    Disallow: /feed/
    Disallow: /trackback/
    Disallow: /cgi-bin/


    User-agent: Googlebot
    # disallow all files ending with these extensions
    Disallow: /*.php$
    Disallow: /*.js$
    Disallow: /*.inc$
    Disallow: /*.css$
    Disallow: /*.gz$
    Disallow: /*.wmv$
    Disallow: /*.cgi$
    Disallow: /*.xhtml$

    # allow google image bot to search all images
    User-agent: Googlebot-Image
    Disallow:
    Allow: /*

    # allow adsense bot on entire site
    User-agent: Mediapartners-Google*
    Disallow:
    Allow: /*
     
    rgordon83, Jun 3, 2007 IP
  2. MrX

    MrX Well-Known Member

    Messages:
    1,563
    Likes Received:
    77
    Best Answers:
    0
    Trophy Points:
    140
    #2
    why are u hiding .php file from google? there's nothing for it index.
     
    MrX, Jun 3, 2007 IP
  3. rgordon83

    rgordon83 Peon

    Messages:
    671
    Likes Received:
    22
    Best Answers:
    0
    Trophy Points:
    0
    #3
    also i did a search for my indexed pages and just this came up: theguitarresource.com/wp-admin/edit.php?paged=2

    first of all why is that the only page indexed??? 2nd, shouldn't this be blocked by my robots file?? thanks!
     
    rgordon83, Jun 3, 2007 IP
  4. MrX

    MrX Well-Known Member

    Messages:
    1,563
    Likes Received:
    77
    Best Answers:
    0
    Trophy Points:
    140
    #4
    your robots is completely f'd. just use this:

     
    MrX, Jun 3, 2007 IP
  5. kohinoor

    kohinoor Peon

    Messages:
    169
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Allow *.php , it will help you.
     
    kohinoor, Jun 4, 2007 IP
  6. shanaka

    shanaka Active Member

    Messages:
    409
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    81
    #6
    just block admin and image folders. allow rest of files and folders.
     
    shanaka, Jun 4, 2007 IP
  7. rgordon83

    rgordon83 Peon

    Messages:
    671
    Likes Received:
    22
    Best Answers:
    0
    Trophy Points:
    0
    #7
    is this better?

    User-agent: *
    Disallow: /wp-admin/
    Disallow: /wp-


    # allow google image bot to search all images
    User-agent: Googlebot-Image
    Disallow:
    Allow: /*

    # allow adsense bot on entire site
    User-agent: Mediapartners-Google*
    Disallow:
    Allow: /*
     
    rgordon83, Jun 4, 2007 IP
  8. Jean-Luc

    Jean-Luc Peon

    Messages:
    601
    Likes Received:
    30
    Best Answers:
    0
    Trophy Points:
    0
    #8
    Hi,

    User-agent: *
    Disallow: /wp-
    
    Code (markup):
    is probably good enough!

    It allows all robots everywhere except in areas with a URL starting with /wp-.

    When you give a robot special instructions with a directive like
    User-agent: Googlebot
    you instruct this robot to only look at the instruction following this line and to disregard the instructions following
    User-agent: *

    Hope this helps.

    Jean-Luc
     
    Jean-Luc, Jun 4, 2007 IP