1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

Is My Robot.txt Okay ???

Discussion in 'robots.txt' started by Algore, Oct 20, 2008.

  1. #1
    This is what my robot.txt is setup can anyone say is it okay or i need to modify few things here

    Here is my robot.txt

    User-agent: *
    Disallow: /wp-content/
    Disallow: /wp-admin/
    Disallow: /wp-includes/
    Disallow: /wp-
    Disallow: /feed/
    Disallow: /trackback/
    Disallow: /cgi-bin/
    User-agent: Googlebot
    Disallow: /*.php$
    Disallow: /*.js$
    Disallow: /*.cgi$
    Disallow: /*.xhtml$
    Disallow: /*.php*
    Disallow: */trackback*
    Disallow: /*?*
    Disallow: /z/
    Disallow: /wp-*
    Disallow: /*.inc$
    Disallow: /*.css$
    Disallow: /*.txt$


    If i have made any msitake please expert correct me i will rectify as from past few days i am not getting traffic as much as i got the last few weeks

    Regards
     
    Algore, Oct 20, 2008 IP
  2. Adpubster

    Adpubster Peon

    Messages:
    4,017
    Likes Received:
    153
    Best Answers:
    0
    Trophy Points:
    0
    #2
    You can verify it in Googles webmaster tools. Far as I know, though, wildcards are NOT allowed in robots.txt files. Someone correct me if I'm wrong...I'd be elated if it were valid!
     
    Adpubster, Oct 20, 2008 IP
  3. Algore

    Algore Peon

    Messages:
    963
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #3
    Refering to Googles webmaster tools will solve this problem ( if any mistakes )
     
    Algore, Oct 20, 2008 IP
  4. Adpubster

    Adpubster Peon

    Messages:
    4,017
    Likes Received:
    153
    Best Answers:
    0
    Trophy Points:
    0
    #4
    Not quite sure I follow what you're asking here, but to clarify, if you go into Google's Webmaster Tools, there is a place where you can submit your robots.txt and it will analyze it and inform you of any problems.
     
    Adpubster, Oct 20, 2008 IP
  5. 3312easy

    3312easy Peon

    Messages:
    21
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
  6. Algore

    Algore Peon

    Messages:
    963
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #6
    Algore, Oct 21, 2008 IP
  7. seomadboy

    seomadboy Banned

    Messages:
    29
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Seems to be compiled right!
     
    seomadboy, Oct 21, 2008 IP
  8. Algore

    Algore Peon

    Messages:
    963
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #8
    Not quite sure

    I never had the robot.txt file suddenly some where i saw in some forums to protect some directories and thats why i have created the robot.txt file

    Whats your advice shall robot.txt be there or not
     
    Algore, Oct 21, 2008 IP
  9. Adpubster

    Adpubster Peon

    Messages:
    4,017
    Likes Received:
    153
    Best Answers:
    0
    Trophy Points:
    0
    #9
    Just realize that:

    a) The wildcard is "nonstandard" in the robots.txt (not all robots will use it as you intend).
    b) Not all robots even honor the robots.txt so it's no guarantee that robots won't crawl those places excluded by the file.
     
    Adpubster, Oct 22, 2008 IP
  10. Algore

    Algore Peon

    Messages:
    963
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #10
    No man ... see i want all the robot , crawlers and spiders of all search engines to completely go thru my site ...

    Is it good to have the robot.txt file if no please tell me the goods and bads i will remove it
     
    Algore, Oct 23, 2008 IP
  11. Adpubster

    Adpubster Peon

    Messages:
    4,017
    Likes Received:
    153
    Best Answers:
    0
    Trophy Points:
    0
    #11
    You indicated above that you wanted to "protect" some directories/forums. That's not the same as wanting all search engines to go completely through your site, though.

    If you don't want anyone in certain areas, a robots.txt is not the way to do it, since not all robots are well-behaved (and putting hidden or not-allowed places in there just informs them where it is) The best thing to do is to put restrictions (logins etc) on those areas.
     
    Adpubster, Oct 23, 2008 IP
  12. Algore

    Algore Peon

    Messages:
    963
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #12
    Can you please tell any general procedure how to block robots/ spiders / crawlers entering into certain folders etc

    Hope you get it ... i want exactly that
     
    Algore, Oct 23, 2008 IP
  13. Adpubster

    Adpubster Peon

    Messages:
    4,017
    Likes Received:
    153
    Best Answers:
    0
    Trophy Points:
    0
    #13
    It's pretty much impossible to deny all robots since you don't know what a robot is (not you specifically, I mean since there is no comprehensive list of IPs to define robots) so you would have to password protect (.htaccess) the folders.
     
    Adpubster, Oct 24, 2008 IP
  14. shinyk

    shinyk Peon

    Messages:
    13
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #14
    I've heard of cases where the .htaccess folder protect actually ( which hold passwords ) had bee hacked or reverse MD5'd, posing a security risk if that password happens to be important.
     
    shinyk, Oct 24, 2008 IP
  15. Adpubster

    Adpubster Peon

    Messages:
    4,017
    Likes Received:
    153
    Best Answers:
    0
    Trophy Points:
    0
    #15
    Make sure to restrict access to .htaccess in the httpd.conf (or similar file)
     
    Adpubster, Oct 27, 2008 IP
  16. Algore

    Algore Peon

    Messages:
    963
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #16
    Algore, Oct 27, 2008 IP
  17. shipit

    shipit Peon

    Messages:
    64
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #17
    You can also use Google Webmaster tools to create robot.txt.
     
    shipit, Oct 31, 2008 IP
  18. sugihfulus

    sugihfulus Peon

    Messages:
    16
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #18
    hmm.. I don't know.
    is important robottxt for our adsense ?
     
    sugihfulus, Feb 4, 2009 IP
  19. Adpubster

    Adpubster Peon

    Messages:
    4,017
    Likes Received:
    153
    Best Answers:
    0
    Trophy Points:
    0
    #19
    It's more important for your site rather than specifically for the adsense on it.
     
    Adpubster, Feb 4, 2009 IP
  20. Algore

    Algore Peon

    Messages:
    963
    Likes Received:
    7
    Best Answers:
    0
    Trophy Points:
    0
    #20
    How much important it it for adsense point of view
     
    Algore, Feb 5, 2009 IP