I am looking for a robots.txt guru who knows their way around WordPress. I want to make sure that I am not excluding anything that would prevent internal PageRank from reaching my post pages, and that I am avoiding duplicate content. The site I need help with is at http://www.madmouseblog.com. Any suggestions would be helpful. Thanks,
Are you looking for a robot.txt guru? It should be robots.txt. It is quite easy to write and understand. I may be able to help you. What is the problem?
User-agent: Googlebot
# Disallow all directories and files within
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /wp-includes/
# Disallow all files ending with these extensions
Disallow: /*.php$
Disallow: /*.js$
Disallow: /*.inc$
Disallow: /*.css$
# Disallow parsing individual post feeds, categories and trackbacks..
Disallow: */trackback/
Disallow: */feed/
Disallow: /category/*
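For anyone who wants to see what the wildcard rules in the Googlebot example above would catch, here is a rough Python sketch. It assumes Google-style pattern semantics (`*` matches any run of characters, a trailing `$` anchors the end of the URL); other crawlers may not honor these. The helper names `pattern_to_regex` and `is_blocked` and the sample paths are made up for illustration, and the two patterns that start with `*` instead of `/` are left out because it is not clear every crawler or validator accepts them.

```python
import re

# Wildcard patterns copied from the Googlebot example; the two rules that
# begin with "*" rather than "/" are deliberately left out of this sketch.
PATTERNS = ["/*.php$", "/*.js$", "/*.inc$", "/*.css$", "/category/*"]


def pattern_to_regex(pattern):
    """Translate one Disallow pattern into a compiled regex.

    Assumption: "*" matches any sequence of characters and a trailing "$"
    anchors the match at the end of the URL (Google-style semantics).
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_blocked(path, patterns):
    """Return True if any pattern matches the path from its start."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    # Hypothetical paths, not taken from the real site.
    for path in ["/index.php", "/index.php?p=12",
                 "/category/news/", "/2008/01/some-post/"]:
        print(path, "blocked" if is_blocked(path, PATTERNS) else "allowed")
```

Under these assumptions, `/*.php$` blocks a bare `/index.php` but not `/index.php?p=12`, because the `$` requires the URL to end in `.php`.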
Thanks for the information, but your example won't validate. My robots.txt is a little more advanced, so it doesn't help me much. What I currently have may be a little overkill, and I would like to sort it out a bit.
User-agent: *
Disallow: /wp-content/
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /wp-
Disallow: /feed/
Disallow: /trackback/
Disallow: /cgi-bin/

User-agent: Mediapartners-Google*
Disallow:

# BEGIN XML-SITEMAP-PLUGIN
Sitemap: http://www.yoursite.com/sitemap.xml.gz
# END XML-SITEMAP-PLUGIN

This is my valid and simple robots.txt for my WordPress blog.
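One way to check whether a simple file like this keeps crawlers away from post pages is to run a few URLs through Python's standard `urllib.robotparser`, which does plain prefix matching and no wildcards. This is only a sketch: it covers just the `User-agent: *` group, the helper name `check_paths` is made up, and the sample paths are hypothetical guesses at WordPress-style URLs, not taken from the real site.

```python
from urllib.robotparser import RobotFileParser

# Only the "User-agent: *" group from the simple robots.txt above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-content/
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /wp-
Disallow: /feed/
Disallow: /trackback/
Disallow: /cgi-bin/
"""

# Hypothetical paths chosen for illustration.
SAMPLE_PATHS = [
    "/2008/01/some-post/",
    "/2008/01/some-post/feed/",
    "/feed/",
    "/wp-admin/options.php",
    "/wp-login.php",
]


def check_paths(robots_txt, paths, agent="*"):
    """Map each path to True (crawlable) or False (disallowed)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {path: parser.can_fetch(agent, path) for path in paths}


if __name__ == "__main__":
    for path, allowed in check_paths(ROBOTS_TXT, SAMPLE_PATHS).items():
        print(f"{path:30} {'allowed' if allowed else 'blocked'}")
```

With prefix matching, `Disallow: /feed/` blocks only the site-wide feed, so a per-post feed such as `/2008/01/some-post/feed/` stays crawlable, while `Disallow: /wp-` also covers anything starting with `/wp-`, such as `/wp-login.php`.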
Thanks for your input. This is a very simple one as well. While it works, I was hoping for something more. I am looking for a guru, and I figured that he would go to my site, view my robots.txt file, and analyze it, telling me if there were any problems with my current one. Moderators - Please close thread.