robots.txt

Discussion in 'Search Engine Optimization' started by CommandTree1985, Apr 7, 2010.

  1. #1
    Hi,

    We have a bit of a problem with the indexing of our sites.

    We have footer links for things like 'send to friend', 'print page', etc., and they are getting indexed by Google, e.g.

    http://www.website.co.uk/about_us/index:sendtofriend.html

    When you click on these links the page is usually blank and there's no link back to the home page.

    I have implemented the following robots.txt file to prevent any new sites from being affected:

    User Agent: *

    Disallow: */index:sendtofriend*

    etc

    However, I recently read that Googlebot will ignore the wildcard and that the User Agent line needs to name Googlebot specifically. Is this the case?

    That sounded like BS to me, but new sites still seem to have the bad pages indexed despite the new robots.txt.
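
    For comparison, here is a sketch of how a rule like this is usually written. It assumes the standard hyphenated User-agent field name, a path pattern that starts with a slash, and a crawler that supports the * wildcard in paths (Google documents this for Googlebot; not every crawler does). The sendtofriend pattern is taken from the example URL above.

    ```
    # Applies to every crawler that has no more specific group of its own
    User-agent: *
    # Block any URL whose path contains index:sendtofriend
    Disallow: /*index:sendtofriend
    ```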

    Also, what's the best way to fix sites which are already affected?

    Kind regards,
    Anthony
     
    CommandTree1985, Apr 7, 2010
  2. smsinhindi

    #2
    smsinhindi, Apr 7, 2010