robot.txt wildcards are they allowed?

Discussion in 'robots.txt' started by mnymkr, May 4, 2008.

  1. #1
    I am trying to use a robot.txt to prevent duplicate content with some bad forum URLS

    These three URLS all point to the same page


    http://www.mychemistrytutor.com/for...ormation-is-needed/msg5342/?topicseen#msg5342

    http://www.mychemistrytutor.com/forums/general-chemistry/some-information-is-needed/

    http://www.mychemistrytutor.com/forums/general-chemistry/some-information-is-needed/msg5342/

    I need to create a robot.txt that allows for dynamic generation of the topic title name and msg number but blocks everything after that
     
    mnymkr, May 4, 2008 IP
  2. manish.chauhan

    manish.chauhan Well-Known Member

    Messages:
    1,682
    Likes Received:
    35
    Best Answers:
    0
    Trophy Points:
    110
    #2
    No you can't do it as robots.txt is not auto updated. You need to add some instructions every time when you add some new urls in your site...:)
     
    manish.chauhan, May 5, 2008 IP
  3. Trusted Writer

    Trusted Writer Banned

    Messages:
    1,370
    Likes Received:
    52
    Best Answers:
    0
    Trophy Points:
    160
    #3
    Trusted Writer, May 13, 2008 IP