help regarding robot.txt for my forum

Discussion in 'Websites' started by pankajdeoria, Jan 31, 2010.

  1. #1
    Dear Friends,
    I am having a forum website and using SMF software. Recently, I came to know that each reply of each thread is indexed in google seperatly which may be considered as a spamming by google. Also, I read that if you will create a robot.txt file for your forum then google will update the new response of each thread under one search result and that will not be considered as spamming.

    Please tell me and if possible , give me the complete code for robot.txt file that will help me to solve the above problem.

    My Forum Website is : http://forum.WeDost.Com

    Thank You.
     
    Last edited: Jan 31, 2010
    pankajdeoria, Jan 31, 2010 IP
  2. siteswanted

    siteswanted Peon

    Messages:
    45
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #2
    can you show an example?
     
    siteswanted, Feb 1, 2010 IP
  3. Lex350

    Lex350 Notable Member

    Messages:
    1,889
    Likes Received:
    31
    Best Answers:
    0
    Trophy Points:
    215
    #3
    Maybe have a look at DP robot file. I'm sure they know what they are doing.
     
    Lex350, Feb 1, 2010 IP
  4. pankajdeoria

    pankajdeoria Banned

    Messages:
    314
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    125
    #4
    Where to see the DP Robot.txt file?

    Thank you.
     
    pankajdeoria, Feb 1, 2010 IP
  5. coolseo36

    coolseo36 Well-Known Member

    Messages:
    979
    Likes Received:
    92
    Best Answers:
    0
    Trophy Points:
    140
    #5
    Its here,

    http://forums.digitalpoint.com/robots.txt

    Take a look at,

     
    coolseo36, Feb 2, 2010 IP
  6. tonsblogger

    tonsblogger Active Member

    Messages:
    330
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    53
    #6
    Nice, post I need this too, I'm going to build a forum site.
     
    tonsblogger, Feb 2, 2010 IP
  7. pankajdeoria

    pankajdeoria Banned

    Messages:
    314
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    125
    #7
    Thank you for this.
    One more question,

    How to stop same thread response being displayed twice in google search engine?

    Thanks.

     
    pankajdeoria, Feb 2, 2010 IP
  8. ccchong

    ccchong Peon

    Messages:
    50
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #8
    thread /google/how-to-get-indexed-in-google-search-engine-quickly/
    replys /google/how-to-get-indexed-in-google-search-engine-quickly/msg3/#msg3
    /google/how-to-get-indexed-in-google-search-engine-quickly/msg4/#msg4

    try add this to robots.txt?
    Disallow: /*/msg


    or u can set rel=nofollow to reply link?
     
    ccchong, Feb 2, 2010 IP
    dexterb007 likes this.
  9. pankajdeoria

    pankajdeoria Banned

    Messages:
    314
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    125
    #9
    Thank you .. I have created my robot.txt file n upload to my site.

    have a look and tell me If I need any addition or removal?

    http://forum.wedost.com/robots.txt

    Thank you.
     
    pankajdeoria, Feb 2, 2010 IP
  10. ccchong

    ccchong Peon

    Messages:
    50
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #10
    A little change:
    
    User-agent: *
    Disallow: /index.php?action=search
    Disallow: /index.php?action=calendar
    Disallow: /index.php?action=login
    Disallow: /index.php?action=register
    Disallow: /index.php?action=profile
    Disallow: /index.php?action=stats
    Disallow: /index.php?action=arcade
    Disallow: /index.php?action=printpage
    Disallow: /index.php?PHPSESSID=
    Disallow: /index.php?*rss
    Disallow: /index.php?*wap
    Disallow: /index.php?*wap2
    Disallow: /index.php?*imode
    Disallow: /index.php?action=search2
    Disallow: /index.php?action=pm
    Disallow: /index.php?action=activate
    Disallow: /index.php?action=reminder
    Disallow: /*/?action=unread 
    Disallow: /*/?action=sendtopic
    Disallow: /*/msg 
    Disallow: /Themes/
    Disallow: /Sources/
    Disallow: /Smileys/
    Disallow: /Packages/
    Disallow: /header/
    Disallow: /images/
    Disallow: /avatars/
    Disallow: /attachments/
    Disallow: /gallery/*jpeg
    Disallow: /?action=printpage
    Disallow: /?prev_next=prev
    Disallow: /?prev_next=next
    Disallow: /*sort
    Disallow: /?action=unread 
    
    Code (markup):
     
    ccchong, Feb 2, 2010 IP
  11. pankajdeoria

    pankajdeoria Banned

    Messages:
    314
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    125
    #11
    What is the difference between this and my robotx.txt . I think you have just removed * at the last.

    My question: Is there any effect of * at the end?

    Thanks for your help.

     
    pankajdeoria, Apr 3, 2010 IP
  12. eadler25

    eadler25 Peon

    Messages:
    217
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #12
    * means all
     
    eadler25, Apr 3, 2010 IP
  13. pankajdeoria

    pankajdeoria Banned

    Messages:
    314
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    125
    #13
    Then Whether I should add * or remove it?

     
    pankajdeoria, Apr 3, 2010 IP
  14. eadler25

    eadler25 Peon

    Messages:
    217
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #14
    if you want hide all jpeg images from image folder then you right

    Disallow: /image/*jpeg

    so google not index any of ur images from that folder
     
    eadler25, Apr 3, 2010 IP
  15. pankajdeoria

    pankajdeoria Banned

    Messages:
    314
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    125
    #15
    Thanks I got the change. Thanks for listing .. I was missing ' / '.

     
    pankajdeoria, Apr 3, 2010 IP
  16. eadler25

    eadler25 Peon

    Messages:
    217
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #16
    you can check ur robot.txt with google webmaster tools
     
    eadler25, Apr 3, 2010 IP
  17. Article Master

    Article Master Peon

    Messages:
    37
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #17
    Nice Work. Well Done. All The Best For Future.
     
    Article Master, Oct 26, 2010 IP