Free Ringtones - Free Ringtone - Halifax - Flights - Mortgage

PDA

View Full Version : robots.txt for phpBB forum


Googles76
Dec 3rd 2005, 7:45 am
If I'm running the latest phpBB forum, which parts should I have included in robots.txt so they don't get spidered? I don't want all that junk being in the index, just the useful posts. If anyone has an actual example of a robots.txt with that, it would be great:)

mdvaldosta
Dec 3rd 2005, 8:02 am
Here's one of mine:

User-agent: *
Disallow: /forums/admin/
Disallow: /forums/images/
Disallow: /forums/includes/
Disallow: /forums/language/
Disallow: /forums/templates/
Disallow: /forums/common.php
Disallow: /forums/config.php
Disallow: /forums/groupcp.php
Disallow: /forums/memberlist.php
Disallow: /forums/modcp.php
Disallow: /forums/posting.php
Disallow: /forums/profile.php
Disallow: /forums/privmsg.php
Disallow: /forums/viewonline.php
Disallow: /forums/search.php
Disallow: /forums/faq.php

Googles76
Dec 3rd 2005, 8:13 am
Here's one of mine:

User-agent: *
Disallow: /forums/admin/
Disallow: /forums/images/
Disallow: /forums/includes/
Disallow: /forums/language/
Disallow: /forums/templates/
Disallow: /forums/common.php
Disallow: /forums/config.php
Disallow: /forums/groupcp.php
Disallow: /forums/memberlist.php
Disallow: /forums/modcp.php
Disallow: /forums/posting.php
Disallow: /forums/profile.php
Disallow: /forums/privmsg.php
Disallow: /forums/viewonline.php
Disallow: /forums/search.php
Disallow: /forums/faq.php

Thanks! thats what I was looking for:cool:

vic@VKIStudios
Dec 6th 2005, 12:30 pm
robots.txt is great for controlling the indexing of bots, for your site, if you have duplicate content elsewhere on the web. Duplicate content is bad, bad bad!

minstrel
Dec 11th 2005, 11:35 am
Here's what mine looks like:

Disallow: /phpbb/admin/
Disallow: /phpbb/db/
Disallow: /phpbb/images/
Disallow: /phpbb/includes/
Disallow: /phpbb/language/
Disallow: /phpbb/profile.php
Disallow: /phpbb/groupcp.php
Disallow: /phpbb/memberlist.php
Disallow: /phpbb/login.php
Disallow: /phpbb/modcp.php
Disallow: /phpbb/posting.php
Disallow: /phpbb/privmsg.php
Disallow: /phpbb/search.php

unitechy
Dec 14th 2005, 1:57 am
should i just put what minstrel said on robots.txt file????

minstrel
Dec 14th 2005, 7:49 am
Change the "/phpbb/" part to whatever your forum folder is named, of course.

unitechy
Dec 14th 2005, 6:44 pm
yup thanku very much..

Jac0bsnn
Jan 18th 2006, 6:37 am
Mine is more compact:

User-agent: *
Disallow: /forum/admin/
Disallow: /forum/includes/
Disallow: /forum/common.php
Disallow: /forum/config.php
Disallow: /forum/groupcp.php
Disallow: /forum/memberlist.php
Disallow: /forum/modcp.php
Disallow: /forum/profile.php

Martindale
Apr 30th 2007, 2:52 pm
Sorry for the ancient bump, but what how would I go about blocking the "viewtopic.php?p=<postnumber>" links?

Mainly to avoid duplicate content penalties.

minstrel
Apr 30th 2007, 5:52 pm
There is no duplicate content penalty. There is a duplicate content filter which will index one page and ignore other pages with identical content. Unless Google indexes the "wrong" page, you don't need to worry.

How does "viewtopic.php?p=<postnumber>" create a duplicate content issue?

Martindale
May 2nd 2007, 6:20 am
A topic is usually "viewtopic.php?t=<threadID>" - however, each individual post on that page, the first page of the thread, can be linked to using "viewtopic.php?p=<postID>#<postID>". Google of course ignores the #postID, referring only to a specific area of the page. However, with probably 20 posts per page, there's no content difference (at ALL) on the post pages versus the thread page itself.

minstrel
May 2nd 2007, 6:27 am
I think you're worrying way too much about the duplicate content issue here. Remember, it's a filter, not a penalty.

You don't really want to eliminate the individual post URLs, since other sites will legitmately want to link to either a specific thread OR to a specific post.

And you don't NEED to eliminate individual post URLs - all links will take you to the thread one way or another, which is after all the whole idea.

Martindale
May 8th 2007, 8:10 pm
Wow, thanks for the info. That helps a ton. :)

trichnosis
May 11th 2007, 6:20 pm
this looks good but i think you must add $ to end of the file names

Disallow: /phpbb/admin/
Disallow: /phpbb/db/
Disallow: /phpbb/images/
Disallow: /phpbb/includes/
Disallow: /phpbb/language/
Disallow: /phpbb/profile.php$
Disallow: /phpbb/groupcp.php$
Disallow: /phpbb/memberlist.php$
Disallow: /phpbb/login.php$
Disallow: /phpbb/modcp.php$
Disallow: /phpbb/posting.php$
Disallow: /phpbb/privmsg.php$
Disallow: /phpbb/search.php$