I was reading at another seo forum that your site can get a dupe content penalty if the http://domain.com, http://www.domain.com, and the http://www.domain.com/index.html page all show up with the same content, and they go on to say that this is easily fixed with the .htaccess file, this stuff sounds fishy to me, but does anyone know for sure? Thanks, RonMo
It can happen. I think (I'm not a SE guru) search engines treat www. domain.com and domain.com as two different sites. What you need to do is include these lines in you .htacess file. This is a 301 redirect. This redirects domain.com to www. domain.com. It tells the Search Engine that domain.com has permenantly moved to www. domain.com. I don't believe there would be a problem with the www. domain.com and www. domain.com/index.html. Skinny
I read that same thing so I checked a few sites of mine and found one that was counted as a copy of the other because one was listed as www. and the other with out www same site. Here is a good tool to check with. http://www.copyscape.com/
i have a problem with this, I have about 4 websites that are pretty much the same, but for different regions. for example, www. domain-uk.com, www. domain-usa.com each of which has the same content (articles, homepage text etc) except for information stored in a directory. Is there any way to get around this? EDIT: also, reading other posts on here say that one solution is to have one of them not indexed, but i want all of them indexed (about 20 in total).
I think Google has been working hard to stop this from happening.... I'm pretty sure the Google BigDaddy updates have alot to do with this specifically...... It's called, canolization or something like that (somebody correct me)
do you mean it only indexes the 1 site, or that 1 page? although i suppose if it is only indexing one version of every article in effect it is only indexing one site...........
usually when google sees 2 exact pages they will give 1 the benefit... but if you are doing complete exact sites... that is just asking for trouble. You can go the google route and only get the primary indexed then redirect based on GEO IP... of course that is called cloaking but its what google does with there search
umm... what is geo ip?, and is it something that isn't recommended? (although i dont really have too many options open to me right now lol) thanks for your help!
Good call on using copyscape. I just found a competitor using the introduction from my wife's DJ company site as his own introduction. His site was "built and maintained by" http://www.gillenwatermotorsports.com/design.html His URL is http://www.gillenwatermotorsports.com/rick.html What a slimey bastage...
I wonder if the DJ is even aware his designer stole content. Maybe calling out cases like this will help discourage it. I found a similar case of content borrowing by using copyscape but the semi-plagorist had been fairly ingenious in replacing a few words throughout the article.
I talked to the DJ yesterday, and he was not aware what his web designer buddy did. He then started talking about how much he was paying to have his domain name redirect to a sub-page on the guy's site. I took pity on him and pointed him to some hosting packages that would fit his needs for only $30 a year. Its situations like that when I love what I do.
I've found a number of instances where entire pages from my site were posted on other company's websites, and have sent "cease and desist" emails. The letters have worked in all cases, so far. Some of the sites are so crappy, however, it's not worth the time to confront them. In nearly all cases, it has been some sleezbag developer stealing my stuff and impressing his client/boss with his great content.