Hi friends, While checking my site's internal pages for duplicate content through www.copyscape.com, I get the following error: "The document could not be retrieved - error code 404. Please check the URL and try again." The pages open fine in a browser, with no redirection, "forbidden" response, etc. Can anybody help me?
404 means there is no page at that address. Make sure you typed the URLs correctly. I don't know of any other reason for a 404 to show up.
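If the URLs are definitely right, one thing that might be worth ruling out is whether the server answers differently depending on who is asking. Below is a rough Python sketch that requests the same URL with two different User-Agent headers and prints the status code for each. The URL and both User-Agent strings are placeholders made up for illustration; in particular, the bot string is not Copyscape's real one, and this is only a guess at a possible cause, not a confirmed explanation.

```python
import urllib.error
import urllib.request


def fetch_status(url, user_agent, timeout=10):
    """Return the HTTP status code the server sends for this URL and User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx responses; the status code is on the exception.
        return err.code


def compare_agents(url, agents):
    """Map each label in `agents` to the status code returned for its User-Agent."""
    return {label: fetch_status(url, ua) for label, ua in agents.items()}


if __name__ == "__main__":
    # Placeholder values: swap in the page that fails in the checker.
    page = "https://example.com/"
    agents = {
        "browser-like": "Mozilla/5.0 (X11; Linux x86_64)",
        "generic-bot": "example-bot/1.0",  # hypothetical, not a real crawler's string
    }
    for label, status in compare_agents(page, agents).items():
        print(label, status)
```

If both requests come back with 200, the User-Agent is probably not the issue and the problem lies elsewhere. If the bot-like request gets a 404 while the browser-like one gets 200, the server or a plugin may be treating automated visitors differently.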
Thanks for your feedback. Nowadays, some people post in my forums by copying parts of content from other sites, so my site is running into a duplicate content issue. Should I remove those posts or put them under a 'nofollow' tag?
If your site is based on member contributions, you might end up facing angry posters if you start removing posts or articles. Besides, you can't really monitor all the content on your site. Even if you can do that now, it will hardly be possible once your site grows big. I'd concentrate on building my site into a useful resource for online readers rather than keep filtering "duplicate content". Imagine that the site which has the "original" post or article isn't known to everyone. That means some web surfers will get to read it on your site, and I see no harm in that. I'd rather fight the principle of duplicate content than follow it. Take the news portals, for instance: many of them post the same content, simply because their purpose is to spread the information, so why keep it in a single place which isn't accessible to all?
Thanks Clive, Thank you very much for your valued opinion. My main concern was that some sites get banned by SEs because of this. Actually, I don't know how the major SEs treat the 'duplicate content' issue. It feels really nice to be here. Julie