Couple of questions 1) What is the best way to determine if a certain text that I have is unique, and not indexed by google. I have been searching for parts of the text in quotes to see if it came up, and if it didn't, I would assume that google would see my text as unique. Is this the best way or is there a better way? Is it possible that this may be inaccurate? 2) Does google save the data of pages it crawls but does not index? In other words, is the content of pages of new websites compared to the uniqueness of 'crawled but not indexed' pages as well? I hope that makes sense. Thanks! cg
Well did you write the content yourself? If yes then don't worry about it, there are millions even billions of people out there writing things on the Internet, surely some phrases are going to be identicle/similar.
i read somewhere that google looks for a % of duplicated content on a single page, which makes sense since there is a high chance a phrases is going to be identical to the billions of sites out there.
A few weeks ago I was on position 1! for extremely popular keywords. I looked what happend and found that this happened because the was an error in the feed reader software. This software downloaded feeds and placed them onto my site again and again. So I had hundreds of pages and each of them about 30 times. Yes I had the same content more than 30 times and this for hundreds of pages. Than I deleted the high listed page and the next day the other page with the same content was listed at #1. At the end I delete all the double contend and lost my top listing on Google and also much traffic. But I think on the long run it's better to be serious. But I learned that Google will not penalize you for double content. I think they do it only for home pages and for very competitive content.
What if quote a text and this text is actually a single whole page but I then add my own words or reviews to it. Is that considered duplicate content? In other words, if I quote a whole page with 100 words and then add my own unique review is this considered to be a dupe? If I quote a text that would actually mean I have copied 100% from a website even if I added my own review? What is the exact the exact percentage to determine your sites uniqueness?
Ya'll worry too much... it is a DUPLICATE CONTENT FILTER .... it is NOT a penalty. To actually graduate to a penalty you'd need to satisfy other red flags such as cloaking, lin spamming, KW stuffing AS WELL as a whack of duplicate content Read up on duplicate content
If you are the originator of the content, dont worry as Google also indexes the dates.So if your site was uploaded before the duplicate site, dont worry you wont be penalized.
You can use copyscape, if any thing else google only takes keywords when searching for uniqueness so just change up a bit here and there
You can use copyscape or the your first point, which is searching part of content in google will also help. Anyway, to be 100% sure I write all content my self. That way, I don't have to worry about any duplicate problems.
This is serious problem actually. Just imagine. You've type nice article put into your website, and right after this stupid idiot will copy your article to wikipedia. Guess who will be in SERP faster?