Hi Folks, i have just installed a WordPress theme in mysite.com/blog and I have created a post in put it live, the content I got for this post was cut and pasted from a pdf that resides on the root of mysite.com, Q1, could this be considered duplicate content? Q2, I have a sister site that I would also like to use this content (from the pdf) is copying content from one site to another (word for word) ok or frowned upon by google et al? thanks T
While technically duplicate content, you won't experience any negative effects from what you have already done. However, I wouldn't recommend adding the content to your sister site, too. That is duplicate content- and even if you aren't penalised for it, you won't see any positive change in your rankings. WebDev
No it is not, All the above comments shows that how talented they are in their services. This term is called content syndication. The real definition of Duplicate content is "same content on the same domain with multi techniques like text spinning"
Q1. This is not duplicate as the contents of the pdf are not parsed for content with the website. Q2. The Google definition of duplicate content is actually "Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar". However even where duplication exists it does not immediately incur a penalty as Google makes the distinction between deceptive and non-deceptive practices. There are means, e.g. canonicalisation, where you can have duplicated content but explicitly declare it to Google using the rel="canonical" link element and thus not incur a penalty. If content is syndicated then it should link back to the original content, and the syndicated material should use the noindex meta tag to prevent search engines from indexing the syndicated version of the content.
.pdf? No. If there's another form of it, like .docx file for example, that might get picked up as duplicate (hardly anyone does that though).
If the content is already considered by the google and then you are copying that content it will be a pure duplicate.