Situation: you want to start a new site, but you have no content and want to fill it quickly. You search for e-books (PDF or DjVu) related to your theme, cut chapters from these books, make them look like articles, and post them to your site. As a result you get a lot of content. Questions arise: 1.) Does it break copyright? 2.) Will Google or Yahoo ban you for that? 3.) If you rewrite these chapters, does it still break copyright? I would be grateful for your replies.
Questions arise: 1.) Does it break copyright? Yes, it's copyright infringement. 2.) Will Google or Yahoo ban you for that? It's duplicate content, so Google will likely ignore it, and if a DMCA takedown notice is filed, Google/Yahoo can remove your pages from their index, your host may close your account, and you may get sued. 3.) If you rewrite these chapters, does it still break copyright? Derivative works made without the author's permission are still copyright infringement.
Does that mean Google or Yahoo somehow know about content stored in e-books? ... I do not really want to steal someone's content, but if I publish a paper book and someone scans it and makes a PDF or DjVu from it, how will Google know about that? ... I think if someone somehow pulls content from a PDF or DjVu, it will look unique to Google ... that's my opinion ...
Google can read PDFs; if it couldn't, their text wouldn't count as content at all. Google can detect duplicate content regardless of whether it is stored in a PDF or as HTML text.
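To illustrate why the file format doesn't matter once the text has been extracted, here is a minimal sketch of one common near-duplicate check: word shingles compared with Jaccard similarity. This is not Google's actual algorithm (which isn't public); the function names `shingles` and `jaccard`, the shingle size `k=3`, and the sample strings are all just assumptions for the example.

```python
import re

def shingles(text, k=3):
    """Return the set of k-word shingles (overlapping word windows) in text."""
    # Lowercase and keep only alphanumeric runs, so punctuation, line breaks
    # and capitalisation differences between a PDF and a web page vanish.
    words = re.findall(r"[a-z0-9]+", text.lower())
    if len(words) < k:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}

def jaccard(a, b, k=3):
    """Jaccard similarity of the two texts' shingle sets, from 0.0 to 1.0."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa or not sb:
        return 0.0  # nothing to compare
    return len(sa & sb) / len(sa | sb)

# Hypothetical sample: the same sentence as extracted from a PDF
# and as pasted into a web page with different formatting.
pdf_text = "Chapter 1:  The quick brown fox\njumps over the lazy dog."
html_text = "chapter 1: The Quick Brown Fox jumps over the lazy dog"

print(jaccard(pdf_text, html_text))
print(jaccard(pdf_text, "A completely different sentence about gardening tools."))
```

Two texts with the same word sequence score 1.0 no matter how they were formatted or where they came from, and a lightly reworded copy still shares most of its shingles with the original, so a chapter lifted from an e-book won't look unique just because it now lives in HTML.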