PDF and DjVu books as a source of unique content

Discussion in 'Legal Issues' started by Uzi Levitovitch, Dec 30, 2008.

  1. #1
    Situation: you want to start a new site but you have no content and you want to fill it with content quickly. You search for e-books (PDF or DjVu) related to your theme, cut chapters from these books, make them look like articles and post them to your site. As a result you got a lot of content.

    Questions arises:
    1.) Does it break copyright?
    2.) Does google or yahoo ban your for that?
    3.) If you rewrite these chapters - does it still breaks copyright?

    I will be very pleased for your replies :)
     
    Uzi Levitovitch, Dec 30, 2008 IP
  2. mjewel

    mjewel Prominent Member

    Messages:
    6,693
    Likes Received:
    514
    Best Answers:
    0
    Trophy Points:
    360
    #2
    Questions arises:
    1.) Does it break copyright? Yes, it's copyright infringement.
    2.) Does google or yahoo ban your for that? It's duplicate content so google will likely ignore it, and if a DMCA is filed, google/yahoo will remove your site from their index, your host may close your account, and you may get sued.
    3.) If you rewrite these chapters - does it still breaks copyright? Derivative copies are still copyright infringement.
     
    mjewel, Dec 30, 2008 IP
  3. Uzi Levitovitch

    Uzi Levitovitch Active Member

    Messages:
    117
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    53
    #3
    Does it mean that google or yahoo some how know about content stored in e-books?...I do not really want to steal someones content, but if I make a paper book, some one will scan it, make a PDF or DjVu from it - how will google know about that? .... I think i some one some how pull content from PDF or DjVu it will look unique to google...thats my opinion :)...
     
    Uzi Levitovitch, Dec 30, 2008 IP
  4. mjewel

    mjewel Prominent Member

    Messages:
    6,693
    Likes Received:
    514
    Best Answers:
    0
    Trophy Points:
    360
    #4
    Google can read pdf's - if it couldn't, it wouldn't count as content. Google can detect duplicate content regardless of if it is stored in a pdf or html text.
     
    mjewel, Dec 30, 2008 IP