PHP text extraction

Discussion in 'PHP' started by khu84, Nov 30, 2008.

  1. #1
    I need to develop PHP scripts that can extract text from different kinds of documents, such as .pdf, .doc, and .rtf files.

    Can anyone suggest a resource that would help with this?
     
    khu84, Nov 30, 2008
  2. leo.bonnafe

    #2
    An easy way to do this, with one consistent interface for all file formats, is to use phpLiveDocx. It supports the following file formats:

    DOCX - Microsoft Word DOCX Format
    DOC - Microsoft Word DOC Format
    RTF - Rich Text Format File
    PDF - Acrobat Portable Document Format
    TXD - TX Text Control Format
    TXT - ANSI Plain Text

    You can download the PHP 5 components from phpLiveDocx.org.
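
To give a rough idea of what "one interface for several formats" can look like in plain PHP, here is a minimal sketch. It is not phpLiveDocx's API: the function name `extractText` is hypothetical, it assumes PHP 7 or later with the zip extension enabled, and it only handles TXT and DOCX. DOC, RTF, and PDF need a dedicated parser or a service and are deliberately rejected here.

```php
<?php
// Hypothetical helper, NOT part of phpLiveDocx: returns the plain text
// of a file, choosing the extraction method from the file extension.
function extractText(string $path): string
{
    $ext = strtolower(pathinfo($path, PATHINFO_EXTENSION));

    switch ($ext) {
        case 'txt':
            // Plain text: read the file as-is.
            $text = file_get_contents($path);
            if ($text === false) {
                throw new RuntimeException("Cannot read $path");
            }
            return $text;

        case 'docx':
            // A DOCX file is a ZIP archive; the body is in word/document.xml.
            $zip = new ZipArchive();
            if ($zip->open($path) !== true) {
                throw new RuntimeException("Cannot open $path");
            }
            $xml = $zip->getFromName('word/document.xml');
            $zip->close();
            if ($xml === false) {
                throw new RuntimeException("No word/document.xml in $path");
            }
            // End each paragraph with a newline, then drop all XML tags.
            $xml = str_replace('</w:p>', "\n", $xml);
            return html_entity_decode(strip_tags($xml), ENT_QUOTES | ENT_XML1, 'UTF-8');

        default:
            // DOC, RTF, PDF and anything else are out of scope for this sketch.
            throw new InvalidArgumentException("Unsupported format: $ext");
    }
}
```

Calling `extractText('report.docx')` would then return the document body as a string, and callers never need to know which branch ran. Stripping tags this way ignores headers, footers, and tables layout, so treat it as a starting point only.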

    Leo
     
    leo.bonnafe, Feb 5, 2009