Hello, Final goal is to convert/export a pdf content to text format (doc, txt, html etc) online on unix server (PHP, Java, etc). I tried with PHP but after searches and trials I could not find a solution. Is there a possiblity, did anyone made this ? I found out some PHP library PDFlib but this creates PDF from text not the other way arround. Beside this a lot of programs but in win32 Please help, I need just a hint and I will take from there...any example ? Thanks. R.
ok, it is a good solution but also this is tricky...do you know a good class? or example maybe...or just give me link to information...as I tried several solutions and all failed. Not to mention PDF use more encoding types and this makes work harder...someone managed it ?? Thanks!
I did some work on this recently and I couldn't find any php classes that supported all the pdf encoding types. In the end I used php to call xpdf via the command line: http://www.foolabs.com/xpdf/download.html
how did you managed to install xpdf on shared hosting ?? can this be done via cpanel or need ssh access with full rights?
I've developed a class in pure PHP to extract text contents from pdf files ; it is available here : http://www.phpclasses.org/package/9732-PHP-Extract-text-contents-from-PDF-files.html Of course, the pdf specifications allow you to write the same contents in many different ways so if you encounter issues, please don't hesitate to send me your PDF file.