Does anyone know how to convert a PDF to text with the line spacing preserved, so I can load it into MySQL? I would probably be using PHP. I'm not sure whether it would be better to go through XML instead. What I have is a catalog, and almost all of its pages are laid out the same way, so I figure that if the spacing survives the conversion I could split each line by character/space count and easily load the fields into MySQL. Any suggestions?
You can use any of the several PDF-to-text converters to do that. However, check that the spacing in the output is correct before you do anything else with it, since it can vary during conversion.
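
To make the character-count idea from the question concrete, here is a minimal sketch of slicing layout-preserved text into fields by column offset. It assumes the converter kept the columns aligned (for example, `pdftotext` from Poppler/Xpdf has a `-layout` option for this; that tool choice is a suggestion, not something from the original post). The field names, offsets, and sample rows are all hypothetical and would have to be measured on the real catalog output. It is written in Python for brevity; the same slicing ports to PHP with `substr()` and `trim()`.

```python
# Hypothetical column layout: (field name, start offset, end offset).
# These offsets are assumptions; measure them on your own converted text.
COLUMNS = [
    ("sku", 0, 10),
    ("description", 10, 40),
    ("price", 40, 50),
]


def parse_line(line, columns=COLUMNS):
    """Slice one fixed-width line into a dict of whitespace-trimmed fields."""
    return {name: line[start:end].strip() for name, start, end in columns}


def parse_page(text, columns=COLUMNS):
    """Parse every non-blank line of a layout-preserved page of text."""
    return [parse_line(line, columns) for line in text.splitlines() if line.strip()]


def make_row(sku, description, price):
    """Build a made-up sample line that matches the COLUMNS widths above."""
    return "{:<10}{:<30}{:>10}".format(sku, description, price)


# Made-up catalog lines standing in for real converter output.
sample = "\n".join([
    make_row("AB-1001", "Brass hinge, 3 inch", "4.25"),
    "",
    make_row("AB-1002", "Steel bracket", "12.00"),
])

rows = parse_page(sample)
for row in rows:
    print(row)
```

Each resulting dict maps directly onto one table row, so it can be passed to a parameterized `INSERT` statement. If the spacing drifts between pages, which is the risk mentioned above, the offsets will cut fields in the wrong place, so it is worth validating a few parsed rows per page before loading anything.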