PDF to text: Poppler-utils versus convert/ImageMagick?
Hi,
I would like to ask which tool is most recommended for converting a PDF to text. I have used both convert file.pdf file.txt (ImageMagick) and pdftotext (from poppler-utils, http://packages.debian.org/fr/sid/poppler-utils), but I still do not see any great benefit of one over the other. Which one works better, or which would you advise? Regards
It really depends on what the PDF is.
If it is just a standard plain-text document saved as a PDF, then pdftotext.
If it is a formatted MS docx saved as a PDF, then pdf2html, or import it into LibreOffice (my preferred option).
If it is a "fax" TIFF or JPG scan saved as a PDF, then open it in GIMP, save it as PPM, and run gocr or ocrad on that.
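To make the first and last cases concrete, here is a rough Python sketch, not a definitive recipe. It tries pdftotext first and treats a nearly empty result as a sign that the PDF is a scan that needs OCR. The function names, the 20-character threshold, and the 300 dpi setting are my own assumptions. For the scan case it swaps pdftoppm (also part of poppler-utils) in for the manual GIMP step to get PPM files, then runs gocr on each page.

```python
import shutil
import subprocess
from pathlib import Path


def pdftotext_command(pdf_path, keep_layout=False):
    """Build the pdftotext command line; '-' sends the text to stdout."""
    cmd = ["pdftotext"]
    if keep_layout:
        cmd.append("-layout")  # try to preserve the physical page layout
    cmd += [str(pdf_path), "-"]
    return cmd


def looks_scanned(text, min_chars=20):
    """Heuristic (an assumption): almost no extractable text means a scan."""
    return len(text.strip()) < min_chars


def extract_text(pdf_path, keep_layout=False):
    """Return the PDF's text layer, or None if it looks like a scan."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found; install poppler-utils")
    result = subprocess.run(
        pdftotext_command(pdf_path, keep_layout),
        capture_output=True, encoding="utf-8", errors="replace", check=True,
    )
    return None if looks_scanned(result.stdout) else result.stdout


def ocr_scanned_pdf(pdf_path, workdir):
    """Render each page to PPM with pdftoppm, then OCR the pages with gocr."""
    prefix = Path(workdir) / "page"
    # 300 dpi is an assumed resolution; OCR quality depends on it.
    subprocess.run(["pdftoppm", "-r", "300", str(pdf_path), str(prefix)],
                   check=True)
    pages = []
    for ppm in sorted(Path(workdir).glob("page*.ppm")):
        out = subprocess.run(
            ["gocr", "-i", str(ppm)],
            capture_output=True, encoding="utf-8", errors="replace",
            check=True,
        )
        pages.append(out.stdout)
    return "\n".join(pages)
```

You would call extract_text("file.pdf") first and only fall back to ocr_scanned_pdf("file.pdf", some_temp_dir) when it returns None. The formatted-docx case is left out on purpose, since importing into LibreOffice is a manual step.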