LinuxQuestions.org


Xeratul 02-23-2013 12:25 PM

Pdf to text : Poppler-utils versus Convert/imagemagick?
 
Hi,

I would like to ask which tool you would recommend for converting a PDF to text: ImageMagick's convert, or pdftotext from poppler-utils?
http://packages.debian.org/fr/sid/poppler-utils

I have used both convert file.pdf file.txt
and pdftotext, but I still do not see a clear benefit of one over the other.

Which one works better, or which would you advise?

Regards

John VV 02-23-2013 12:44 PM

It really depends on what the PDF is.
If it is just a standard plain-text doc saved as a PDF,
then pdftotext.

If it is a formatted MS docx saved to a PDF,
then pdf2html, or import it into LibreOffice (my preference).

If it is a "fax" TIFF or JPG scan saved as a PDF,
then open it in GIMP and save to PPM,
then run gocr or ocrad on that.
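
The three cases above could be scripted roughly as follows. This is only a sketch under some assumptions: the function name plan_commands and the kind labels "text", "formatted" and "scan" are made up for illustration; pdftohtml (from poppler-utils) stands in for the "pdf2html" step; and pdftoppm (also from poppler-utils) is swapped in for the manual GIMP export so the scan route can run without a GUI. The 300 dpi value is an arbitrary choice, and -singlefile means only the first page is rendered.

```python
import subprocess


def plan_commands(kind, pdf, base):
    """Return the commands (as argv lists) to run, in order, for one kind of PDF.

    kind is a hypothetical label: "text", "formatted" or "scan".
    base is the output file name without extension.
    """
    if kind == "text":
        # Plain text saved as PDF: extract the text layer directly.
        return [["pdftotext", pdf, base + ".txt"]]
    if kind == "formatted":
        # Formatted document: convert to HTML to keep some of the layout.
        return [["pdftohtml", pdf, base + ".html"]]
    if kind == "scan":
        # Scanned image: render page 1 to base.ppm, then OCR it with gocr.
        # (Assumption: pdftoppm replaces the manual GIMP step.)
        return [
            ["pdftoppm", "-r", "300", "-singlefile", pdf, base],
            ["gocr", "-i", base + ".ppm", "-o", base + ".txt"],
        ]
    raise ValueError("unknown kind: " + repr(kind))


def run_plan(kind, pdf, base):
    """Run each planned command, stopping at the first failure."""
    for argv in plan_commands(kind, pdf, base):
        subprocess.run(argv, check=True)
```

For example, run_plan("text", "file.pdf", "file") would write file.txt, provided poppler-utils is installed. Nothing here detects which kind of PDF you have; you still need to look at the file and pick the route yourself.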

