Quote:
Originally Posted by ufmale
I have a lot of of pdf files that needed to convert to text?
Is there any command line tool or library that I can use for this purpose?
|
poppler includes a command line tool called "pdftotext". Aternatively, you can use pdf2ps+ps2ascii.
Quote:
I also looking for a command line tool to retrieve the first sentence from each pdf files.
Does anyone know any tool that can do that?
|
Once you have a plain text file you can use the command "head" for that.