Converting PDF to text, RTF, or DOC format
Hi all
Is there any program that can convert a PDF to a word processor file? If the PDF has smart quotes, bullet icons, copyright and trademark symbols, etc., what happens to them in text format? Ideally I would like to convert to RTF or DOC. Thanks, SS |
Well, there's the program pdf2ps, which converts PDF files to PostScript, and then there's ps2ascii, which converts PostScript to ASCII. I think that's as good as you'll get unless you trawl Google for a better program.
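For what it's worth, the two steps can be chained roughly like this. It is only a sketch: it assumes Ghostscript's pdf2ps and ps2ascii scripts are installed and on your PATH, and pdf_to_ascii is a made-up helper name, not something that ships with either tool.

```shell
# Sketch: PDF -> PostScript -> plain text via Ghostscript's helper scripts.
# pdf_to_ascii is a hypothetical helper name used only for illustration.
pdf_to_ascii() {
    in_pdf="$1"
    # Default output name: same path with .pdf swapped for .txt
    out_txt="${2:-${in_pdf%.pdf}.txt}"
    tmp_ps="$(mktemp)" || return 1

    # Step 1: PDF to PostScript. Step 2: PostScript to text on stdout.
    pdf2ps "$in_pdf" "$tmp_ps" && ps2ascii "$tmp_ps" > "$out_txt"
    rc=$?

    rm -f "$tmp_ps"   # clean up the intermediate PostScript file
    return "$rc"
}
```

Something like `pdf_to_ascii somefile.pdf` should then leave the text in somefile.txt.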
|
The xpdf package is a reasonably good free PDF viewer. The xpdf-utils package also includes a pdftotext command (usage: pdftotext somefile.pdf myfile.txt). Alternatively, you may want to use the pdftohtml package (same syntax), which keeps more of the original layout.
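On the smart quotes and symbols question: pdftotext has an -enc option to pick the output encoding and a -layout option to keep the physical layout, so a call along these lines may preserve more of those characters than plain ASCII would. This is a sketch, assuming pdftotext is on your PATH; pdf_to_text_utf8 is a made-up wrapper name, and how well the symbols survive depends on the PDF itself.

```shell
# Sketch: extract text as UTF-8 so non-ASCII characters have a chance to survive.
# pdf_to_text_utf8 is a hypothetical wrapper name used only for illustration.
pdf_to_text_utf8() {
    in_pdf="$1"
    # Default output name: same path with .pdf swapped for .txt
    out_txt="${2:-${in_pdf%.pdf}.txt}"

    # -layout keeps the original physical layout as far as possible
    # -enc UTF-8 selects the output text encoding
    pdftotext -layout -enc UTF-8 "$in_pdf" "$out_txt"
}
```

For example, `pdf_to_text_utf8 somefile.pdf myfile.txt` writes the text to myfile.txt.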
I am using Ubuntu (now 5.04, or Hoary, by far the best distribution there is :-) and the packages xpdf-utils (which includes pdftotext) and pdftohtml are available through apt-get or Synaptic. So they are surely in Debian too, and probably in many other distributions. |