LinuxQuestions.org

gawain 02-04-2007 09:30 AM

from html to txt/doc
 
Hi everybody.

I've been using lynx to convert html files to txt. But I was wondering if there is any other utility which could do the same job, and maybe a bit better. I've been searching in the forum, but it seems as if everybody is working from txt/doc to html.

Please, if anyone has got any hint.
Thanks a lot

David the H. 02-04-2007 09:58 AM

html2text

It did an excellent job for me converting some public domain stories into text files for reading on my PDA. I did have to fiddle with some of the conversion parameters to get the headers to come out as I wanted them, but that wasn't difficult to do.

gawain 02-04-2007 02:30 PM

Great. Thanks

dive 02-04-2007 03:33 PM

You might also want to look at w3m and elinks. elinks has the advantage that if you pass a web address that requires cookies and/or some sort of login it will usually work.
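
For anyone who wants to try these on a local file, here is a rough sketch rather than a definitive recipe. It assumes at least one of w3m, elinks or lynx is installed, and the function names pick_tool and html_to_text are made up for illustration; they are not part of any of these programs. Only the -dump options (plus -T for w3m and -nolist for lynx) are used.

```shell
#!/bin/sh
# Print the first command from the argument list that is installed.
# (pick_tool is a hypothetical helper name.)
pick_tool() {
    for tool in "$@"; do
        if command -v "$tool" >/dev/null 2>&1; then
            printf '%s\n' "$tool"
            return 0
        fi
    done
    return 1
}

# Dump a local HTML file as plain text on stdout, using whichever
# text browser is available. (html_to_text is a hypothetical name.)
html_to_text() {
    case "$(pick_tool w3m elinks lynx)" in
        # -T text/html tells w3m to treat the input as HTML
        w3m)    w3m -dump -T text/html "$1" ;;
        elinks) elinks -dump "$1" ;;
        # -nolist drops the numbered list of links lynx appends
        lynx)   lynx -dump -nolist "$1" ;;
        *)      echo "no text browser found (tried w3m, elinks, lynx)" >&2
                return 1 ;;
    esac
}
```

Usage would be something like 'html_to_text story.html > story.txt'. The three tools lay out tables and headings differently, so it is worth comparing their output on the same page.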

gawain 02-05-2007 01:52 PM

Thanks for the new tips.

Quote:

Originally Posted by dive
elinks has the advantage that if you pass a web address that requires cookies and/or some sort of login it will usually work.

I'm not sure I understand what you mean; anyway, I'll try both of them
Thanks again

dive 02-05-2007 02:30 PM

I mean if you do something like 'elinks -dump google.com > google.txt' it will log you in (provided you have a login there, of course) using cookies and convert your personalised page, rather than the page you would get without logging in.

gawain 02-06-2007 01:58 AM

OK! The things they think of!!
:)


All times are GMT -5.