Extract text from a html file
Hey guys
I am a newbie. I want to write a script that takes a directory with set of html files as input scan through each file and remove the html tags within the body of the pages and pick each word of the plain text that remains and put it in a file with each word on a new line.
--------------------------------------------
Thanks in advance
Last edited by gsphanikumar6; 08-20-2004 at 12:16 PM.
|