extract text portions from html files
Hello,
I have a daunting task: 350 html files on HD, each of these files contain a portion of text which I want to extract and save into a text file:
<li>E-mail: <a href="mailto:abc@abc.abc">abc@abc.abc</a></li>
Is there an easy way to do, a shell script, a command line tool, or something of this kind which will extract the emails?
Once this is done, I have to extract other text data and save it to file.
Thanks, indeed.
|