Well, if you specifically want ASCII, then you'll need to call the
html2text(1) command with the
-ascii flag.
Code:
html2text -ascii INPUTFILE.html > OUTPUTFILE.txt
But it's not a perfect solution. Anything between php brackets (
<?php ?>) is left in the converted file.
html2text seems to only remove obvious HTML tags, and this might not result in the kind of text files that you really want.