Found another little gremlin in my code. When I remove <SPAN></SPAN> tags, any other tag or text that sits between them gets removed as well.
Example code:
Code:
<P STYLE="font-weight: medium"><SPAN STYLE="font-style: normal"><SPAN STYLE="font-weight: medium">After</SPAN></SPAN><I><SPAN STYLE="font-weight: medium">
The Lost Colony, </SPAN></I>
My replacement commands:
Code:
sed -i 's/<I>/<em>/g' "$f"
sed -i 's/<\/I>/<\/em>/g' "$f"
sed -i 's/<SPAN STYLE=".*">//g' "$f"
sed -i 's/<\/SPAN>//g' "$f"
**EDIT**
It's hard to see, but what I mean is that in the first code block, the word After gets removed when the span tags are removed. The <I> tag gets removed as well.
**EDIT**
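A likely cause, as far as I can tell: `.*` in sed is greedy, so `<SPAN STYLE=".*">` matches from the first `<SPAN STYLE="` all the way to the last `">` on the line, swallowing everything in between. Below is a minimal sketch that reproduces this on the sample line and compares it with a bounded pattern, `[^"]*`, which stops at the first closing quote. The variable names (`line`, `greedy`, `fixed`) are only for illustration, and it pipes through sed instead of using `-i` so nothing on disk is touched.

```shell
# Sample line copied from the post; variable names here are hypothetical.
line='<P STYLE="font-weight: medium"><SPAN STYLE="font-style: normal"><SPAN STYLE="font-weight: medium">After</SPAN></SPAN><I><SPAN STYLE="font-weight: medium">'

# Greedy: .* runs to the LAST "> on the line, so "After" and <I> are eaten too.
greedy=$(printf '%s\n' "$line" | sed 's/<SPAN STYLE=".*">//g')

# Bounded: [^"]* cannot cross a double quote, so each match is a single tag.
fixed=$(printf '%s\n' "$line" | sed \
  -e 's/<SPAN STYLE="[^"]*">//g' \
  -e 's/<\/SPAN>//g' \
  -e 's/<I>/<em>/g' \
  -e 's/<\/I>/<\/em>/g')

printf 'greedy: %s\n' "$greedy"   # <P STYLE="font-weight: medium">
printf 'fixed:  %s\n' "$fixed"    # <P STYLE="font-weight: medium">After<em>
```

If that matches what you are seeing, swapping `.*` for `[^"]*` in the third command should be enough; the other three commands can stay as they are. This assumes the STYLE value never contains a double quote itself.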