ok I'll explain it in detail.
here is the html file.
Quote:
<BODY LANG="en-US" DIR="LTR">
<P STYLE="margin-bottom: 0cm"><FONT FACE="Bamini">kl;lf;fsg;G
<FONT FACE="Bitstream Charter, serif">(baticoloa)</FONT> rpj;jhz;b
Nfhusq;Nfzp gpuNjrj;jpy; jkpo; ,isQd; xUtu; milahsk; njupahj
MAjjhupfspdhy; Rl;Lf;nfhiy nra;ag;gl;Ls;shu;. Rl;Lf;nfhiy
nra;ag;gl;ltu;> rpj;jhz;b ehtyu; tPjpiar; Nru;e;j kJutPud; Rnu];
<FONT FACE="Bitstream Charter, serif">(suresh)</FONT> (taJ 19) vd
njupate;Js;sJ. jdJ ez;gu;fSld; Js;Se;J xd;wpy; ,d;W tpahof;fpoik fhiy
6.30 kzpf;F nrd;W nfhz;bUe;jNghJ ,j;jhf;Fjy; elj;jg;gl;lJ. mtuJ
rlyj;ij kPl;fr;nrd;w NghJ rlyj;jpw;F mUfpy; Fz;L xd;W fhzg;gl;ljhf
rpwpyq;fh fhty;Jiwapdu; njuptpj;Js;du;. kuz tprhuizfspd; gpd;du; ,d;W
gpw;gfy; RNu]; rlyk; cwtpdu;fsplk; xg;gilf;fg;gl;lJ.</FONT></P>
</BODY>
|
You can see there are two font varients.
1. <FONT FACE="Bitstream Charter, serif">&&&&&&&&&&&</FONT>
2. <FONT FACE="Bamini">**********</FONT>
the script will search entire html, and will find the pattern <FONT FACE="Bamini">**********</FONT>
and then it will apply this sed command only to the text which situated by "<FONT FACE="Bamini">**********</FONT>" tags (the *****)
Quote:
|
sed -e 's/—/௮/g' -e 's/•/௬/g' -e 's/‚/ஸ்ரீ/g' < $filename > $filename-tmp-tsc2uni
|
important thing is, it should not make any changes to other text areas.