Guys I have another problem, i think sed can do that
input (all in one line): Code:
SITA<br>_________<br><span class="foto">FOTO: ilustračné foto SITA_AP<br><br><span class="datum">Sobota 24. októbra 2009</span><br clear="left"> Code:
2009 Sobota 24. októbra Code:
cat mikus.html | sed -n 's/"datum">\([a-zA-Z][a-zA-Z]* [0-9][0-9]*\. [a-zA-Z][a-zA-Z]*\)&[a-zA-Z][a-zA-Z]*;\([0-9][0-9][0-9][0-9]\)/\2 \1/gp' Code:
SITA<br>_________<br><span class="foto">FOTO: ilustračné foto SITA_AP<br><br><span class=2009 Sobota 24. októbra</span><br clear="left"> I tried same regexp (without memory of course) in grep an seems work Code:
grep -o '"datum">[a-zA-Z][a-zA-Z]* [0-9][0-9]*\. [a-zA-Z&][A-Za-z&]*;[0-9][0-9][0-9][0-9]' mikus.html |
awk
Code:
# awk -vRS='</span>' '{gsub(/.*>| /,"")}1' file |
All times are GMT -5. The time now is 10:08 AM. |