Quote:
Originally Posted by Artanicus
Will get you pretty close, need just a bit more parsing:
Code:
wget -O - "http://en.wikipedia.org/w/index.php?title=Lyman_Enos_Knapp&action=edit" | awk '/textarea/,/<\/textarea>/'
edit:
And to get rid of the entities, just pipe it onwards to elinks / lynx.
|
Thanks! I ended up doing something similar:
Code:
wget -O - "http://en.wikipedia.org/w/index.php?title=Lyman_Enos_Knapp&action=edit" | sed -ne '/textarea/,/textarea/p'
In case anyone goes down this path for twiki (not wikipedia), there's an attribute for getting the wiki code without editing (still needs some trimming though):
Code:
wget -q --no-proxy -O - http://cds.u-strasbg.fr/twikiDCA/bin/view/EuroVODCA/DCASchedule?raw=on | sed -ne '/textarea/,/textarea/{;1,2d;p;}'