How do I cut out a specific piece of a html page (using sed/awk or similar)?
I need to cut a specific table out of a html page. How can I do that?
I've been looking at the sed and awk/gawk commands, but it's a little overwhelming for a first-time user of those commands... It doesn't need to be done with either sed or awk. If you know some other command that can do this easily, please let me know! Let me explain in more detail what I nedd to do: Take a html page like this: Code:
<html> |
This example might get you started....
Code:
sed '/<table.*>/,/<\/table>/d' index.html >file.txt |
Since my original post, I have started wondering if I should do what I want do in a python script. It might turn out to be easier at the end of the day.
Thanks anyway though. Even if I end up not using your contribution anyway :) |
All times are GMT -5. The time now is 12:56 PM. |