First of all I'd look if that site doesn't have an RSS/Atom feed that might already contain a compressed version of the news.
Otherwise, I wouldn't use awk/grep etc. but a tool that is designed to deal with HTML (and similar code like XML).
Two such tools are
xmllint (part of libxml2) and
xmlstarlet.
The thing you want to learn are "xpath queries". Yes, there's a small learning curve but you'll soon appreciate working
with the code you're parsing, not
against it.
Just
look around for a suitable tutorial.
If you give us example code we can help more.