LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Newbie (https://www.linuxquestions.org/questions/linux-newbie-8/)
-   -   semi hard sed/awk problem. (https://www.linuxquestions.org/questions/linux-newbie-8/semi-hard-sed-awk-problem-4175507572/)

////// 06-10-2014 03:47 AM

semi hard sed/awk problem.
 
hey guys/gals.

i have the following problem:

i have a list of html tags:
Code:

<a>
<abbr>
<acronym>
<address>
<applet>
<area>
<article>
<aside>
<audio>
<b>
<base>
<basefont>
<bdi>
<bdo>
<big>
<blockquote>
<body>
<br>
<button>
<canvas>
<caption>
<center>
<cite>
<code>
<col>
<colgroup>
<datalist>
<dd>
<del>
<details>
<dfn>
<dialog>
<dir>
<div>
<dl>
<dt>
<em>
<embed>
<fieldset>
<figcaption>
<figure>
<font>
<footer>
<form>
<frame>
<frameset>
<h1>
<h2>
<h3>
<h4>
<h5>
<h6>
<head>
<header>
<hr>
<html>
<i>
<iframe>
<img>
<input>
<ins>
<kbd>
<keygen>
<label>
<legend>
<li>
<link>
<main>
<map>
<mark>
<menu>
<menuitem>
<meta>
<meter>
<nav>
<noframes>
<noscript>
<object>
<ol>
<optgroup>
<option>
<output>
<p>
<param>
<pre>
<progress>
<q>
<rp>
<rt>
<ruby>
<s>
<samp>
<script>
<section>
<select>
<small>
<source>
<span>
<strike>
<strong>
<style>
<sub>
<summary>
<sup>
<table>
<tbody>
<td>
<textarea>
<tfoot>
<th>
<thead>
<time>
<title>
<tr>
<track>
<tt>
<u>
<ul>
<var>
<video>
<wbr>

what i need to do to em is the following:
Code:

(\<a\>)(*.*)(\<\/a\>)
yeah, im tweaking a port 80 sniffer and use those results in a regex.
im hoping someone has a resolution to this problem or i have to do it atleast 50% by hand :(

linosaurusroot 06-10-2014 05:21 AM

see http://www.linuxquestions.org/questi...ge-4175449919/

grail 06-10-2014 08:37 AM

Maybe you could provide some output as the regex listed would be rather ineffectual on the presented input?


All times are GMT -5. The time now is 04:16 AM.