singletons:<br> <li> <!--> <hr>opener/closer pairs:<html> ... </html> <head> ... </head> <title> ... </title> <body> ... </body> <center> ... </center> <h1> ... </h1> <h2> ... </h2> <h3> ... </h3> <h4> ... </h4> <b> ... </b> <i> ... </i> <pre> ... </pre> <code> ... </code> <sup> ... </sup> <sub> ... </sub> <p> ... </p> <a ... "filename"> ... </a> <ul> ... </ul> <ol> ... </ol> <dl> ... </dl> <dd> ... </dd> <dt> ... </dt> <table> ... </table> <tr> ... </tr> <td> ... </td> <th> ... </th>
>
. These can be ignored for every tag other
than the anchor (<a href = "filename">
) tag, from
which you need to extract the filename.
<
and no
>
; anchor tags will always have a filename in double
quotes; etc. The only error that may occur is that tags are not
properly nested.