<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> <!--Rendered using the Haskell Html Library v0.2--> <HTML ><HEAD ><META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=UTF-8" ><TITLE >tagsoup-0.12: Parsing and extracting information from (possibly malformed) HTML/XML documents</TITLE ><LINK HREF="haddock.css" REL="stylesheet" TYPE="text/css" ><SCRIPT SRC="haddock-util.js" TYPE="text/javascript" ></SCRIPT ></HEAD ><BODY ><TABLE CLASS="vanilla" CELLSPACING="0" CELLPADDING="0" ><TR ><TD CLASS="topbar" ><TABLE CLASS="vanilla" CELLSPACING="0" CELLPADDING="0" ><TR ><TD ><IMG SRC="haskell_icon.gif" WIDTH="16" HEIGHT="16" ALT=" " ></TD ><TD CLASS="title" >tagsoup-0.12: Parsing and extracting information from (possibly malformed) HTML/XML documents</TD ><TD CLASS="topbut" ><A HREF="index.html" >Contents</A ></TD ><TD CLASS="topbut" ><A HREF="doc-index.html" >Index</A ></TD ></TR ></TABLE ></TD ></TR ><TR ><TD CLASS="section1" >tagsoup-0.12: Parsing and extracting information from (possibly malformed) HTML/XML documents</TD ></TR ><TR ><TD CLASS="doc" ><P >TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification and can parse either well-formed XML or unstructured, malformed HTML from the web. The library also provides functions for extracting information from an HTML document, making it well suited to screen-scraping. </P ><P >Users should start from the <A HREF="Text-HTML-TagSoup.html" >Text.HTML.TagSoup</A > module. 
</P ></TD ></TR ><TR ><TD CLASS="section1" >Modules</TD ></TR ><TR ><TD ><TABLE CLASS="vanilla2" CELLSPACING="0" CELLPADDING="0" ><TR ><TD STYLE="width: 50em" ><IMG SRC="minus.gif" CLASS="coll" ONCLICK="toggle(this,'n:0')" ALT="show/hide" >Text</TD ><TD ></TD ><TD ></TD ></TR ><TR ><TD STYLE="padding: 0; padding-left: 2em" COLSPAN="3" ><TABLE CLASS="vanilla2" CELLSPACING="0" CELLPADDING="0" ID="n:0" STYLE="display:block;" ><TR ><TD STYLE="width: 48em" ><IMG SRC="minus.gif" CLASS="coll" ONCLICK="toggle(this,'n:1')" ALT="show/hide" >HTML</TD ><TD ></TD ><TD ></TD ></TR ><TR ><TD STYLE="padding: 0; padding-left: 2em" COLSPAN="3" ><TABLE CLASS="vanilla2" CELLSPACING="0" CELLPADDING="0" ID="n:1" STYLE="display:block;" ><TR ><TD STYLE="width: 46em" ><IMG SRC="minus.gif" CLASS="coll" ONCLICK="toggle(this,'n:2')" ALT="show/hide" ><A HREF="Text-HTML-TagSoup.html" >Text.HTML.TagSoup</A ></TD ><TD ></TD ><TD ></TD ></TR ><TR ><TD STYLE="padding: 0; padding-left: 2em" COLSPAN="3" ><TABLE CLASS="vanilla2" CELLSPACING="0" CELLPADDING="0" ID="n:2" STYLE="display:block;" ><TR ><TD STYLE="padding-left: 1.25em;width: 44em" ><A HREF="Text-HTML-TagSoup-Entity.html" >Text.HTML.TagSoup.Entity</A ></TD ><TD ></TD ><TD ></TD ></TR ><TR ><TD STYLE="padding-left: 1.25em;width: 44em" ><A HREF="Text-HTML-TagSoup-Match.html" >Text.HTML.TagSoup.Match</A ></TD ><TD ></TD ><TD ></TD ></TR ><TR ><TD STYLE="padding-left: 1.25em;width: 44em" ><A HREF="Text-HTML-TagSoup-Tree.html" >Text.HTML.TagSoup.Tree</A ></TD ><TD ></TD ><TD ></TD ></TR ></TABLE ></TD ></TR ></TABLE ></TD ></TR ><TR ><TD STYLE="padding-left: 1.25em;width: 48em" ><A HREF="Text-StringLike.html" >Text.StringLike</A ></TD ><TD ></TD ><TD ></TD ></TR ></TABLE ></TD ></TR ></TABLE ></TD ></TR ><TR ><TD CLASS="s15" ></TD ></TR ><TR ><TD CLASS="botbar" >Produced by <A HREF="http://www.haskell.org/haddock/" >Haddock</A > version 2.6.1</TD ></TR ></TABLE ></BODY ></HTML >