I want to know how you guys parse html. I'm writing a program to compare products on a website. I already have a system that scrapes the websites.
I looked at HTMLParser, I'm a little confused on how to use it, I was thinking about writing my own script that searches the site for example:
The pricing code looks like this:
<dl class="pricing"> <dt> Regular Price: </dt> <dd> $899.99 </dd> </dl>
I would like to extract the
Regular Price: and the
899.99 Should I use a parsing api like HTMLParser or write my own?