Hey guys,

I want to know how you guys parse HTML. I'm writing a program to compare products on a website, and I already have a system that scrapes the websites.

I looked at HTMLParser, but I'm a little confused about how to use it. I was thinking about writing my own script that searches the page for the markup I need. For example, the pricing code looks like this:

<dl class="pricing">
    
        <dt>
            Regular Price:
        </dt>
        <dd>
            $899.99
        </dd>

</dl>

I would like to extract the label "Regular Price:" and the value 899.99. Should I use a parsing API like HTMLParser, or write my own?
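
For reference, here is a rough sketch of what the HTMLParser route might look like. It assumes "HTMLParser" means the class in Python's standard-library html.parser module (the post doesn't say which language or library is in use), and the names PricingParser and extract_prices are made up for illustration. It only handles a flat dl like the one above, not nested lists.

```python
from html.parser import HTMLParser


class PricingParser(HTMLParser):
    """Collects (label, value) pairs from <dt>/<dd> inside <dl class="pricing">.

    Hypothetical helper class, not part of any library.
    """

    def __init__(self):
        super().__init__()
        self.in_pricing = False   # True while inside <dl class="pricing">
        self.current_tag = None   # "dt" or "dd" while collecting its text
        self.buffer = []          # text chunks of the current dt/dd
        self.label = None         # text of the most recent <dt>
        self.prices = []          # collected (label, value) tuples

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value can be None
        classes = (dict(attrs).get("class") or "").split()
        if tag == "dl" and "pricing" in classes:
            self.in_pricing = True
        elif self.in_pricing and tag in ("dt", "dd"):
            self.current_tag = tag
            self.buffer = []

    def handle_data(self, data):
        if self.current_tag:
            self.buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "dl":
            self.in_pricing = False
        elif self.in_pricing and tag == self.current_tag:
            text = "".join(self.buffer).strip()
            if tag == "dt":
                self.label = text
            else:
                self.prices.append((self.label, text))
            self.current_tag = None


def extract_prices(html):
    """Return a list of (label, value) tuples found in the pricing list."""
    parser = PricingParser()
    parser.feed(html)
    parser.close()
    return parser.prices


SAMPLE = """<dl class="pricing">
    <dt>
        Regular Price:
    </dt>
    <dd>
        $899.99
    </dd>
</dl>"""

print(extract_prices(SAMPLE))
```

The value comes back with the dollar sign still attached ("$899.99"), so getting the bare 899.99 would take one more step, such as value.lstrip("$").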

PO
