C++ and Webpage Parseing

Question

CrazyProgrammer 0 Light Poster

14 Years Ago

Hi all,
I wasn't sure where to post this, so I thought here might be best :D
I have to develop a program that takes a webpage and extracts information from it, so I was wander how would be the best way to do this, without using an external library(like curl)?
I.e. how does ones browser find words on a webpage when using ctrl+F, does it simply run a string search algorithm or are there other tricks to it? Another way to look at it, how would a web crawler know about the links etc, does it search for all <a></a> tags in the webpage or like before might there be some other trick to it?
Any pointers would be gladly appreciated :D
Thanks

algorithm c c# c++ web-browser

2 Contributors
1 Reply
141 Views
3 Weeks Discussion Span
Latest Post 14 Years Ago Latest Post by daviddoria

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

daviddoria 334 Posting Virtuoso Featured Poster · Answer 1 · 2011-04-15T05:31:54+00:00

I'd bet you Qt has some widgets that are searchable. I've never used them, but it might be worth looking into.

Also, are you sure you need to do this in c++? People tell me that perl is the way to go for string manipulation/searching/etc.