I wasn't sure where to post this, so I thought here might be best :D
I have to develop a program that takes a webpage and extracts information from it, so I was wondering what the best way to do this would be without using an external library (like curl)?
For example, how does a browser find words on a webpage when you press Ctrl+F? Does it simply run a string search algorithm over the page text, or are there other tricks to it? Similarly, how does a web crawler know about the links on a page? Does it search for all <a></a> tags in the webpage, or is there some other trick to that as well?
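To make the question concrete, here is a rough sketch of what I imagine the two ideas look like. It is only a guess at the approach, not how a real browser or crawler works. I am assuming Python and its standard library only (html.parser), and the names LinkAndTextCollector, find_all and SAMPLE_PAGE are made up for this example. The page is a hard-coded string; fetching a real one would be a separate step.

```python
from html.parser import HTMLParser


class LinkAndTextCollector(HTMLParser):
    """Collects href values of <a> tags and the visible text of a page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a":
            # attrs is a list of (name, value) pairs
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only text that is not inside <script> or <style>
        if self._skip_depth == 0:
            self.text_parts.append(data)


def find_all(text, word):
    """Return every start index of word in text, ignoring case."""
    positions = []
    haystack, needle = text.lower(), word.lower()
    start = haystack.find(needle)
    while start != -1:
        positions.append(start)
        start = haystack.find(needle, start + 1)
    return positions


# Hypothetical page used only for illustration
SAMPLE_PAGE = """<html><head><style>a { color: red; }</style></head>
<body><p>Hello world, see <a href="/about">about</a> and
<a href="http://example.com/">an example</a>.</p>
<script>var hello = 1;</script></body></html>"""

parser = LinkAndTextCollector()
parser.feed(SAMPLE_PAGE)
parser.close()
visible_text = "".join(parser.text_parts)

print(parser.links)                     # the href of each <a> tag
print(find_all(visible_text, "hello"))  # match positions in the visible text
```

So the "crawler" part is just collecting href attributes while parsing, and the "Ctrl+F" part is a plain substring search over the text with the tags stripped out. What I don't know is whether real tools do anything smarter than this.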
Any pointers would be greatly appreciated :D