how to make a crawler to fetch particular web page's content

Question

Arun.N 0 Newbie Poster

17 Years Ago

i try to make a crawler that crawls a web page & retrieves
the stock information from google,but can't do it .
so plz help me 2 make that type of crawler.
urgent plz...

php

6 Contributors
5 Replies
169 Views
2 Years Discussion Span
Latest Post 15 Years Ago Latest Post by moobaa

All 5 Replies

chrelad 4 Light Poster

17 Years Ago

Hi Arun.N,

Sounds like your looking for cURL... Have a look at the cURL documentation and see what you think.

cURL + regular expressions (preg_match_all) = exactly what your looking for.

I've written a few of these "crawlers" myself, so I'll include some foundational code for a very a simple one for you:

<?php

// Return a handle to a curl connection to the site you want to pull info from
$ch = curl_init('http://finance.google.com/finance');

// Set some options for the connection
curl_setopt($ch,CURLOPT_HEADER,0); // Don't return header information, although, this can be handy ;)
curl_setopt($ch,CURLOPT_RETURNTRANSFER,1); // Give us the page source

// Open the connection with the options specified
$cr = curl_exec($ch);

// Run your regular expression against the source to pull what you want, you can use external programs to format the html for easier parsing if you want before you scan it.
preg_match_all('/href="()"/i',$cr,$pm,PREG_SET_ORDER);

// So you can see what you found
print_r($pm);

// Display the results again :D
foreach($pm as $pv) echo $pv[1] . "\r\n";

?>

Hope this helps!

almostbob 866 Retired: passive income ROCKS

15 Years Ago

use the rss feed of the stock page

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

mario.stoica 0 Newbie Poster · Answer 1 · 2008-01-04T04:47:59+00:00

<?php

$ch = curl_init("http://www.example.com/");
$fp = fopen("example_homepage.txt", "w");

curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);

curl_exec($ch);
curl_close($ch);
fclose($fp);
?>

gopiyadav 0 Newbie Poster · Answer 2 · 2010-03-20T18:20:40+00:00

Hi friends
I need some help from you guys.....I need a crawler such that it tracks the changes in the website content and it should show the track changes like oldcontent and newcontent should be shown side by side

moobaa 0 Newbie Poster · Answer 3 · 2010-03-25T13:20:54+00:00

Hi there...

I tried this cURL script, but all I get returned is "Array()"... I've got this running here: http://www.kiwitube.com/scraper.php

Cheers,
Todd

how to make a crawler to fetch particular web page's content

Recommended Answers Collapse Answers

All 5 Replies

Recommended Answers