954,135 Members — Technology Publication meets Social Media
Username:
Password:
Lost login information?
Have something to say? Contribute New Article Reply to this Article

web scraping

Hi,

Can anyone guide me to the best resouce or learning materials for web scraping?

Thanks in advance

crazynp
Junior Poster in Training
58 posts since Jan 2007
Reputation Points: 10
Solved Threads: 2
 

There isn't much that you need besides www.php.net/file and www.php.net/file_get_contents

Those should get you started.

Gary King
PHP/vBulletin Guru
Team Colleague
417 posts since Nov 2003
Reputation Points: 53
Solved Threads: 5
 

Thanks for the reply. It has been so long anyone has replied to this thread. But I have been using the Curl function now. The main problem I am facing now is to get the result eithout the hyperlinks and unmeangiful characters like ..., http://...../ .

Also do you have any idea or suggestion or hints or anything to make the realsitic article from the result with different unwanted characters.

Thanks in advance!

crazynp
Junior Poster in Training
58 posts since Jan 2007
Reputation Points: 10
Solved Threads: 2
 

Use www.php.net/preg_match to match the particular contents that you want from the page.

Gary King
PHP/vBulletin Guru
Team Colleague
417 posts since Nov 2003
Reputation Points: 53
Solved Threads: 5
 

This article has been dead for over three months

Post: Markdown Syntax: Formatting Help
You