Feb 1st, 2007

web scraping

Hi,

Can anyone point me to the best resources or learning materials for web scraping?

Thanks in advance
crazynp
Feb 7th, 2007

Re: web scraping

You don't need much besides www.php.net/file and www.php.net/file_get_contents.

Those should get you started.
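As a rough illustration of the two functions mentioned above, here is a minimal sketch, assuming PHP 7.1 or later. The helper names fetch_page() and fetch_lines(), the timeout value, and the user-agent string are made up for this example; only file_get_contents(), file(), and stream_context_create() are actual PHP functions. Fetching http:// URLs this way also assumes allow_url_fopen is enabled.

```php
<?php
// Hypothetical helper: fetch a URL (or local file) as one string.
// Returns null on failure instead of PHP's default false.
function fetch_page(string $source, int $timeoutSeconds = 10): ?string
{
    // The 'http' options only apply when $source is an http(s) URL.
    $context = stream_context_create([
        'http' => [
            'timeout'    => $timeoutSeconds,
            'user_agent' => 'ExampleScraper/0.1', // placeholder value
        ],
    ]);
    $html = @file_get_contents($source, false, $context);
    return $html === false ? null : $html;
}

// Hypothetical helper: fetch the same source as an array of lines,
// without trailing newlines and without empty lines.
function fetch_lines(string $source): array
{
    $lines = @file($source, FILE_IGNORE_NEW_LINES | FILE_SKIP_EMPTY_LINES);
    return $lines === false ? [] : $lines;
}
```

file_get_contents() is usually the more convenient of the two when the whole page is going to be searched at once, while file() helps when the page is processed line by line.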
Gary King
Feb 8th, 2007

Re: web scraping

Thanks for the reply. It has been a long time since anyone replied to this thread. In the meantime I have been using the cURL functions. My main problem now is getting the result without the hyperlinks and meaningless characters like ..., http://...../.

Also, do you have any ideas, suggestions, or hints for producing a realistic article from a result that is full of unwanted characters?

Thanks in advance!
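One possible way to approach the cleanup described above is sketched below. It assumes the fetched page is already in a string, and the function name clean_scraped_text() is made up for this example. It drops whole link elements (including their link text), removes the remaining tags, then strips bare URLs and runs of dots. The patterns are illustrative guesses at what "unwanted characters" means here and would likely need tuning for a real site.

```php
<?php
// Hypothetical helper: turn scraped HTML into plain text without
// hyperlinks, bare URLs, or runs of dots.
function clean_scraped_text(string $html): string
{
    // Drop <script> and <style> blocks together with their contents.
    $text = preg_replace('#<(script|style)\b[^>]*>.*?</\1>#is', ' ', $html);
    // Drop whole anchor elements, including the visible link text.
    $text = preg_replace('#<a\b[^>]*>.*?</a>#is', ' ', $text);
    // Put a space before each tag so adjacent blocks do not run together,
    // then strip the remaining tags and decode entities such as &amp;.
    $text = strip_tags(str_replace('<', ' <', $text));
    $text = html_entity_decode($text, ENT_QUOTES | ENT_HTML5, 'UTF-8');
    // Remove bare URLs that appear in the text itself.
    $text = preg_replace('#\b(?:https?://|www\.)\S+#i', ' ', $text);
    // Remove runs of two or more dots left over from truncated text.
    $text = preg_replace('/\.{2,}/', ' ', $text);
    // Collapse whitespace into single spaces.
    return trim(preg_replace('/\s+/', ' ', $text));
}
```

Removing the link text along with the link is a design choice. If the link text should stay in the article, the anchor step can be skipped and strip_tags() will keep the text while removing the tags.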
crazynp
Feb 9th, 2007

Re: web scraping

Use www.php.net/preg_match to match the particular contents that you want from the page.
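The suggestion above might look something like the following sketch, assuming PHP 7.4 or later. The function names extract_title() and extract_paragraphs() and the choice of the title and p tags are only examples; the pattern would have to be adapted to the markup of the page being scraped. Regular expressions are fragile against irregular HTML, so this is a starting point rather than a robust parser.

```php
<?php
// Hypothetical helper: return the contents of the first <title> element,
// or null when the page has none.
function extract_title(string $html): ?string
{
    if (preg_match('#<title\b[^>]*>(.*?)</title>#is', $html, $m)) {
        return trim(html_entity_decode($m[1], ENT_QUOTES | ENT_HTML5, 'UTF-8'));
    }
    return null;
}

// Hypothetical helper: return the text of every <p> element, with any
// nested tags removed. preg_match_all() collects all matches, whereas
// preg_match() stops at the first one.
function extract_paragraphs(string $html): array
{
    preg_match_all('#<p\b[^>]*>(.*?)</p>#is', $html, $matches);
    return array_map(fn($p) => trim(strip_tags($p)), $matches[1]);
}
```

The s modifier lets . match newlines, so an element that spans several lines is still captured, and the lazy .*? keeps each match from running past the first closing tag.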
Gary King

© 2011 DaniWeb® LLC