Web Crawler

Reply

Join Date: May 2005
Posts: 3
Reputation: maverick_79in is an unknown quantity at this point 
Solved Threads: 0
maverick_79in maverick_79in is offline Offline
Newbie Poster

Web Crawler

 
0
  #1
May 25th, 2005
Hi,

Well i am new to these forum as well as IT field.But i was looking for a right form to post up my add for help.

I am an Aerospace student from scotland dont knwo much about IT.
I have been assigend the IT Project from the university to Design a SIMPLE WEB CRAWLER Using JAVA to get some scientific Earth data from the net and filter them according to the parameter they have given and for this these ****ers have gave me only 30 days.

I need a help from experts like u.Please i have got few question if any one can help me i would be thankful to you.
1) What's the basic for this i should read cause i havent done much proramming and if any one of u have got PDF can u forward me.
2) I tried to download abour crawler but could not get what sort of JAVA i have to learn wether all of it or part of it for my application.
3) Any online Tutorials or links to read.
4) Can any one can help/Teach me to write code ?
5) I have got 30 to max 45 Days to get it done.

Waiting for a positive feedback from any one in this forum.Please help me if u can.
You can email me on pranav162@gmail.com

Cheers,
Maverick.
Reply With Quote Quick reply to this message  
Join Date: Jun 2004
Posts: 2,108
Reputation: server_crash is on a distinguished road 
Solved Threads: 18
server_crash server_crash is offline Offline
Postaholic

Re: Web Crawler

 
0
  #2
May 25th, 2005
I wrote a very simple one not too long ago, actually, it was simpler than what your wanting. If you don't know much about programming, then you definitely have some work ahead of you.

As for the tutorials you'll need to read, there's a lot! You will want to read about Strings and String methods so that you can parse through all of the HTML code. Then, you got to worry about the Robot Exclusionary Standard, so you'll need to read about how to access files and all that good stuff. There's the java.net and java.io package you'll need to get familar with so you can create Streams, URls, and connections to the websites.
The java.sun.com site is the best place to look for those tutorials.
There's a lot more too it than that though. I really don't see how you can go into this not knowing much about the language and come out with something like that, but good luck. If you need anything else, let me know.

Here is some resources I looked at while creating mine, although in the end none of them turned out to be helpful..I just designed my own.
http://moguntia.ucd.ie/programming/webcrawler/
http://research.compaq.com/SRC/merca.../www/paper.pdf
http://cis.poly.edu/suel/papers/crawl.pdf
Reply With Quote Quick reply to this message  
Join Date: May 2005
Posts: 3
Reputation: maverick_79in is an unknown quantity at this point 
Solved Threads: 0
maverick_79in maverick_79in is offline Offline
Newbie Poster

Re: Web Crawler

 
0
  #3
Jun 6th, 2005
Hi,

Well sorry for the late reply cause i had my exams.
I have got the detail about the project.Its like i have to link the webcrawler to 2 websites and interact with this 2 sites.
The college have agreed to use any dummmy programme and make the necessary changes.

Can you help me with any ready dummy programme by which i can make the necessary changes and just provide them with basic module.

Regards,
Maverick.







Originally Posted by server_crash
I wrote a very simple one not too long ago, actually, it was simpler than what your wanting. If you don't know much about programming, then you definitely have some work ahead of you.

As for the tutorials you'll need to read, there's a lot! You will want to read about Strings and String methods so that you can parse through all of the HTML code. Then, you got to worry about the Robot Exclusionary Standard, so you'll need to read about how to access files and all that good stuff. There's the java.net and java.io package you'll need to get familar with so you can create Streams, URls, and connections to the websites.
The java.sun.com site is the best place to look for those tutorials.
There's a lot more too it than that though. I really don't see how you can go into this not knowing much about the language and come out with something like that, but good luck. If you need anything else, let me know.

Here is some resources I looked at while creating mine, although in the end none of them turned out to be helpful..I just designed my own.
http://moguntia.ucd.ie/programming/webcrawler/
http://research.compaq.com/SRC/merca.../www/paper.pdf
http://cis.poly.edu/suel/papers/crawl.pdf
Reply With Quote Quick reply to this message  
Join Date: Jun 2004
Posts: 2,108
Reputation: server_crash is on a distinguished road 
Solved Threads: 18
server_crash server_crash is offline Offline
Postaholic

Re: Web Crawler

 
0
  #4
Jun 6th, 2005
That would be much easier, but I don't know of any starter programs like that. I'll search around and see what I can find.
Reply With Quote Quick reply to this message  
Join Date: May 2005
Posts: 3
Reputation: maverick_79in is an unknown quantity at this point 
Solved Threads: 0
maverick_79in maverick_79in is offline Offline
Newbie Poster

Re: Web Crawler

 
0
  #5
Jun 8th, 2005
any news

Right now i am going through the JAVA BASICS

maverick
Reply With Quote Quick reply to this message  
Join Date: Jun 2004
Posts: 609
Reputation: freesoft_2000 is an unknown quantity at this point 
Solved Threads: 7
freesoft_2000 freesoft_2000 is offline Offline
Practically a Master Poster

Re: Web Crawler

 
0
  #6
Jun 16th, 2005
Hi everyone,

You can read the below threads, they comes with some sample codes and examples as well as detailed expalanations

http://java.sun.com/developer/techni...ty/WebCrawler/

http://www.devarticles.com/c/a/Java/...Web-with-Java/

I hope this helps you

Yours Sincerely

Richard West
Microsoft uses "One World, One Web, One Program" as a slogan.
Doesn’t that sound like "Ein Volk, Ein Reich, Ein Führer" to you, too?
— Eric S. Raymond

Tell me what type of software do you like and what would you pay for it

http://www.daniweb.com/techtalkforums/thread19660.html
Reply With Quote Quick reply to this message  
Reply

This thread is more than three months old.
Perhaps start a new thread instead?
Message:


Thread Tools Search this Thread



About Us | Contact Us | Advertise | DaniWeb | Acceptable Use Policy | RSS Feed

©2003 - 2009 DaniWeb® LLC