Scheduling System
This is a request for information on creating a scheduling system. I have been unsuccessful in finding worthwhile results from Google. Since I have never created one, I thought I might ask those who have and try to avoid common mistakes or pitfalls.
As an example, let's say I am scheduling a web scraper:
Scraper 1 runs every 5 mins Monday-Sunday
Scraper 2 runs every 15 mins Monday-Friday
Scraper 1 does not run on Christmas Day
I am not asking for code, but best practices, reading materials or things to consider.
Think crontab, but with a history (when did the last scheduled run start and finish? Is it still executing?). I plan to store the schedule data in a database.
Thanks for any input.
(EDIT: I did not see the "Software Design" forum at first, so apologies if this is in the wrong forum.)
Last edited by Stylish; Nov 18th, 2008 at 6:50 pm.
Well, if you want to use a database (I don't know how deep you want to go with this), you could use two tables:
schedule
    SCHEDULE_ID
    SCRAPER_ID
    LAST_RUN_TIME

scrapers
    SCRAPER_ID
    SCRAPER_NAME
    INTERVAL_SECONDS
    EXCLUDE_DATE
Then you can check whether LAST_RUN_TIME + INTERVAL_SECONDS <= current time.
If it is, the scraper is now due to start.
Then check your exclude date:
if (date != EXCLUDE_DATE)
{
    run the scraper
}
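As a rough illustration only, the check could look something like this in Python. The `Scraper` class and `is_due` function are hypothetical names made up for this sketch, the fields simply mirror the two tables, and it assumes a run is due once LAST_RUN_TIME + INTERVAL_SECONDS is at or before the current time, and that a scraper that has never run is due immediately.

```python
from dataclasses import dataclass
from datetime import date, datetime, timedelta
from typing import Optional


@dataclass
class Scraper:
    """Hypothetical in-memory view of one scrapers row joined to its schedule row."""
    scraper_id: int
    scraper_name: str
    interval_seconds: int
    exclude_date: Optional[date]        # single excluded date, e.g. Christmas Day
    last_run_time: Optional[datetime]   # None means the scraper has never run


def is_due(scraper: Scraper, now: datetime) -> bool:
    """Return True when the scraper should be started at `now`."""
    # Skip the whole day if it matches the excluded date.
    if scraper.exclude_date is not None and now.date() == scraper.exclude_date:
        return False
    # Assumption: a scraper with no recorded run is due right away.
    if scraper.last_run_time is None:
        return True
    # Due once the interval has fully elapsed since the last run.
    next_run = scraper.last_run_time + timedelta(seconds=scraper.interval_seconds)
    return next_run <= now
```

A polling loop would call `is_due` for each scraper every few seconds and update LAST_RUN_TIME when it starts one. A single EXCLUDE_DATE column only covers one date per scraper, so several holidays or a Monday-Friday rule would need more than this.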
I'm not sure what you mean by best practices or reading materials. Do you mean for cron jobs?
This might help:
http://www.adminschoice.com/docs/crontab.htm
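For the history part of the question (when did the last run start and finish, and is it still executing?), one possible approach is to write a row per run rather than a single last-run timestamp. The sketch below uses Python's built-in sqlite3 module purely for illustration. The `run_history` table, its columns, and the helper function names are all assumptions made up for this example, not part of the schema above.

```python
import sqlite3
from datetime import datetime


def open_history(path=":memory:"):
    """Open the database and create the hypothetical run_history table if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS run_history (
               run_id      INTEGER PRIMARY KEY AUTOINCREMENT,
               scraper_id  INTEGER NOT NULL,
               started_at  TEXT NOT NULL,
               finished_at TEXT,            -- NULL while the run is in progress
               status      TEXT NOT NULL    -- 'running', 'ok' or 'failed'
           )"""
    )
    return conn


def start_run(conn, scraper_id, now):
    """Record that a run has started and return its run_id."""
    cur = conn.execute(
        "INSERT INTO run_history (scraper_id, started_at, status) "
        "VALUES (?, ?, 'running')",
        (scraper_id, now.isoformat()),
    )
    conn.commit()
    return cur.lastrowid


def finish_run(conn, run_id, now, ok=True):
    """Mark a run as finished, successfully or not."""
    conn.execute(
        "UPDATE run_history SET finished_at = ?, status = ? WHERE run_id = ?",
        (now.isoformat(), "ok" if ok else "failed", run_id),
    )
    conn.commit()


def is_running(conn, scraper_id):
    """Return True if the scraper has a run that has not finished yet."""
    row = conn.execute(
        "SELECT 1 FROM run_history "
        "WHERE scraper_id = ? AND status = 'running' LIMIT 1",
        (scraper_id,),
    ).fetchone()
    return row is not None
```

Checking `is_running` before starting a scraper is one way to avoid overlapping runs when a scrape takes longer than its interval. A run that crashes without calling `finish_run` would stay marked as running, so a real system would probably also want a timeout or cleanup step.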