| | |
Plan for parsing HTML
Please support our HTML and CSS advertiser: PostgreSQL or MySQL? Compare and contrast the two most popular open source databases
![]() |
Hi! I just inherited a rather large legacy site here at work that has no database behind it. It's a large volume of HTML pages with the content written right into the HTML page. I need to extract the content and bring it into a database, or XML files.
Each section of the HTML pages has header tag and a standard title, so I'm thinking I should write a perl script to parse the pages based on header tags and insert them into MYSQL.
Before I begin, I thought I'd check with you guys to see if you have had any similar experience and recommendations.
Thanks!
Tom Tolleson
Each section of the HTML pages has header tag and a standard title, so I'm thinking I should write a perl script to parse the pages based on header tags and insert them into MYSQL.
Before I begin, I thought I'd check with you guys to see if you have had any similar experience and recommendations.
Thanks!
Tom Tolleson
Last edited by Tom Tolleson; Dec 3rd, 2008 at 12:02 pm. Reason: typo
![]() |
Similar Threads
- Parsing html form. (PHP)
Other Threads in the HTML and CSS Forum
- Previous Thread: Text box mouseover
- Next Thread: Frames and Framesets or ???
| Thread Tools | Search this Thread |
Tag cloud for HTML and CSS
appointments asp background backgroundcolor beta browser bug calendar cart cgi code codeinjection corporateidentity create css design development displayimageinsteadofflash dreamweaver drupal emailmarketing epilepsy explorer firefox flash font fonts form format google griefers hackers hitcounter hover html ide ie7 ie8 iframe image images internet internetexplorer intranet iphone javascript jpeg layout macbook maps marketshare microsoft mozilla multimedia navigationbars news offshoreoutsourcingcompany opacity opera optimization perl pnginie6 positioning problem scroll seo shopping studio swf swf. templates textcolor theme timecolor titletags url urlseparatedwords visual visualization web webdevelopment webform website windows7 wordpress xml xsl






