externalaw 0 Newbie Poster

OK,

Please advise me,

I have 1500 pages of static pages, each with content related to a keyword.

I have the list of keywords and copy those in a column in excel,

Then I upload the 1500 pages of content

And i do this

get file wget -i ./1 and download

remove 1st 6 lines all files sed '1d' -i *.htm;

remove html tags

sed -e :a -e 's/<[^>]*>//g;/</N;//ba'

Add to the top line to show next content starts <%$%>

sed -i 1i"<%$%>" *.html

Now I

for i in 'cat *.html';do $i >> fileb; done


AND NOW I AM IN TROUBLE

How the hood do I get these into nice columns? So each text between <%$%> should be in its own column,

I then want to past this next to the keywords column I already have and VOILA a glossary is born.

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.