Mar 8th, 2006

Avoiding the duplicate content issue

I have 400,000 content-light pages. Many contain only an image and a very short description that is legally regulated. I want to add unique, keyword-rich content to the left sidebar of every page. However, I don't want to handwrite 400,000 pages of unique content. I've thought about writing one page of content and then having a script scramble the words and add similar words and hyperlinks according to a set of rules.

Any other ideas would be very much appreciated.
Reputation Points: 10
Solved Threads: 1
Light Poster
shimon
46 posts
since Mar 2006
Mar 8th, 2006

Re: Avoiding the duplicate content issue

Why not do something like madlibs? Configure about 10 or 20 unique variables describing each page. Then, write a paragraph which contains those variables, effectively giving you 400,000 unique paragraphs.

For example:

page 1:
$title = "Yorkshire Terrier"
$price = "$800"
$weight = 6

page 2:
$title = "Maltese Spaniel"
$price = "$1200"
$weight = 4

output:
"Welcome to my page about the $title dog. This breed usually costs around $price. Adult $title's typically weigh about $weight pounds."

If you configure enough variables, you'll end up with a lot of keyword-rich, unique content. If done right, the only words the pages will really have in common are words like "and" and "the" and other short phrases, which the search engines will most likely dismiss in favor of the unique, quality keywords.
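The substitution described above could be sketched roughly as follows. This is only an illustration in Python, not code from the thread: it uses the standard library's `string.Template`, whose `$name` placeholders happen to match the notation in the example, and the `PARAGRAPH`, `pages`, and `render` names are made up for the sketch.

```python
from string import Template

# Sentence template; the $-placeholders mirror the variables in the post.
PARAGRAPH = Template(
    "Welcome to my page about the $title dog. "
    "This breed usually costs around $price. "
    "Adult $title's typically weigh about $weight pounds."
)

# Hypothetical per-page variables, copied from the example above.
pages = [
    {"title": "Yorkshire Terrier", "price": "$800", "weight": 6},
    {"title": "Maltese Spaniel", "price": "$1200", "weight": 4},
]


def render(page):
    """Fill the template with one page's variables.

    substitute() raises KeyError if a variable is missing, so an
    incomplete page fails loudly instead of publishing a raw "$title".
    """
    return PARAGRAPH.substitute(page)


paragraphs = [render(page) for page in pages]

for text in paragraphs:
    print(text)
```

With only three variables the two paragraphs still share most of their words, which is why the post suggests 10 or 20 variables per page.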
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 163
The Queen of DaniWeb
cscgal
13,646 posts
since Feb 2002
Mar 8th, 2006

Re: Avoiding the duplicate content issue

Just to add: If you have your products (or whatever each unique page is) categorized and organized in the database, just use that information to generate the variables. That way it's much less work than punching in the variables for each page by hand.
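As a rough sketch of pulling the variables from a database instead of typing them in, something like the following could work. The `products` table, its column names, and the in-memory SQLite database are all assumptions made for illustration; the thread does not say what the real schema or database looks like.

```python
import sqlite3
from string import Template

PARAGRAPH = Template(
    "Welcome to my page about the $title dog. "
    "This breed usually costs around $price. "
    "Adult $title's typically weigh about $weight pounds."
)


def load_pages(conn):
    """Yield one dict of template variables per product row."""
    conn.row_factory = sqlite3.Row  # allows access to columns by name
    rows = conn.execute(
        "SELECT title, price_usd, weight_lb FROM products ORDER BY id"
    )
    for row in rows:
        yield {
            "title": row["title"],
            "price": f"${row['price_usd']}",
            "weight": row["weight_lb"],
        }


# Hypothetical schema and sample rows, standing in for the real catalog.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE products ("
    "id INTEGER PRIMARY KEY, title TEXT, price_usd INTEGER, weight_lb INTEGER)"
)
conn.executemany(
    "INSERT INTO products (title, price_usd, weight_lb) VALUES (?, ?, ?)",
    [("Yorkshire Terrier", 800, 6), ("Maltese Spaniel", 1200, 4)],
)

paragraphs = [PARAGRAPH.substitute(page) for page in load_pages(conn)]

for text in paragraphs:
    print(text)
```

Because `load_pages` is a generator, rows are turned into paragraphs one at a time rather than loading all 400,000 into memory first.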
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 163
The Queen of DaniWeb
cscgal
13,646 posts
since Feb 2002
Mar 18th, 2006

Re: Avoiding the duplicate content issue

Go cscgal!
Reputation Points: 10
Solved Threads: 0
Newbie Poster
sirKel
6 posts
since Mar 2006
Mar 18th, 2006

Re: Avoiding the duplicate content issue

Haha thanks
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 163
The Queen of DaniWeb
cscgal
13,646 posts
since Feb 2002
Mar 18th, 2006

Re: Avoiding the duplicate content issue

Automatically generating pages in the manner described is a big search engine optimization no-no. Sure, it may appear to be the convenient way to solve a problem, but at what expense? Obviously the web pages are not performing as you would like them to and content is lacking. So build content.

I like this place and I support it in my way everyday. Yet, it surprises me to no end what I read sometimes in this forum.

Doesn't anybody even glance at what the SEs write about which design practices and SEO techniques are and are not acceptable? Search engines do ban web sites once in a while, not as frequently as I would like to see, but they make their power known on occasion.

So, now you have 400,000 pages that need textual content. If you were looking for a shortcut to resolve your massive copy troubles, here is one: Do you remember how to use Notepad?
Moderator
Featured Poster
Reputation Points: 180
Solved Threads: 26
Occupy Search Engines
canadafred
962 posts
since Feb 2006
Mar 18th, 2006

Re: Avoiding the duplicate content issue

Just earlier this month, I went to the Search Engine Strategies conference, where a group of us discussed this specific situation. An SEO firm was in the same position with a client. Several of us, including << on second thought, snipping their names, but they are very well respected in the SEO industry >> and a number of other SEOs (who were speakers at the seminars), talked the issue over, and this madlibs thing is the idea we came up with. It was pure coincidence that shimon posted about the same situation only weeks later.

Basically what I'm saying is: Does this technique work *right now*? I believe, yes, it definitely does. Will it always work? Probably not. Does it follow your ethical standards? Probably not. Is it conducive to a highly usable website that is more optimized for the human visitor than for the search engines? Most likely not. But for those who don't have the time to write 400,000 paragraphs ... or when there is simply no ROI in the time investment in doing so ... this is a viable solution.
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 163
The Queen of DaniWeb
cscgal
13,646 posts
since Feb 2002
Mar 27th, 2006

Re: Avoiding the duplicate content issue

If everyone completely and absolutely avoided using duplicate content, Google would have around one million pages indexed by now, if that. You could use duplicate search results from any number of sources. There are many article sites that want you to use their content.

I don't know anything about your site or how it works. Are any search variables passed? For instance, if a user searched for "great big widgets", that search term could be passed to another application that would show relevant content about great big widgets. In other words, this can all be automated.
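A minimal sketch of that idea, assuming the search phrase arrives in a `q` query-string parameter and that matching content lives in a simple keyword lookup. Both are assumptions for illustration, as are the `SNIPPETS`, `search_terms`, and `relevant_snippets` names and the example.com URL.

```python
from urllib.parse import parse_qs, urlparse

# Hypothetical content store: keyword -> blurb to show in the sidebar.
SNIPPETS = {
    "widgets": "Our widgets come in small, medium, and great big sizes.",
    "gadgets": "Gadgets ship with a one-year warranty.",
}


def search_terms(url, param="q"):
    """Extract the lowercase search words from a URL's query string."""
    query = parse_qs(urlparse(url).query)
    phrase = query.get(param, [""])[0]
    return phrase.lower().split()


def relevant_snippets(url):
    """Return every blurb whose keyword appears in the search phrase."""
    terms = search_terms(url)
    return [text for keyword, text in SNIPPETS.items() if keyword in terms]


url = "http://example.com/search?q=great+big+widgets"
print(search_terms(url))
print(relevant_snippets(url))
```

A real site would likely match against a search index rather than exact keywords, but the flow is the same: read the term, look up related content, and render it on the page.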

The idea about the garbled words is a great idea for the robots. The only down-side would be that the content wouldn't really make sense to users.

I agree that you're not going to set the world on fire with duplicate content, but it could help get those pages off the ground in a more cost-effective way than contracting original content.
Reputation Points: 10
Solved Threads: 0
Newbie Poster
mj99
4 posts
since Mar 2006
Mar 27th, 2006

Re: Avoiding the duplicate content issue

When I said madlibs, I certainly was not referring to the fact that madlibs rarely make sense. By all means, it is more important that the content make sense to the web surfer than to the search engines. However, variables can be substituted into sentences to form multiple sentences, each referring to something different. Take the example in my first post of this thread.
Last edited by cscgal; Mar 27th, 2006 at 6:49 pm.
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 163
The Queen of DaniWeb
cscgal
13,646 posts
since Feb 2002
Mar 27th, 2006

Re: Avoiding the duplicate content issue

Gotcha. It's a great idea.
Reputation Points: 10
Solved Threads: 0
Newbie Poster
mj99
4 posts
since Mar 2006

© 2011 DaniWeb® LLC