943,505 Members | Top Members by Rank

Ad:
You are currently viewing page 1 of this multi-page discussion thread
Jun 14th, 2004
0

Using a robots.txt to block links

Expand Post »
I would like to maximize the PR on my forum display and thread display pages, without sacrificing PR to less important pages. (for instance, the newthread.php page)

Now suppose there are 5 links on page A. Normally, page A's PR will be spread out to the 5 pages. But now if I added a robots.txt file which blocked indexing of two of the links. Would page A's PR now be spread less thin among 3 pages? Or would it be spread just as thin, but 2 of the pages would be entitled to a share of PR but just wouldn't use it.

I hope this sorta made a bit of sense.
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 162
The Queen of DaniWeb
cscgal is online now Online
13,645 posts
since Feb 2002
Jun 14th, 2004
0

Re: Using a robots.txt to block links

I sent you a PM RE: this topic.
Reputation Points: 16
Solved Threads: 2
Junior Poster
Arizona Web is offline Offline
118 posts
since Jun 2004
Jun 14th, 2004
0

Re: Using a robots.txt to block links

And I started this thread RE: your PM
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 162
The Queen of DaniWeb
cscgal is online now Online
13,645 posts
since Feb 2002
Jun 14th, 2004
0

Re: Using a robots.txt to block links

Ok I am a knuckelhead. I guess this was a chicken before the egg thing. ;-)
Reputation Points: 16
Solved Threads: 2
Junior Poster
Arizona Web is offline Offline
118 posts
since Jun 2004
Jun 14th, 2004
0

Re: Using a robots.txt to block links

The robots.txt file I'm currently using is in my root directory (not my forum root) and looks like this:

User-agent: googlebot
Disallow: /techtalkforums/announcement.php
Disallow: /techtalkforums/faq.php
Disallow: /techtalkforums/forumdisplay.php
Disallow: /techtalkforums/login.php
Disallow: /techtalkforums/member.php
Disallow: /techtalkforums/newreply.php
Disallow: /techtalkforums/newthread.php
Disallow: /techtalkforums/online.php
Disallow: /techtalkforums/printthread.php
Disallow: /techtalkforums/search.php
Disallow: /techtalkforums/showthread.php

I am disallowing access to showthread.php and forumdisplay.php because I would rather Google only spider the .html mod_rewrite versions of the forums and threads, and therefore not get duplicate content. Was this done correctly? Am I excluding the correct things?
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 162
The Queen of DaniWeb
cscgal is online now Online
13,645 posts
since Feb 2002
Jun 14th, 2004
0

Re: Using a robots.txt to block links

without going too far in, it looks decent to me. I am not sure however, if the robots.txt blocks the weakening link pop from all the links. The less links on a page the more potent the links are. A page with tons of links is spreading the pop thin. That would be a good question to ask SEO-Guy.
Reputation Points: 16
Solved Threads: 2
Junior Poster
Arizona Web is offline Offline
118 posts
since Jun 2004
Jun 14th, 2004
0

Re: Using a robots.txt to block links

It would be very nice if the robots.txt would block the weakening spread of PR. However, even if it doesn't do this, it would still be valuable because it would eliminate spidering duplicate content (i.e. showthread.php?t=10 and thread10.html)
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 162
The Queen of DaniWeb
cscgal is online now Online
13,645 posts
since Feb 2002
Jun 14th, 2004
0

Re: Using a robots.txt to block links

Quote originally posted by cscgal ...
It would be very nice if the robots.txt would block the weakening spread of PR. However, even if it doesn't do this, it would still be valuable because it would eliminate spidering duplicate content (i.e. showthread.php?t=10 and thread10.html)
*nods* for sure.
Reputation Points: 16
Solved Threads: 2
Junior Poster
Arizona Web is offline Offline
118 posts
since Jun 2004
Jun 18th, 2004
0

Re: Using a robots.txt to block links

I don't see how this will help at all.
Reputation Points: 10
Solved Threads: 0
Newbie Poster
AlanM is offline Offline
10 posts
since Jun 2004
Jun 19th, 2004
0

Re: Using a robots.txt to block links

Google frowns upon multiple pages with the same content. For example, if two different URLs have the exact same content on them, google considers it spamming their search engine. This forum uses Apache's mod_rewrite to rewrite URLs to have a .html extension for search engine purposes. Therefore, the webpage showthread.php?t=100 is the exact same thing as thread100.html - if google spiders see this duplicate contact, they will think that daniweb.com is trying to inflate its page count in google by having multiple URLs with the same content. However, by using robots.txt to block google from spidering the showthread.php pages, google only spiders the pages ending in .html - and therefore doesn't penalize us for duplicate content.
Administrator
Staff Writer
Reputation Points: 1422
Solved Threads: 162
The Queen of DaniWeb
cscgal is online now Online
13,645 posts
since Feb 2002

This thread is more than three months old

No one has posted to this discussion for at least three months. Please let old threads die and do not reply to them unless you feel you have something new and valuable to contribute that absolutely must be added to make the discussion complete. Otherwise, please start a new thread in this forum instead.
Message:
Previous Thread in Search Engine Optimization Forum Timeline: Help.
Next Thread in Search Engine Optimization Forum Timeline: Search Engines into LSI





About Us | Contact Us | Advertise | Acceptable Use Policy
Forum Index | Build Custom RSS Feed


Follow us on Twitter


© 2011 DaniWeb® LLC