Please support our Linux Servers and Apache advertiser: Programming Forums
![]() |
•
•
Join Date: Jul 2005
Posts: 1
Reputation:
Rep Power: 0
Solved Threads: 0
My Robots.txt Was Not Blocking Out Sites From Any Search Engine. A Friend Suggested That I Should Create An .htaccess File
During A Web Search Using Google, Yahoo, Excite, Msn, Altavista, Hotbot, Infoseek, Lycos, Overture, Ask Jeeves, Teoma, Or Other Search Engines, I "do Not" Want These Sites To Be Listed. Can You Help Me To Improve My Search So These Cannot Show?
The Administrators Will Not Remove The Postings.
During A Web Search Using Google, Yahoo, Excite, Msn, Altavista, Hotbot, Infoseek, Lycos, Overture, Ask Jeeves, Teoma, Or Other Search Engines, I "do Not" Want These Sites To Be Listed. Can You Help Me To Improve My Search So These Cannot Show?
The Administrators Will Not Remove The Postings.
if You're wanting to totally Block ALL Robots, from MY understanding of the file, Youshould only need these two lines.
user-agent: *
disallow: /
otherwise, Note the changes I made in Your htaccess file
post back, lemme know how things go for ya
user-agent: *
disallow: /
otherwise, Note the changes I made in Your htaccess file
post back, lemme know how things go for ya
•
•
•
•
Originally Posted by Ck
Ye Men of Honour, Knowledge, Wisdom, and Truth:
Be Dead To This World
•
•
Join Date: Sep 2005
Posts: 9
Reputation:
Rep Power: 0
Solved Threads: 0
If you are using a global header you may try using this meta tag as well:
<META NAME="ROBOTS" CONTENT="INDEX,NOFOLLOW">
But really you really only need to use that when you have files in a directory that you don't want to put in your robots.txt file, and want to have a tidy robots file.
not sure what using .htaccess gains you, unless you are willing to play around with rules based on user agents, which may end up blocking alot of traffic you don't really want to and also would take alot of work to keep up with all the different robot agents out there. Not trivial.
if your friend meant to do an .htaccess to password protect it that's a different matter and kind of a different issue, but what jusCk said in his post about the robots.txt should really be sufficient unless you're running into bots with bad etiquette.
<META NAME="ROBOTS" CONTENT="INDEX,NOFOLLOW">
But really you really only need to use that when you have files in a directory that you don't want to put in your robots.txt file, and want to have a tidy robots file.
not sure what using .htaccess gains you, unless you are willing to play around with rules based on user agents, which may end up blocking alot of traffic you don't really want to and also would take alot of work to keep up with all the different robot agents out there. Not trivial.
if your friend meant to do an .htaccess to password protect it that's a different matter and kind of a different issue, but what jusCk said in his post about the robots.txt should really be sufficient unless you're running into bots with bad etiquette.
![]() |
Similar Threads
Other Threads in the Linux Servers and Apache Forum
- Flash Player stop working since Ad-aware (Web Browsers)
- Internet Explorer not working. (Web Browsers)
- Flash Player stop working and won't reinstall (Windows NT / 2000 / XP / 2003)
- Acer laptop keys not working (Troubleshooting Dead Machines)
- messanger not working. (Web Browsers)
- cd burner not working (Storage)
- Working on new design (DaniWeb Community Feedback)
Other Threads in the Linux Servers and Apache Forum
- Previous Thread: mod rewrite problem (tomcat with od_jk as well)
- Next Thread: Mod_Rewrite help!
•
•
•
•
Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)





Linear Mode