954,359 Members — Technology Publication meets Social Media
Username:
Password:
Lost login information?
Have something to say? Contribute New Article Reply to this Article

How to stop a crawler from crawling my website

Hi,

My website is being crawled by a crawler by name yandex.ru almost 24 hours a day. How can I stop it from crawling my website. My website is USA based. Hence, I don't want this crawler to crawl my website all the time.

Any advice please.

Thanks in advance

Allison2009
Junior Poster
148 posts since Jul 2009
Reputation Points: 10
Solved Threads: 3
 

Create a robots.txt file with the following content. This will prevent Yandex from crawling your site.

User-agent: Yandex
Disallow: /

User-agent: Yandex/1.01.001
Disallow: /

inspiroHost
Newbie Poster
16 posts since Jan 2010
Reputation Points: 10
Solved Threads: 1
 

edit your robots.txt file

dailyearner
Junior Poster
Banned
132 posts since Apr 2009
Reputation Points: -6
Solved Threads: 4
Infraction Points: 20
 

Thanks to all the post, esp., to inspirohost. Now I have implemented it. Now that spider is not crawling my website.

Thanks again

Allison2009
Junior Poster
148 posts since Jul 2009
Reputation Points: 10
Solved Threads: 3
 

Very useful information, thanks.

julibe
Newbie Poster
3 posts since Feb 2010
Reputation Points: 10
Solved Threads: 1
 

Thanks to all the post, esp., to inspirohost. Now I have implemented it. Now that spider is not crawling my website.

Thanks again

No problem at all, glad to help :)

inspiroHost
Newbie Poster
16 posts since Jan 2010
Reputation Points: 10
Solved Threads: 1
 

write your robots.txt file and stop it.

pmolds
Newbie Poster
4 posts since Feb 2010
Reputation Points: 10
Solved Threads: 1
 

You can also do it on per page basis..put meta robot tag in head section of your page...

ameto
Newbie Poster
20 posts since Sep 2009
Reputation Points: 10
Solved Threads: 1
 

Is there a drawback to restricting Yandex? i.e. Is anyone seeing quality traffic from the site?

vicjg
Newbie Poster
2 posts since Mar 2010
Reputation Points: 10
Solved Threads: 1
 

you can use robot.txt file to stop crawling particular pages.

joelchrist
Posting Whiz
345 posts since Mar 2010
Reputation Points: 2
Solved Threads: 8
 

You could block them in your robots.txt file if such crawlers obey the rules. If not, I would recommend you to block them directly via IP blocker inside your hosting control panel.

Have a nice day,

AirForceOne
Posting Pro in Training
457 posts since Jun 2009
Reputation Points: 19
Solved Threads: 15
 

Hi,

My website is being crawled by a crawler by name yandex.ru almost 24 hours a day. How can I stop it from crawling my website. My website is USA based. Hence, I don't want this crawler to crawl my website all the time.

Any advice please.

Thanks in advance

nice! your site is continuly indexing.what strategy you are using ? good knowledge i got here.

thanks

skseo
Junior Poster in Training
84 posts since Feb 2010
Reputation Points: 7
Solved Threads: 2
 

This question has already been solved

Post: Markdown Syntax: Formatting Help
You
View similar articles that have also been tagged: