Display content based on IP

Hey.

I have a user-submitted website with a report button, and the Google crawler keeps spamming that button. I am aware of robots.txt, but nothing seems to stop Google from clicking it. So instead, I'd like to assign the link an ID and hide it from Google's IP addresses.

Can this be done?

Thanks.

Last updated: 7 months ago
louie540

@louie540

I have a user-submitted website with a report button, and the Google crawler keeps spamming that button. I am aware of robots.txt, but nothing seems to stop Google from clicking it. So instead, I'd like to assign the link an ID and hide it from Google's IP addresses.

This question is better suited to the Internet Marketing section than the PHP section. I understand what you mention about robots.txt, but if you have PHP code, it is fine to post it here in the PHP section. Otherwise, post the question there.

LastMitch

Google will pry into your life no matter how much you try to stop them... Just sayin'.

leviathan185

You could achieve this with the user agent maybe?
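
A minimal sketch of the user-agent idea, written in Python rather than PHP for illustration. The token list, the function names, the `/report` URL and the `report-link` ID are all assumptions, since the thread does not show the site's code. Keep in mind that the user-agent header is supplied by the client, so anything can claim to be (or not to be) Googlebot.

```python
# Hypothetical list of crawler tokens to look for in the User-Agent header.
CRAWLER_TOKENS = ("googlebot", "bingbot")


def should_show_report_link(user_agent):
    """Return False when the User-Agent string looks like a known crawler."""
    ua = (user_agent or "").lower()
    return not any(token in ua for token in CRAWLER_TOKENS)


def render_report_link(user_agent, post_id):
    """Return the report link HTML, or an empty string for crawlers.

    The URL layout and element ID are made up for this example.
    """
    if not should_show_report_link(user_agent):
        return ""
    return '<a id="report-link" href="/report?id={}" rel="nofollow">Report</a>'.format(
        int(post_id)
    )


if __name__ == "__main__":
    googlebot_ua = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
    browser_ua = "Mozilla/5.0 (Windows NT 6.1; rv:16.0) Gecko/20100101 Firefox/16.0"
    print(repr(render_report_link(googlebot_ua, 7)))
    print(repr(render_report_link(browser_ua, 7)))
```

In a PHP page the same check would read the user agent from `$_SERVER['HTTP_USER_AGENT']` before printing the link.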

Echo89

Hi
Googlebot will ALWAYS comply with robots.txt directives.
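
To check how a given set of robots.txt rules would be read for the report URL, here is a rough sketch using Python's standard `urllib.robotparser`. The `/report` path and the rules themselves are assumptions, since the thread does not show the real site's URLs or robots.txt. Python's parser is not Google's parser, so treat the answer as an approximation.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the real site's paths are not shown in the thread.
EXAMPLE_RULES = """\
User-agent: *
Disallow: /report
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    print(is_allowed(EXAMPLE_RULES, "Googlebot", "/report?id=5"))
    print(is_allowed(EXAMPLE_RULES, "Googlebot", "/story/12"))
```

If the report URL comes back as allowed, the Disallow rule does not cover it and needs adjusting.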

I think you may be suffering from a Googlebot impersonation attempt.

Here is a link to learn more about that:
http://www.incapsula.com/the-incapsula-blog/item/369-was-that-really-a-google-bot-crawling-my-site

My advice is:

  • Use Google Webmaster Tools (WMT) and do a manual "Fetch" for the button URL. If you succeed, then your robots.txt is not really blocking it and you'll need to modify it accordingly.

  • If "Fetch" fails (meaning your robots.txt settings are fine), then you should get the user-agent and IP info of this "Googlebot" and verify them on http://www.Botopedia.org

If the user-agent data or IPs don't match, you'll know that something is wrong. In that case the visiting IP should be blocked, because this is either a spam attempt or a vulnerability scan performed for future hacking purposes (something like Havij, for example).
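
As an alternative to looking the visitor up by hand, the IP can also be checked in code. The sketch below does not use Botopedia; it uses a different technique, the reverse-then-forward DNS check that Google describes for verifying Googlebot. The function name and the injectable lookup parameters are my own, added so the logic can be tried without network access. It is IPv4-only as written.

```python
import socket

# Hostname suffixes Google documents for its crawlers.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_verified_googlebot(ip,
                          reverse_lookup=socket.gethostbyaddr,
                          forward_lookup=socket.gethostbyname_ex):
    """Return True if ip reverse-resolves to a Google host that resolves back to ip.

    reverse_lookup and forward_lookup default to the real socket calls and
    can be replaced with fakes for testing.
    """
    try:
        # Step 1: reverse DNS, IP -> hostname.
        hostname = reverse_lookup(ip)[0]
        if not hostname.lower().rstrip(".").endswith(GOOGLE_SUFFIXES):
            return False
        # Step 2: forward DNS, hostname -> list of IPs.
        forward_ips = forward_lookup(hostname)[2]
    except OSError:
        # Covers socket.herror and socket.gaierror (lookup failures).
        return False
    # Step 3: the original IP must be among the forward results.
    return ip in forward_ips
```

A request that claims to be Googlebot in its user agent but fails this check is a candidate for blocking, as described above.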

Igal-Incapsula

© 2013 DaniWeb® LLC