Web Hosting Talk

View Full Version : Block spider / user for ever


Webgypsy
10-21-2003, 04:06 AM
How can I block 230-216.205.94.dellhost.com from accessing my sites? I think he is going to try to copy them. Dellhost isn't responding.

Adding him to robots.txt is not an option, because he ignores it. I want to block him (and other future problems), not ask him not to spider my site.

Can I add something to my httpd.conf to deny access?

sprintserve
10-21-2003, 06:59 AM
Why don't you just add his IP to your firewall and drop all traffic from it?

Webgypsy
10-21-2003, 07:06 AM
I don't have a firewall, but I am trying to convince my technical support I need one. He thinks it is not needed and too complicated to install. (FreeBSD).

sprintserve
10-21-2003, 07:26 AM
You need a new technical support guy...

That aside, as a temporary measure you can use an .htaccess file in your website with Apache's host-based access control (mod_access).

Order Allow,Deny
Allow from all
Deny from xxx.xxx.xxx.xxx


Replace the x's with the real IP. If he is on a dynamic IP, you can block a range instead, but note that this typically also blocks a lot of innocent users from the same ISP.

For this to work, your Apache config must also permit the override: AllowOverride for that directory has to include Limit (or be set to All).
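
As a rough sketch of both points (the override setting and blocking a range), assuming a hypothetical document root of /usr/local/www/example and the documentation-only network 192.0.2.0/24 as stand-ins for the real values:

```apache
# --- httpd.conf ---
# Allow .htaccess files under this directory to use Order/Allow/Deny.
# The path is a placeholder, not taken from this thread.
<Directory "/usr/local/www/example">
    AllowOverride Limit
</Directory>

# --- .htaccess in the site's document root ---
# Block a whole range instead of one address (192.0.2.0/24 is a placeholder).
Order Allow,Deny
Allow from all
Deny from 192.0.2.0/24
```

Apache has to be restarted or reloaded to pick up the httpd.conf change; the .htaccess file itself is re-read on each request. A partial address such as "Deny from 192.0.2" should match the same range.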

Webgypsy
10-21-2003, 07:41 AM
I'll let him know.

Isn't it possible to add it in httpd.conf without the .htaccess?

I solved another problem (with some help) by adding this to httpd.conf for each domain:

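RewriteEngine on
# Match any request whose User-Agent contains "grub-client"
RewriteCond %{HTTP_USER_AGENT} grub-client
# Redirect those requests to grub.org
RewriteRule .* http://www.grub.org [R,L]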
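
The Order/Allow/Deny lines can be used the same way, directly in httpd.conf, as long as they sit inside a <Directory> (or <Location>) container; no .htaccess file or AllowOverride setting is needed in that case. A minimal sketch, where the path and the address 192.0.2.10 are placeholders rather than real values:

```apache
# httpd.conf, either globally or inside each domain's <VirtualHost> block.
<Directory "/usr/local/www/example">
    Order Allow,Deny
    Allow from all
    # Refuse every request from this one address (placeholder IP).
    Deny from 192.0.2.10
</Directory>
```

Requests from the denied address should get a 403 Forbidden response; everyone else is unaffected.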

HostMidwest
10-21-2003, 12:17 PM
If your website contains important data that you don't want people mining, you may want to consider implementing a human verification script.

E.g., a script that shows an image containing numbers and makes visitors type them into a text box before they can access the site.

I'm sure there are good scripts out there... you may want to look into one.

You could also take more drastic measures, such as a harassment lawsuit...

HostMidwest
10-21-2003, 12:20 PM
I should add that if he's mining a script, you can program it to read environment data about the visitor (such as the remote host). If his address is

230-216.205.94.dellhost.com

just program it to show:

"BEAT IT, LOSER!", instead of posting information, etc.

Webgypsy
10-21-2003, 02:15 PM
Affhosting, if you know a good script, let me know; I can't find one.

The problem is that the User-Agent doesn't say he is a spider (Mozilla/4.0 (compatible; MSIE 4.0; Windows 95)). I can't block on that, because regular visitors would be blocked too.
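
Since the User-Agent is no help here, the block has to key on the address or hostname instead. A sketch of a hostname-based rule, using the hostname from the first post and assuming its reverse DNS stays the same:

```apache
# .htaccess, or inside a <Directory> block in httpd.conf.
Order Allow,Deny
Allow from all
# Match by reverse-DNS hostname rather than by User-Agent.
# Apache does a double reverse DNS lookup for hostname rules,
# which can add some latency.
Deny from 230-216.205.94.dellhost.com
```

Writing "Deny from dellhost.com" instead would match every host under that domain, so it carries the same risk of blocking innocent users as blocking an IP range.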