Apache Development
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
User Name:
Password:
Remember me
Go Back   Dev Shed ForumsSystem AdministrationApache Development

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread Dev Shed Forums Sponsor:
  #1  
Old December 10th, 2001, 07:27 PM
mezz mezz is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Oct 2001
Posts: 310 mezz User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 8
What's better way to stop Spider/Robot?

I am wondering what's best way to stop (trap) them? Using the mod_rewrite, perl script or what? My opinion, put something in Apache is the best method to trap them.

Thanks,
Mezz

Reply With Quote
  #2  
Old December 10th, 2001, 09:26 PM
freebsd freebsd is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jan 2001
Posts: 5 freebsd User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 0
Just put whatever like BrowserMatchNoCase ^Scooter bad_robot within <IfModule mod_setenvif.c> or define them globally.
Then set Deny from env=bad_robot anywhere you wish, preferably within <Directory>.
Multiple robots can assign the same env so you just need to Deny bad_robot once.

Reply With Quote
  #3  
Old December 11th, 2001, 05:10 PM
mezz mezz is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Oct 2001
Posts: 310 mezz User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 8
Thanks freebsd!

I am going to put "Deny from env=bad_robot" in <Directory /></Directory>, so is it good idea? I will have to test this afternoon to see if it will stop the spider/robot.


<IfModule mod_setenvif.c>
BrowserMatch "Mozilla/2" nokeepalive
BrowserMatch "MSIE 4\.0b2;" nokeepalive downgrade-1.0 force-response-1.0
BrowserMatch "RealPlayer 4\.0" force-response-1.0
BrowserMatch "Java/1\.0" force-response-1.0
BrowserMatch "JDK/1\.0" force-response-1.0
BrowserMatchNoCase ^Scooter bad_robot
</IfModule>

Reply With Quote
  #4  
Old December 11th, 2001, 05:20 PM
mezz mezz is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Oct 2001
Posts: 310 mezz User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 8
Umm, I already tested it recently. I have decided to do it right now and it doesn't works, because the spider tool that I have can fool the user_agent. When I put the it as "Mozilla/4" then it can spider/robot on server as well, so is there better way or just impossible to stop it?

Thanks,
Mezz

Reply With Quote
  #5  
Old December 12th, 2001, 12:37 AM
freebsd freebsd is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jan 2001
Posts: 5 freebsd User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 0
>> so is there better way or just impossible to stop it?

Impossible.

Reply With Quote
Reply

Viewing: Dev Shed ForumsSystem AdministrationApache Development > What's better way to stop Spider/Robot?


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump


Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 





© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 6 hosted by Hostway
Stay green...Green IT