Development Software
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
User Name:
Password:
Remember me
Go Back   Dev Shed ForumsWeb Site ManagementDevelopment Software

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
  #1  
Old March 23rd, 2006, 08:44 AM
pdstein pdstein is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jan 2006
Posts: 6 pdstein User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 51 m 38 sec
Reputation Power: 0
Filtering search spider "clicks"

I posted a question over in the PHP forum because that's the language our ad management software is written in, but then when I saw this forum I thought it would probabably be more appropriate for this forum. My apologies. Mods - if this is a violation, then you can delete the topic from the other forum.

The advertising management software we're using on our site does not filter out "clicks" incurred when search spiders follow the ad links. I'm trying to figure out the best way to do that. I have some ideas, but it doesn't seem like any of them are a complete solution.

1) Use a robots.txt file to tell search spiders not to access the URL of the click processing script. This would probably reduce spider clicks dramatically, but it won't do anything about the spam bots that ignore robots.txt

2) Check the HTTP_REFERRER variable and throw out clicks with no referrer. After some limited observation, I found that while this did eliminate all the spider clicks it also threw out about half the legitimate clicks. I guess some browsers do not send the referrer data.

3) Check the IP address or user-agent against a list of known spiders. The downside of this is that the list has to be constantly updated.

4) Check the time of the last click from that IP address. If the click is within a second of the last click, disregard it, and if comparing to a list of known bots, add the IP to the list. The downside of this is some people use "browser accelerators" that automatically follow links on a page so that when the user clicks on a link that page has already loaded. It's difficult (impossible?) to distinguish between that users real clicks and when their accelerator is following the ad links.

Any suggestions as to a better approach? Anyone know how the server stats programs differentiate between real visitors and search spiders?

Thanks!

Reply With Quote
Reply

Viewing: Dev Shed ForumsWeb Site ManagementDevelopment Software > Filtering search spider "clicks"


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump

 Free IT White Papers!
 
Accelerating Trading Partner Performance
One in five. That's how many partner transactions have at least one error. That is an amazing statistic, particularly given the extraordinary leaps in innovation across the global supply chain during the past two decades. Download this white paper to learn more.

 
Competing on Analytics
This Tech Analysis is designed to help identify characteristics shared by analytics competitors, and includes information about 32 organizations that have made a commitment to quantitative, fact-based analysis.

 
Cost Effective Scaling with Virtualization and Coyote Point Systems
An overview of the industry trend toward virtualization, how server consolidation has increased the importance of application uptime and the steps being taken to integrate load balancing technology with virtualized servers.

 
Five Checkpoints to Implementing IP Telephony
Implementation planning for IP PBX software and IP telephony has become vital as businesses replace discontinued legacy PBX phone systems. This informative whitepaper outlines five "checkpoints" for any implementation plan that will help make IP communications a successful proposition.

 
Hosted Email Security: Staying Ahead of New Threats
In the last two years, email has become a fierce battleground between the nefarious forces of spam and malware, and the heroes of messaging protection. The spam volumes increased alarmingly every month, bringing clever new forms of phishing and virus propagation attacks.

 

Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 





© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 6 hosted by Hostway