Search Engine Optimization
 
  #1  
June 5th, 2006, 06:13 AM
bainser
Subscription Based Sites

One of the sites I look after is subscription-only. Apart from the login page, which will describe what the site is about and the information it offers, we don't want search engines to crawl the rest of the site.

Any ideas on how best to set this up? Would it be enough to set the robots.txt file or the robots meta tag to disallow robots from viewing anything other than the front page?

Thanks

  #2  
June 5th, 2006, 07:41 AM
Hombre
Hi,

The robots.txt file is respected by a lot of crawlers, including the big guns such as Google and MSN.

There are some, however, that ignore the instructions and plough through.

I have used PHP access control to restrict page access and have not noticed any restricted pages being indexed. I assume crawlers see a page pretty much as a browser does, so they would not be able to 'view' the restricted areas. Even if a restricted URL did get indexed and someone followed it, the visitor would fail the PHP access check and be redirected to the login area.
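
To show the shape of that kind of access check, here is a minimal sketch. It is written in Python rather than the PHP mentioned above, and it is not the code from any real site: LOGIN_URL, RESTRICTED_PREFIX, the Response class and handle_request are all hypothetical names made up for illustration.

```python
from dataclasses import dataclass
from typing import Optional

LOGIN_URL = "/login"                          # hypothetical login page path
RESTRICTED_PREFIX = "/restricted_directory/"  # hypothetical members-only area


@dataclass
class Response:
    status: int
    location: Optional[str] = None
    body: str = ""


def handle_request(path: str, logged_in: bool) -> Response:
    """Serve public pages to anyone; send anonymous visitors to the login page."""
    if path.startswith(RESTRICTED_PREFIX) and not logged_in:
        # A crawler has no logged-in session, so it ends up here as well.
        return Response(status=302, location=LOGIN_URL)
    return Response(status=200, body=f"content of {path}")
```

The point of the design is that the check runs on the server for every request, so it still holds for crawlers that ignore robots.txt.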

ATB
Comments on this post
Gnome101 agrees: Robots and the login should keep them out! Well said!

  #3  
June 5th, 2006, 08:44 AM
bainser
Thanks Hombre, I can't rep you right now but it's on its way.

So would I just put something like this in the robots.txt file:

User-agent: *
Disallow: /content_folders/

with the intro page placed in the main folder and all other files in the content_folders area? Or is there another way you'd recommend?
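
One way to sanity-check rules like these before uploading them is Python's standard urllib.robotparser module. This is only a sketch: www.example.com, the page names and the "ExampleBot" user agent are placeholders, and it shows how a well-behaved parser reads the rules, not what any particular search engine will do.

```python
from urllib.robotparser import RobotFileParser

ROBOTS_TXT = """\
User-agent: *
Disallow: /content_folders/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the rules above let `agent` fetch `path`."""
    # www.example.com is a placeholder host, not a real site from the thread.
    return parser.can_fetch(agent, "http://www.example.com" + path)


print(allowed("/"))                               # intro page in the main folder
print(allowed("/content_folders/article.html"))   # page inside the blocked folder
```

With these rules the intro page in the main folder stays crawlable, while anything under /content_folders/ is off limits to crawlers that honour the file.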

  #4  
June 5th, 2006, 09:07 AM
Hombre
I use a combination of the methods you mentioned to restrict the crawlers, as follows:

User-agent: *
Disallow: /restricted_directory/
Disallow: /anypage.php

Most of my restricted pages are held in their own directory, and that directory, along with every page inside it, is covered by Disallow: /restricted_directory/

It was easier for me to keep one or two pages that I also didn't want crawled in the root itself, and those are named individually in the robots file: Disallow: /anypage.php
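
If the list of restricted directories and one-off pages grows, the file can be generated rather than edited by hand. The sketch below is a hypothetical helper, not part of any library: build_robots_txt and its parameters are invented here, and it simply reproduces the layout shown above (one Disallow line per directory, with a trailing slash, then one per individual page).

```python
def build_robots_txt(directories, pages, user_agent="*"):
    """Build robots.txt text with one Disallow line per directory and page."""
    lines = [f"User-agent: {user_agent}"]
    for directory in directories:
        # Trailing slash so the rule covers the directory and everything in it.
        lines.append(f"Disallow: /{directory.strip('/')}/")
    for page in pages:
        # Individual pages in the site root are listed by name.
        lines.append(f"Disallow: /{page.lstrip('/')}")
    return "\n".join(lines) + "\n"


robots_txt = build_robots_txt(["restricted_directory"], ["anypage.php"])
print(robots_txt, end="")
```

Called with the directory and page named in this post, it produces the same three lines as the example above.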

ATB
Comments on this post
bainser agrees: there you go, cheers min

© 2003-2008 by Developer Shed. All rights reserved.