Search Engine Optimization
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
User Name:
Password:
Remember me
Go Back   Dev Shed ForumsWeb DesignSearch Engine Optimization

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread Dev Shed Forums Sponsor:
  #1  
Old October 21st, 2006, 04:19 PM
alexo alexo is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jul 2006
Posts: 140 alexo User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 Day 3 h 53 sec
Reputation Power: 3
Send a message via ICQ to alexo
Exclude text from Google indexation

Hello

on my site, i have secondary menu (it's only useful tool for users), which in include in all pages of this site (~30k pages)

file size 12-15kb.

that's why i want to exclude content of this file from google indexation.

how can i do it ?

this is separate file, which i include (php) in all files

is there any code like in adsese is (<!-- google_ad_section_start(weight=ignore) -->) or may be other way?

thank you

P.s. can this file cause any problem for this site. if google check the same file in all pages. (page have ~50-80kb size and of course unique content, title, and description)

Reply With Quote
  #2  
Old October 22nd, 2006, 03:07 AM
KC Tan's Avatar
KC Tan KC Tan is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Sep 2006
Location: Singapore
Posts: 78 KC Tan User rank is Sergeant (500 - 2000 Reputation Level)KC Tan User rank is Sergeant (500 - 2000 Reputation Level)KC Tan User rank is Sergeant (500 - 2000 Reputation Level)KC Tan User rank is Sergeant (500 - 2000 Reputation Level)KC Tan User rank is Sergeant (500 - 2000 Reputation Level) 
Time spent in forums: 11 h 14 m 51 sec
Reputation Power: 10
Send a message via MSN to KC Tan
You should not worry if all the generic pages are of unique content. Google see a webpage as a whole, not as separate files because they see it from the client side.

By the way, you cannot exclude a file (which is part of a page) from Google's index. You can, however, exclude pages from Google bots. Refer to robots.txt if it interests you

http://www.google.com/support/webmasters/bin/answer.py?answer=35303
Comments on this post
displeaser agrees: for robots.txt suggestion

Reply With Quote
  #3  
Old October 22nd, 2006, 03:02 PM
alexo alexo is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jul 2006
Posts: 140 alexo User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 Day 3 h 53 sec
Reputation Power: 3
Send a message via ICQ to alexo
Quote:
You should not worry if all the generic pages are of unique content. Google see a webpage as a whole, not as separate files because they see it from the client side.


yes, may be u are right, but in real, just now google keep in his index only 15-16k pages, and only 87 pages "Results 1 - 87 of about 15,500 from" is showing and all with supplemental mark.

Quote:
By the way, you cannot exclude a file (which is part of a page) from Google's index.


in can do , if i include this file as javascript

Reply With Quote
  #4  
Old October 22nd, 2006, 03:25 PM
displeaser's Avatar
displeaser displeaser is offline
Periodically energetic Perler
Dev Shed Regular (2000 - 2499 posts)
 
Join Date: May 2005
Location: Dublin, Ireland
Posts: 2,266 displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)  Folding Points: 76661 Folding Title: Intermediate FolderFolding Points: 76661 Folding Title: Intermediate FolderFolding Points: 76661 Folding Title: Intermediate FolderFolding Points: 76661 Folding Title: Intermediate Folder
Time spent in forums: 4 Weeks 5 h 23 m 13 sec
Reputation Power: 532
Quote:
Originally Posted by alexo
yes, may be u are right, but in real, just now google keep in his index only 15-16k pages, and only 87 pages "Results 1 - 87 of about 15,500 from" is showing and all with supplemental mark.



in can do , if i include this file as javascript


A robots.txt is easier to do and needs no maintenance as long as the index name stays the same. Keep it simple .

Displeaser
__________________
Vi Veri Veniversum Vivus Vici.

Reply With Quote
  #5  
Old October 22nd, 2006, 03:54 PM
alexo alexo is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jul 2006
Posts: 140 alexo User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 Day 3 h 53 sec
Reputation Power: 3
Send a message via ICQ to alexo
Quote:
A robots.txt is easier to do and needs no maintenance as long as the index name stays the same. Keep it simple .



i'm sorry, can u explain how can i do it via robots.txt ?

1. Disallow: /included.php in robots.txt

2. template for all files is <?php include ("../path/included.php"); ?>

so u think that in this case Gbot don't check this included.php file content ?

i don't think so :-(

Reply With Quote
  #6  
Old October 22nd, 2006, 04:35 PM
displeaser's Avatar
displeaser displeaser is offline
Periodically energetic Perler
Dev Shed Regular (2000 - 2499 posts)
 
Join Date: May 2005
Location: Dublin, Ireland
Posts: 2,266 displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)displeaser User rank is Colonel (50000 - 60000 Reputation Level)  Folding Points: 76661 Folding Title: Intermediate FolderFolding Points: 76661 Folding Title: Intermediate FolderFolding Points: 76661 Folding Title: Intermediate FolderFolding Points: 76661 Folding Title: Intermediate Folder
Time spent in forums: 4 Weeks 5 h 23 m 13 sec
Reputation Power: 532
Google I would imagine is a well behaved crawler so should follow robots.txts rules (read, it does )

See here on how to implement it, See also this tutorial which specifically stops google looking at a page.

Displeaser

Last edited by displeaser : October 22nd, 2006 at 04:36 PM. Reason: spelling, grammer blah blah blah

Reply With Quote
  #7  
Old October 22nd, 2006, 05:44 PM
alexo alexo is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jul 2006
Posts: 140 alexo User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 Day 3 h 53 sec
Reputation Power: 3
Send a message via ICQ to alexo
Seems, that u don't understand me right, or don't know php

php is server side/
Gbot don't understand is this file included or no ..
Gbot only check the last file (in this file u can include 10 other files, but Gbot see only content(text-images) from this files and even don't know in which folders are located this included files)

IMO

Reply With Quote
  #8  
Old October 23rd, 2006, 09:29 AM
KC Tan's Avatar
KC Tan KC Tan is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Sep 2006
Location: Singapore
Posts: 78 KC Tan User rank is Sergeant (500 - 2000 Reputation Level)KC Tan User rank is Sergeant (500 - 2000 Reputation Level)KC Tan User rank is Sergeant (500 - 2000 Reputation Level)KC Tan User rank is Sergeant (500 - 2000 Reputation Level)KC Tan User rank is Sergeant (500 - 2000 Reputation Level) 
Time spent in forums: 11 h 14 m 51 sec
Reputation Power: 10
Send a message via MSN to KC Tan
Quote:
Originally Posted by alexo
i'm sorry, can u explain how can i do it via robots.txt ?

1. Disallow: /included.php in robots.txt

2. template for all files is <?php include ("../path/included.php"); ?>

so u think that in this case Gbot don't check this included.php file content ?

i don't think so :-(

Google will not index the included.php file if you were to place the above specified robots.txt in your directory. However, Google will continue to index all your other pages (with the embedded included file).

As what has been mentioned before, Google should not see any problems utilizing PHP includes (as this process is transparent to them). You might want to refer to the following video from Google regarding supplemental results:

http://video.google.com/videoplay?docid=-3494613828170903728

More on supplemental results:
http://www.google.com/support/webmasters/bin/answer.py?answer=34473&topic=8523

Hope it helps

Reply With Quote
Reply

Viewing: Dev Shed ForumsWeb DesignSearch Engine Optimization > Exclude text from Google indexation


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump


Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 





© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 4 hosted by Hostway
Stay green...Green IT