#1
We have a robots.txt file in the root directory of our nimlok.com web site to stop search engines from accessing certain directories. I have checked it against several references to make sure it is written correctly, and cannot find any problems with it.
Here it is:
Code:
User-agent: *
Disallow: /cgi-bin/
Disallow: /distributor/
Disallow: /ecommerce/
Disallow: /cad/
Disallow: /robots/
Disallow: /imanager/
Disallow: /images/
Disallow: /technical/
Disallow: /analog3.0/
Disallow: /new/

But if you go to Google and search for "Nimlok", a link to the distributor directory is listed as a sub-section of the first result. I am not sure whether other directories also appear in search results. I did not see them on the first page, but chances are they show up somewhere later in the results. What's the deal with it?
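One way to double-check that the file says what is intended is to feed the same rules to a parser and ask it about specific paths. The following is only a minimal sketch using Python's standard `urllib.robotparser`. The `http://nimlok.com` base URL, the `Googlebot` agent name, and the sample paths `/images/logo.gif` and `/products/` are assumptions made for illustration, not details taken from the real site.

```python
from urllib.robotparser import RobotFileParser

# The rules exactly as posted above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /distributor/
Disallow: /ecommerce/
Disallow: /cad/
Disallow: /robots/
Disallow: /imanager/
Disallow: /images/
Disallow: /technical/
Disallow: /analog3.0/
Disallow: /new/
"""

# Assumed base URL, used only to build full URLs for the check.
BASE_URL = "http://nimlok.com"


def build_parser(text):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser, path, agent="Googlebot"):
    """Return True if the given user agent may fetch the path under these rules."""
    return parser.can_fetch(agent, BASE_URL + path)


parser = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    # "/images/logo.gif" and "/products/" are hypothetical sample paths.
    for path in ("/", "/distributor/", "/images/logo.gif", "/products/"):
        status = "allowed" if is_allowed(parser, path) else "blocked"
        print(path, status)
```

If the disallowed directories come back as blocked, the file itself is probably fine. That only says how a well-behaved crawler should read the rules today. It says nothing about pages a search engine already picked up before the file was in place.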
#2
Did you register with Google before you added the robots.txt file? If so, it is probable that their robot grabbed these pages in the past and is hanging on to them.
In my experience, search engines have a habit of holding on to information for a long time without checking it's still valid! If this is the case, you'll have to contact them and have the information removed.
__________________
Andrew "Chaos Theory is a theory invented by scientists panicked by the thought that the public were beginning to understand the old ones."
Viewing: Dev Shed Forums > Other > Dev Shed Lounge > Preventing Search Engines from Accessing Certain Directories - robots.txt problem