|
|
|||||||||
|
|||||||||
| |||||||||
|
|
|
| |||||||||
![]() |
|
|
«
Previous Thread
|
Next Thread
»
|
Thread Tools | Search this Thread | Rate Thread | Display Modes |
|
#1
|
|||
|
|||
|
Controlling Robots
My main index page pulls in pages from
/includes/content/* For example: index.php?section=section1 Pulls content from /includes/content/section1 I'm guessing that doing Disallow: /includes/content/section1 in my robots.txt will prevent the search engine from getting that content. Is there any way to restrict access to these pages? |
|
#2
|
||||
|
||||
|
If you are using mod_rewrite you will have to block the virtual directories you want blocked as the robots will not be aware of thew hidden directories. They see what a user sees.
|
|
#3
|
|||
|
|||
|
I'm not using mod_rewrite, just php includes. So if they request
index.php?section=section1 Then it'll include includes/content/section1/index.php as required. |
|
#4
|
||||
|
||||
|
Then you'll have to either dynamically add the meta noindex tag for those pages or list each page individually in your robots.txt. Google allows for a wildcard to be used which would make your life easier, but the other search engines don't so that's not an option.
|
|
#6
|
||||
|
||||
|
Because they are using PHP includes to add the content to the page. That means that directory isn't being accessed directly.
|
![]() |
| Viewing: Dev Shed Forums > Web Design > Search Engine Optimization > Controlling Robots |
| Thread Tools | Search this Thread |
| Display Modes | Rate This Thread |
|
|
|
|