I am working on a site and have various navigation sets being pulled in dynamically via a PHP include command. I tried using a robots.txt file to exclude the includes folder that contains all of the snippets that get pulled in, but it still looks like the navigation is getting indexed by the spider. Any ideas?
I don't see how you can exclude the navigation from the spider since it gets served when the spider 'views' the page. Why do you want to exclude the navigation anyway?
The reason i'd like to exclude the navigation is say that someone is searching for a term that refers to a specific product or service, and the main info for that comes up on a single page. But if that term is contained as a link in the navigation of 50 other pages which have nothing contextually to do with the product or service being sought, then the engine returns 51 results, with the latter 50 showing the navigation in the result where the term is used.
Unless it's Googlebot - although I've never tried this myself, the Google robots.txt information page, seems to suggest that Googlebot does recognize Allow:... ...unless I'm reading this wrong.