Hi, I need a solution for a small problem I have with some PDF documents. I don't like Google offering an HTML version of my PDFs. What can I do to prevent Google from offering HTML versions? Any ideas? Oh, and by the way, does Google cache PDF files? Thank you for your help. Jab
As far as I know, they do cache them. The only option would be removing them from the search engine completely.
Disallow: /*.pdf Add that line to your robots.txt file, under a User-agent: line (for example User-agent: Googlebot). Google will stop crawling your PDF files and should, over time, drop the ones it has already indexed.
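To see which URLs a pattern like /*.pdf would cover, here is a minimal sketch in Python. It assumes Google's documented wildcard rules for robots.txt paths (a pattern matches from the start of the path, * matches any run of characters, and a trailing $ anchors the end of the URL). The function names are made up for illustration, and this is not Google's actual matcher.

```python
import re


def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a Google-style robots.txt path pattern into a regex.

    Hypothetical helper: '*' matches any sequence of characters and a
    trailing '$' anchors the pattern to the end of the URL path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_blocked(path: str, disallow_pattern: str) -> bool:
    """Return True if the path falls under the given Disallow pattern."""
    return robots_pattern_to_regex(disallow_pattern).match(path) is not None


if __name__ == "__main__":
    for path in ["/docs/manual.pdf", "/docs/manual.pdf?x=1", "/index.html"]:
        print(path, is_blocked(path, "/*.pdf"), is_blocked(path, "/*.pdf$"))
```

Under these rules /*.pdf also covers URLs that continue after ".pdf" (such as a query string), while /*.pdf$ only covers URLs that end in ".pdf".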
OK, but is there no way to prevent Google from caching or offering HTML versions without banning my files from the SERPs? Of course I want to rank with them; I just don't want them to be cached.
Just want to let you know that I found a solution to my problem: "A special case is PDF files that should be indexed, but not cached. There is no way to directly include meta information in a PDF file, but if security is enabled for a PDF file it will be treated as if the noarchive tag was specified. Security settings can be controlled using Adobe Acrobat (not the free Reader)." Source: http://www.uky.edu/Providers/searching.html#caching Thanks anyway for trying to help me out.
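If you want to double-check that security really ended up enabled on a file before uploading it, a rough sketch in Python follows. It relies on the fact that an encrypted PDF references an /Encrypt dictionary in its trailer, and simply searches the raw bytes for that name. The function name is hypothetical, and this is only a heuristic: the same bytes could appear elsewhere in a file, so a proper PDF library would be more reliable.

```python
def looks_secured(pdf_bytes: bytes) -> bool:
    """Heuristic check for PDF security settings.

    Assumption: a secured PDF carries an /Encrypt entry in its trailer,
    so finding that name in the raw bytes suggests security is enabled.
    """
    # The PDF header is expected near the start of the file.
    if b"%PDF-" not in pdf_bytes[:1024]:
        raise ValueError("this does not look like a PDF file")
    return b"/Encrypt" in pdf_bytes


# Usage (the file name is a placeholder):
# with open("mydoc.pdf", "rb") as f:
#     print(looks_secured(f.read()))
```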