I have come across so many conflicting methods and code snippets online, and I have no idea which one actually works. Can someone please post here the EXACT code to use in both robots.txt AND .htaccess? NOTE #1: I do not want to block Alexa, only the Wayback Machine archiver. NOTE #2: I want to prevent the Wayback Machine archiver from accessing one particular page of my site. I do want it to archive the rest of my pages. Thanks for the help!
AFAIK, Alexa and the IA archiver are the same thing, so you can't block one without blocking the other. But if you want to block certain parts of your site, just write something like this in robots.txt:
User-agent: ia_archiver
Disallow: /Folder/
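If you want to sanity-check a rule like that before uploading it, here is a minimal sketch using Python's standard urllib.robotparser. It only shows how Python's parser reads the rule; whether a given crawler actually honors it is a separate question. The path /private-page.html is a made-up placeholder for the single page the original poster wants to hide.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; /private-page.html is a placeholder path.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /private-page.html
"""


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if Python's parser says user_agent may fetch path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


print(is_allowed("ia_archiver", "/private-page.html"))  # False
print(is_allowed("ia_archiver", "/index.html"))         # True
print(is_allowed("Googlebot", "/private-page.html"))    # True
```

Because the Disallow line names one page instead of a folder, every other page stays open to the same bot.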
From what I understand, Alexa and Archive.org use different bots:
ia_archiver = Alexa
ia_archiver-web.archive.org = Archive.org
Both of these bots visit my site regularly.
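If the two bots really do send different user-agent strings, a server-side block (the kind of check an .htaccess rule would do) can key on the longer token only. Here is a rough sketch of that decision in Python, not a real server config. The token is the one quoted in the post above, and the path /private-page.html and the function name should_block are made up for illustration.

```python
# Token taken from the post above; the path is a hypothetical placeholder.
ARCHIVE_TOKEN = "ia_archiver-web.archive.org"
PROTECTED_PATH = "/private-page.html"


def should_block(user_agent: str, path: str) -> bool:
    """Return True when the request should be refused (e.g. with a 403)."""
    # Case-insensitive substring match on the user-agent header.
    is_archive_bot = ARCHIVE_TOKEN in user_agent.lower()
    return is_archive_bot and path == PROTECTED_PATH


print(should_block("ia_archiver-web.archive.org", "/private-page.html"))  # True
print(should_block("ia_archiver", "/private-page.html"))                  # False
print(should_block("ia_archiver-web.archive.org", "/index.html"))         # False
```

Matching on the longer token leaves a plain ia_archiver request alone, which is the behavior the original poster asked for, assuming the user-agent names above are accurate.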
I have the same problem. I do not want to show the Wayback Machine archiver a particular <img> that appears on every page of my site.
Having a good archive.org history of your site may one day add to its reputation and credibility. In addition, you may one day find that the only backup of a deleted page is on archive.org (this happened to me once, years ago).
In today's broadband world, the little traffic that archive.org generates really should not matter anymore, and if you have a huge site, you also have an even larger server paid for by even larger AdSense revenue.
There are OTHER resource vampires, such as:
- fake Chinese traffic, tens of thousands of page loads per month
- pictures hotlinked from MySpace, hi5, space.msn, etc., which can cost gigabytes of traffic each month if your pictures are not hotlink-protected