Hi guys, Does anybody have any good way of preventing those god-awful websites that scrape your content and use it on theirs in a tactic to draw people to their MFA sites? It's really getting on my nerves! Those are NOT quality backlinks at all! ;-) Any pointers would be much appreciated. Mike.
You could check the browser being used, and if it isn't a bot or web browser, then send them something amusing to stick on their mfa site The scraper could easily spoof the browser, but it's a start. J.
Unfortunatly i see scraping alot. I go to a freelance website and there are always people wanting to pay someone to clone or scrape websites. I do not take these jobs as i deem it unethical in my mind. You can try to do a search on how to disable the right clicking (which is java i think) and a java script i belive exists for disabling the save and save as options on most browsers. Or set the file(s) for your website as READ-ONLY, but this just stops people from straight downloading your content. Even then your files will still be put in to someones temp internet files. So if someone knows what they are doing they can still scrape your website BUT. If your dealing with some one not particularly hell bent on getting your website content the above should work just fine! Good luck
I have been pretty successful with the DMCA notice. If someone else is using your content, simply email/mail/fax the host a DMCA notification and they will get in touch with the owner. The good news is that if the site fails to respond or does not remove the content, they will end up suspending the account until they do.
Use the <base href= "http://www.yoursite.com"> tag in your head, and let them give you free links to your site when they scrape.
Ha! Nice. Of course that only works for relative links, and it's liable to get some 404's, and the scraping site might be from a bad neighborhood, but still it's an ingenious, often overlooked method to screw with people.
I put this javascript in my pages so if someone copys the HTML and puts it on there domain it redirects the user back to mine. <script type="text/javascript"> /* remove directories and http://*/ Url = document.location.href.split( "/" ); /* if domain name is wrong redirect */ if( Url[2] != 'mydomain.com' && Url[2] != 'www.mydomain.com' ) document.location.href = 'test.com'; </script> Code (markup):