Hi Every One, I am facing a problem and it's from the morning that today when i was searching at google blog search with my blog name it showed me an blog containing my name in it. When i clicked that particular url it showed me my name in it. The most ridiculous scenerio is that it's having adult related contents. So, i would like to know that will it harm my blog in any sense and how can i get my name removed from there! Any suggestions will be appreciated. Thanks.
I'm not quite understanding... Some splogs will generate a post based off of your RSS feed. That could be it?
I have the same problem don't know if there's much to do about it though. You can really stop anyone from writing the name of a domain somewhere.
Block robot access to your feed page using a robots.txt file. I had the same problem on a couple of sites and this sorted it instantly. Create a file on the root of the server, name it robots.txt In that, write the following: User-agent: * Disallow: Disallow: /feed/ Code (markup): This will stop robots - including search engine robots! - from accessing the feed and thus, leeching your content. Good luck.
It will only stop search engine robots. They may have made their own crawler and if so I doubt they stop at what the robots.txt allows.
Please explain how a robots.txt will stop anyone from ripping content from another site, I don't get it. It's late and I'm sleepy, sorry. Do you mean that they've taken the content from already crawled site from a legit search engine that respects the robots.txt? I don't disagree that a robots.txt may help but it's only a textfile that can be ignored by "bad" crawlers. And Amits, I feel with you mate, it's not a nice situation at all.
Spider-Man, I don't mean to say that it doesn't work, it obviously does and you've proved it. Just trying to understand how that's all.
It is possible. Most spiders will obey robots.txt, but yes, not all do. Another couple of ideas to help you are: 1) Attempt to identify the IP of the offending leecher and block via IP Deny Manager, or .htaccess, etc. 2) Only provide excerpts of posts, by using the <!--more--> command in WordPress when you want to 'break' the post. (This also can help if using Adsense on your blog - twice the page views) 3) Serve a copyright notice in the footer or header of all blog posts. 4) Only link using full URL's. Never use site-specific URL's. For example, link like this - 'http://www.yourdomain.com/pageyouarelinkingto/' as opposed to '/pageyouarelinkingto/'. This way, if they are stealing your content, at the very least you're getting a backlink These are some of the ideas that I have deployed also.
Thanks for the great details but my blog is on blogger so i can not do the things said above means changing my robot.txt file.