Help stop my content stealers.

Discussion in 'Search Engine Optimization' started by darrens, May 13, 2007.

  1. #1
    My site keeps getting pounded every night by some kind of bot that's stealing my content ...

    How can I stop it?

    So far all I know is that the IPs (different every day) are all from Vietnam.

    I thought there was a way of blocking this via .htaccess?

    Please help.
     
    darrens, May 13, 2007
  2. wokaka

    wokaka Peon

    #2
    What do you mean? So they're stealing your content and pasting it on their websites?
     
    wokaka, May 13, 2007
  3. darrens

    darrens Peon

    #3
    darrens, May 13, 2007
  4. EconomicalSpace.com

    EconomicalSpace.com Guest

    #4
    lol, you can't do anything about it. They can use a proxy or some proxy browser to access your site, so blocking IPs won't do much good.
    Here is the answer to your question:
    Increase the ranking of your site and let Google crawl your page before it crawls the stealer's website. Once your article gets indexed before the stealer's copy, after a few days Google will treat the article stolen from your site as duplicate content and remove it.
     
    EconomicalSpace.com, May 13, 2007
  5. tradeya

    tradeya Notable Member

    #5
    What if the stealer's site ranks higher than your site? :p
     
    tradeya, May 14, 2007
  6. MattUK

    MattUK Notable Member

    #6
    Use robots.txt to prevent access, either by IP address or by banning the bot completely.

    Use copyscape.com to check for content duplicated from your site. Then email the hosting company and they'll take the site down until the content is removed.
     
    MattUK, May 14, 2007
  7. EconomicalSpace.com

    EconomicalSpace.com Guest

    #7
    As I mentioned, let Google crawl your site. Use a sitemap and have Google check it as soon as possible after you post an article.
     
    EconomicalSpace.com, May 15, 2007
  8. Janna122003

    Janna122003 Banned

    #8
    You can't stop them from stealing your content, but that content won't help them. It will only put their site in the supplemental results.
     
    Janna122003, May 16, 2007
  9. Ritu

    Ritu Active Member

    #9
    You can disable the right mouse button, but this is primitive and easy to get around. There are some content-protection companies; try them :)
     
    Ritu, May 16, 2007
  10. andre75

    andre75 Peon

    #10
    I don't think a scraper will care about your robots file.

    When you say the IP changes every day, does it change a lot or stay within a certain class C range? If the first two or three octets stay the same, block the entire IP range in your .htaccess.

    e.g.

    deny from 234.32.21

    will block this entire class C block, from 234.32.21.0 to 234.32.21.255.
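
    To see whether the addresses really cluster in a few class C (/24) blocks, something like the following could help. It is only a minimal sketch, assuming the visiting IPs have already been pulled out of the access log; the function names, the sample addresses, and the min_hits threshold are made up for illustration.

```python
import ipaddress
from collections import Counter


def class_c_counts(ips):
    """Count how many of the given IPv4 addresses fall in each /24 block."""
    counts = Counter()
    for ip in ips:
        # strict=False accepts a host address and returns its enclosing /24
        block = ipaddress.ip_network(f"{ip}/24", strict=False)
        counts[block] += 1
    return counts


def deny_lines(ips, min_hits=2):
    """Build 'deny from a.b.c' lines for blocks seen at least min_hits times."""
    lines = []
    for block, hits in sorted(class_c_counts(ips).items()):
        if hits >= min_hits:
            # Keep the first three octets, matching the partial-IP form above
            prefix = ".".join(str(block.network_address).split(".")[:3])
            lines.append(f"deny from {prefix}")
    return lines


# Hypothetical sample addresses (documentation ranges), not real scraper IPs
sample = ["203.0.113.5", "203.0.113.77", "203.0.113.200", "198.51.100.9"]
print("\n".join(deny_lines(sample)))
```

    If most of the hits land in one or two blocks, a partial-IP deny line per block is probably enough; if they are scattered all over, range blocking is unlikely to help much.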

    Alternatively, you can download GeoIP.dat from MaxMind (the free database) and check the geolocation of every request first.
    If the country comes up as Vietnam, you can simply block it.

    Or you could analyze the referrer or user agent if they are unique to this scraper and block those easily in .htaccess.
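
    To find out whether the user agent is unique to the scraper, you could tally the user agents that the suspect IPs send. A rough sketch, assuming the log is in Apache's "combined" format (where the last two quoted fields are the referrer and the user agent); the function name and the sample log lines are hypothetical, and a user agent containing escaped quotes would not be parsed correctly.

```python
import re
from collections import Counter

# Client IP at the start; referrer and user agent as the last two quoted fields
LINE_RE = re.compile(r'^(\S+) .* "([^"]*)" "([^"]*)"$')


def user_agents_for(log_lines, suspect_ips):
    """Count the user agents seen on requests coming from the suspect IPs."""
    counts = Counter()
    for line in log_lines:
        match = LINE_RE.match(line.strip())
        if match and match.group(1) in suspect_ips:
            counts[match.group(3)] += 1
    return counts


# Hypothetical log lines for illustration only
sample_log = [
    '203.0.113.5 - - [13/May/2007:02:10:01 +0000] "GET /a.html HTTP/1.1" 200 512 "-" "ExampleScraper/1.0"',
    '203.0.113.5 - - [13/May/2007:02:10:02 +0000] "GET /b.html HTTP/1.1" 200 734 "-" "ExampleScraper/1.0"',
    '198.51.100.9 - - [13/May/2007:09:30:00 +0000] "GET / HTTP/1.1" 200 1024 "http://example.com/" "Mozilla/5.0"',
]

for agent, hits in user_agents_for(sample_log, {"203.0.113.5"}).most_common():
    print(hits, agent)
```

    If one odd user agent accounts for nearly all of the suspect traffic and never shows up for normal visitors, it is a reasonable candidate to block; if the bot sends a common browser string, this approach won't separate it from real users.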

    There are many things you can try. Disabling right-click, robots.txt and Copyscape won't prevent bots from stealing your stuff though.

    Since traffic from Vietnam probably isn't converting into a lot of sales for you, I'd simply block the entire country.
    Or you could try even funnier things and redirect them somewhere else.
    If I can tie a bot to a certain site, I usually redirect them to themselves and let them scrape their own stuff. That can look funny on their site if they scrape, post then find the new post, scrape again and post again .....

    You can also try to do something at the firewall level. I have a brute force script installed, and if a bot gets too nasty it gets blocked at the firewall level, which saves my Apache resources.
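
    The script itself isn't shown here. As a rough sketch of the detection half only (not the actual script, and without the firewall step), one way to decide that a bot is "too nasty" is to flag any IP that exceeds a request limit within a time window. The function name, the thresholds, and the sample data below are all assumptions for illustration.

```python
from collections import defaultdict, deque


def find_abusive_ips(requests, max_hits=100, window_seconds=60):
    """Return IPs that made more than max_hits requests within window_seconds.

    requests: iterable of (timestamp_seconds, ip) pairs, sorted by timestamp.
    """
    recent = defaultdict(deque)  # ip -> timestamps still inside the window
    abusive = set()
    for ts, ip in requests:
        window = recent[ip]
        window.append(ts)
        # Drop timestamps that have fallen out of the sliding window
        while window and ts - window[0] >= window_seconds:
            window.popleft()
        if len(window) > max_hits:
            abusive.add(ip)
    return abusive


# Hypothetical traffic: one IP hits once a second, another only twice
hits = [(t, "203.0.113.5") for t in range(10)]
hits += [(100, "198.51.100.9"), (130, "198.51.100.9")]

print(find_abusive_ips(hits, max_hits=5, window_seconds=60))
```

    The flagged addresses would then be handed to whatever firewall you run, so the requests never reach Apache at all.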
     
    andre75, May 16, 2007
  11. Ratman2050

    Ratman2050 Well-Known Member

    #11
    Try using copyscape.com.

    Add their banner to your site, use their search, see who is copying your content, and sue them (or threaten to do so). If you have a copyright notice in the footer of your site, I am sure a nice threat will stop them.
     
    Ratman2050, May 16, 2007
  12. andre75

    andre75 Peon

    #12
    Yes, a bot from China (or the guy who wrote it) is certainly going to care about US copyright laws. Makes perfect sense :cool:
     
    andre75, May 16, 2007