Site scraping? Massive hits from Chinese IPs

Discussion in 'Google' started by drhfinegifts, May 9, 2007.

  1. #1
    I have been getting tons of hits from Chinese IPs that crawl through my site and its links continually.

    At first I tried to block the IPs from accessing my site, but since they are dynamic IPs, it's nearly impossible, so I gave up.

    I can't think of any reason for these IPs to be crawling my site so often, except that they may belong to scraper sites.

    Is there a way to stop this or block a particular country from accessing my site? I'm sure it is sucking up a nice little chunk of bandwidth each month.

    There is really no other reason I can think of that anyone from China would need access to my site.

    Any thoughts?
     
    drhfinegifts, May 9, 2007 IP
  2. trichnosis

    trichnosis Prominent Member

    #2
    I have two suggestions and one question.

    * My suggestions: first, block the IP range rather than single IPs. Second, contact the owner of the site that is linking to your site.

    * My question is about the reason for blocking visitors. Why do you want to block them? You are getting visitors :)
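
    To decide which range to block, it can help to count how many offending addresses fall into the same network. The following is only a rough sketch in Python: the function name `busiest_networks`, the /24 grouping, and the sample addresses (reserved documentation ranges) are assumptions for illustration, not data from this thread.

```python
import ipaddress
from collections import Counter

def busiest_networks(ips, prefix_len=24, min_hits=2):
    """Group IPv4 addresses by their enclosing network and return
    (network, hits) pairs seen at least min_hits times, busiest first."""
    counts = Counter(
        # strict=False lets a host address stand in for its network
        ipaddress.ip_network(f"{ip}/{prefix_len}", strict=False)
        for ip in ips
    )
    return [(str(net), n) for net, n in counts.most_common() if n >= min_hits]

# Hypothetical addresses, as they might be pulled from an access log.
sample = ["203.0.113.5", "203.0.113.77", "203.0.113.200", "198.51.100.9"]
print(busiest_networks(sample))
```

    A network that shows up many times is a candidate for a range block; a single stray address probably is not.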
     
    trichnosis, May 10, 2007 IP
  3. drhfinegifts

    drhfinegifts Peon

    #3
    I don't feel that I am blocking visitors... these are bots!

    Second, I don't ship to China, so why do I need visitors from there?
     
    drhfinegifts, May 10, 2007 IP
  4. amnezia

    amnezia Peon

    #4
    amnezia, May 10, 2007 IP
  5. twelfty

    twelfty Active Member

    #5
    I've recently been getting a visit or two every day from

    www.f a v o r s t a r.com

    Like you, I've been wondering what the f@€k?

    By the way, what's scraping?
     
    twelfty, May 10, 2007 IP
    Obelia likes this.
  6. sakib000

    sakib000 Active Member

    #6
    Just use .htaccess.

    You can block a single IP or an IP range very easily.

    Create a text file, name it htaccess.txt, and paste the following code into it:

    # Apply the rules below to GET, HEAD and POST requests only
    <Limit GET HEAD POST>
    # Allow rules are evaluated first, then deny rules; a deny match wins
    order allow,deny
    # A partial address blocks every IP that starts with 221.135
    deny from 221.135
    # A full address blocks that single IP
    deny from 221.134.100.1
    allow from all
    </Limit>

    Here the entire range 221.135.x.x is blocked, along with the single IP 221.134.100.1.
    You can add more "deny from xx.xx.xx.xx" lines as needed.


    PLEASE NOTE THESE ARE SAMPLE IPs, AND THEY ARE MY OWN :D

    Once the file is complete, save it as htaccess.txt, upload it to the root directory of your site, and rename it to .htaccess (if possible, upload it above the root directory, because that is more secure).
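
    To check which visitors the two deny rules above would catch before uploading the file, the matching can be imitated in a few lines of Python. This is a rough sketch that assumes IPv4 only and assumes a partial address such as 221.135 stands for the leading bytes of the address; the helper names `partial_to_network` and `is_denied` are made up for illustration.

```python
import ipaddress

def partial_to_network(spec):
    """Turn a partial IPv4 address such as "221.135" into the network
    it covers (here 221.135.0.0/16); a full address becomes a /32."""
    octets = spec.rstrip(".").split(".")
    padded = octets + ["0"] * (4 - len(octets))
    return ipaddress.ip_network(".".join(padded) + f"/{8 * len(octets)}")

# The two sample deny rules from the .htaccess snippet above.
DENY = [partial_to_network("221.135"), partial_to_network("221.134.100.1")]

def is_denied(ip):
    """Return True if the address falls inside any denied network."""
    addr = ipaddress.ip_address(ip)
    return any(addr in net for net in DENY)

print(is_denied("221.135.9.9"))    # inside the blocked range
print(is_denied("221.134.100.2"))  # neighbour of the single blocked IP
```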
     
    sakib000, May 10, 2007 IP
  7. Obelia

    Obelia Notable Member

    #7
    I've been getting that as well, once or twice a day on each of my domains. It's referral spamming: they are looking to gain backlinks through the many open referrer logs that can be found on the internet. Try not to link to them in your posts.
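
    If your referrer log or stats page is public, one way to deny them the backlink is to drop known spam referrers before anything is displayed. A minimal sketch in Python, assuming you keep your own blocklist; the domain `spam.example` and the function name `is_spam_referrer` are placeholders, not names from this thread.

```python
from urllib.parse import urlsplit

# Hypothetical blocklist; fill it with the domains seen in your own logs.
SPAM_DOMAINS = {"spam.example"}

def is_spam_referrer(referrer):
    """Return True if the referrer's host is a blocklisted domain
    or one of its subdomains."""
    host = (urlsplit(referrer).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in SPAM_DOMAINS)

referrers = ["http://www.spam.example/page", "http://friendly.example/links"]
# Keep only the referrers that are safe to show publicly.
print([r for r in referrers if not is_spam_referrer(r)])
```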
     
    Obelia, May 10, 2007 IP
  8. wilderr67

    wilderr67 Active Member

    #8
    Thanks for this.
     
    wilderr67, May 10, 2007 IP
  9. drhfinegifts

    drhfinegifts Peon

    #9
    It's when a site 'scrapes' snippets from several different sites to try to target a specific keyword. Usually these are MFA (made-for-AdSense) sites and the like.

    They take sites that rank well for certain keywords and use bits and pieces of each one to build content for their own site. Generally, when you view these sites, the content is garbage, since it is a garbled mess of other sites. It's strictly for AdSense.
     
    drhfinegifts, May 10, 2007 IP
    twelfty likes this.
  10. KeyboardHustler

    KeyboardHustler Well-Known Member

    #10
    Thanks a lot! Having the same problem !
     
    KeyboardHustler, May 10, 2007 IP
  11. twelfty

    twelfty Active Member

    #11
    Thanks for the info. :) 1 green for you.

    Should we feel flattered that our sites are worth stealing from? :(

    twelfty
     
    twelfty, May 10, 2007 IP
  12. Obelia

    Obelia Notable Member

    #12
    Unfortunately, no. They seem to do it indiscriminately, for every site that appears anywhere remotely in the SERPs, for just about any old phrase.
     
    Obelia, May 11, 2007 IP