Stopping Directory Scrapers

Discussion in 'Directories' started by CReed, Oct 1, 2007.

  1. #1
    CReed, Oct 1, 2007 IP
    Brian1970, mann3r and EveryQuery like this.
  2. mann3r

    mann3r Peon

    Messages:
    1,416
    Likes Received:
    100
    Best Answers:
    0
    Trophy Points:
    0
    #2
    thanks for this script creed, this scraper should be dealt accordingly, this hurting all directories in the market.
     
    mann3r, Oct 1, 2007 IP
  3. indyguidedotinfo

    indyguidedotinfo Notable Member

    Messages:
    3,254
    Likes Received:
    202
    Best Answers:
    0
    Trophy Points:
    245
    #3
    sorry if i am slow but what exactly is going on here?
     
    indyguidedotinfo, Oct 1, 2007 IP
  4. Fastian

    Fastian Peon

    Messages:
    2,085
    Likes Received:
    235
    Best Answers:
    0
    Trophy Points:
    0
    #4
    Fastian, Oct 1, 2007 IP
  5. sizzler_chetan

    sizzler_chetan Prominent Member

    Messages:
    7,838
    Likes Received:
    664
    Best Answers:
    0
    Trophy Points:
    390
    #5
    Thanks for the heads up..

    Good work by !Ask!
    Will imply that in the directory..
    Man that scraper bhomiyo is still in top for aviva brand name keyword!
     
    sizzler_chetan, Oct 1, 2007 IP
  6. Red_Virus

    Red_Virus Well-Known Member

    Messages:
    3,756
    Likes Received:
    249
    Best Answers:
    0
    Trophy Points:
    135
    #6
    So does this script blocks, the IP of all the proxies that tries to access your directory by proxy ?
     
    Red_Virus, Oct 1, 2007 IP
  7. CReed

    CReed Prominent Member

    Messages:
    3,969
    Likes Received:
    595
    Best Answers:
    0
    Trophy Points:
    310
    #7
    You can ask questions about the script over at that forum. Unfortunately !Ask! is still banned here.
     
    CReed, Oct 1, 2007 IP
  8. bhushna222

    bhushna222 Banned

    Messages:
    51
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    0
    #8
    thanks for this script
     
    bhushna222, Oct 1, 2007 IP
  9. williamjack

    williamjack Notable Member

    Messages:
    2,189
    Likes Received:
    324
    Best Answers:
    0
    Trophy Points:
    225
    #9
    williamjack, Oct 1, 2007 IP
  10. mikey1090

    mikey1090 Moderator Staff

    Messages:
    15,869
    Likes Received:
    1,055
    Best Answers:
    0
    Trophy Points:
    445
    Digital Goods:
    2
    #10
    I think it blocks google from indexing proxified content from your site:)
     
    mikey1090, Oct 1, 2007 IP
  11. MeetHere

    MeetHere Prominent Member

    Messages:
    15,399
    Likes Received:
    994
    Best Answers:
    0
    Trophy Points:
    330
    #11
    Sad that ASK is banned on DP..

    Thanks for the update CReed and ASK too :)
     
    MeetHere, Oct 1, 2007 IP
  12. Obelia

    Obelia Notable Member

    Messages:
    2,083
    Likes Received:
    171
    Best Answers:
    0
    Trophy Points:
    210
    #12
    Am I right in thinking that this is a proxy, and not a scraper? If this is the case, it's important not to link to it at all.
     
    Obelia, Oct 2, 2007 IP
  13. SilkySmooth

    SilkySmooth Well-Known Member

    Messages:
    1,583
    Likes Received:
    269
    Best Answers:
    0
    Trophy Points:
    180
    #13
    Obelia, that is correct, however certain people are using this proxy caching as a way to hack Google. Please see this article for details.
     
    SilkySmooth, Oct 2, 2007 IP
  14. Obelia

    Obelia Notable Member

    Messages:
    2,083
    Likes Received:
    171
    Best Answers:
    0
    Trophy Points:
    210
    #14
    I read that article, and this is what I want to highlight from it:

    As far as I can see, the proxies aren't doing any scraping or caching, which is why when you implement a .htaccess ban on their IP the page disappears immediately. If this doesn't happen, you know you have a scraper problem and not just a proxy.

    The practical implications of this are that banning bots won't make any difference to the proxy problem (although that shouldn't stop you from banning any unwanted bots which suck up your bandwidth). You just have to ban by IP. And linking to these proxy sites is going to mess up some people's SERPs.
     
    Obelia, Oct 2, 2007 IP
  15. Razvan

    Razvan Well-Known Member

    Messages:
    712
    Likes Received:
    27
    Best Answers:
    0
    Trophy Points:
    160
    #15
    CReed thanks for posting this, I'm already using it on my dir.
     
    Razvan, Oct 2, 2007 IP
  16. gostats

    gostats Peon

    Messages:
    325
    Likes Received:
    11
    Best Answers:
    0
    Trophy Points:
    0
    #16
    gostats, Oct 2, 2007 IP
    EveryQuery likes this.
  17. lhaizza

    lhaizza Peon

    Messages:
    132
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #17
    Great help. Thank you for sharing.
     
    lhaizza, Oct 5, 2007 IP