Is it legal to scrape content for personal use

Discussion in 'Legal Issues' started by camjohnson95, Oct 19, 2009.

  1. #1
    I have made an application that scrapes and stores results from the Australian TAB Trackside website, for personal use only. Is there any legal issues involved with me doing this? Of course in doing this I am viewing thousands of pages from the website in short periods of time, as all results are stored on seperate pages. Am I going to get into any trouble if they notice so many hits from my IP?
     
    camjohnson95, Oct 19, 2009 IP
  2. Traffic-Bug

    Traffic-Bug Active Member

    Messages:
    1,866
    Likes Received:
    8
    Best Answers:
    0
    Trophy Points:
    80
    #2
    Good question, do they have a policy to the effect anywhere on the website? Something that reads like 'no content scrapers are allowed regardless of purpose;' or 'personal usage content scraping is allowed'? In any case why dont you pace your website spider algorithm to use different ips all spread over multiple proxies over a period of time (that is not too tightly bound in time). You can get hundreds of proxy ips from fresh proxy lists available across the internet daily. Some of these may even have an API for use in client applications (a webservice to return fresh, tested, working proxy ip - 5 numbers - with every call etc.)
     
    Traffic-Bug, Oct 19, 2009 IP
  3. camjohnson95

    camjohnson95 Active Member

    Messages:
    737
    Likes Received:
    17
    Best Answers:
    0
    Trophy Points:
    60
    #3
    Hi Traffic-Bug, Thanks for your reply.

    No it doesn't say that anywhere. And I probably worded it wrong too. It scrapes data, rather than actual content, but I guess that it isn't relevant anyway. I think that at the worst, if they catch on, they will limit requests by IP. In which case I will have to use proxies, but I'm not going to put in the extra effort until it is necessary, as it is only for personal use.
     
    camjohnson95, Oct 20, 2009 IP
  4. MattyAsia

    MattyAsia Peon

    Messages:
    37
    Likes Received:
    1
    Best Answers:
    0
    Trophy Points:
    0
    #4
    Simply don't run a multi-threaded downloading monster.
    Make it space the downloading out a bit.
    2 requests a second wouldn't be a lot in my opinion and that's 120 pages of data a minute, or 1 page per second.
    The less influence you have on their server, the better.
    Now if you wrote a scraper with 100 threads, had each running at full speed, you'd be looking at big problems.
     
    MattyAsia, Nov 4, 2009 IP
  5. lynnj

    lynnj Greenhorn

    Messages:
    47
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    16
    #5
    I personally do not think u will get n trouble but yea the guy above has a good plan :p
     
    lynnj, Nov 4, 2009 IP