Need recommendation for a site scraper / web page bulk downloader

Discussion in 'Content Management' started by Labcoat88, Mar 2, 2006.

  1. #1
    I'm looking for an application (preferably Windows) or a Unix script that can scrape a portion or entire website to local files. "Why I need this" is a long story, but I need to be able to slurp down HTML files, graphics, CSS, javascript, etc., and have the URL paths rewritten to work from a local hard drive.

    Any software recommendations for software that people have used? As for cost, 'free' would be nice, but I'm willing to pay some $ if the product is good and professional.

    thanks!
    LC
     
    Labcoat88, Mar 2, 2006 IP
    Barefootsies likes this.
  2. Labcoat88

    Labcoat88 Peon

    Messages:
    192
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #2
    hehe... replying to my own thread.

    I found a great one that does exactly what I needed: MetaProducts "Offline Explorer" ... http://www.metaproducts.com/

    LC
     
    Labcoat88, Mar 2, 2006 IP
  3. jimrthy

    jimrthy Guest

    Messages:
    283
    Likes Received:
    13
    Best Answers:
    0
    Trophy Points:
    0
    #3
    $70...ouch.

    I don't have much use for this sort of program, but, when I have had the situation come up, I just use wget. It takes a little bit (though not much) to learn, but I'm comfortable using command line tools. And it's free.

    I'm not sure about a windows version, though. If there's not, I bet it would work with cygwin.
     
    jimrthy, Mar 5, 2006 IP
  4. rehash

    rehash Well-Known Member

    Messages:
    1,502
    Likes Received:
    30
    Best Answers:
    0
    Trophy Points:
    150
    #4
    yeah, wget should do it most of times
     
    rehash, Mar 7, 2006 IP
  5. iatbm

    iatbm Prominent Member

    Messages:
    5,151
    Likes Received:
    352
    Best Answers:
    0
    Trophy Points:
    360
  6. xern

    xern Peon

    Messages:
    2
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #6
    Maybe you can consider to use FEAR::API, another site scraping framework based on Perl. You can use it to create your own specific scraping scripts.

    Best,
    Yung-chung Lin
     
    xern, May 16, 2006 IP