Recompiling a cached Forum from Wayback Machine

Discussion in 'Forum Management' started by matrixdutch, Jan 17, 2011.

  1. #1
    Hello all,

    I was wondering if it is possible to restore a forum from cached pages on the Wayback Machine in an efficient manner. This was a forum that was hacked and taken down.

    My friend just told me he did not have a database backup, and his webhosts in Singapore (which aren't that experienced from what I've heard) dont' have a database snapshot either.

    I tried using this to extract what I could from there, but it didn't work:

    http://www.httrack.com/

    Also, would anyone happen to know of software that will let me extract what's left from the webarchive to see what threads are remaining..sort of look at the entire site on my hard drive?

    Any assistance is greatly appreciated.

    Thanks in advance.
     
    matrixdutch, Jan 17, 2011 IP
  2. flapjack_

    flapjack_ Guest

    Messages:
    176
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #2
    I know there are scripts to crawl the sites, but they're written in python. I've done it some time ago for an smf forum, don't remember the details I'm afraid, only that it wasn't that difficult.
     
    flapjack_, Jan 17, 2011 IP
  3. RectangleMan

    RectangleMan Notable Member

    Messages:
    2,825
    Likes Received:
    132
    Best Answers:
    0
    Trophy Points:
    210
    #3
    It's not a viable option to regain the content but to recall forum structure, get images, and rebuild the theme it's handy.
     
    RectangleMan, Jan 20, 2011 IP
  4. EvanP

    EvanP Banned

    Messages:
    1,731
    Likes Received:
    17
    Best Answers:
    0
    Trophy Points:
    0
    #4
    It's not technically possible as the database isn't stored on that site. Sorry.
     
    EvanP, Jan 20, 2011 IP
  5. hillsha

    hillsha Peon

    Messages:
    7
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Would require a LOOOOOOOT of flipping through the "wayback pages" but possible if you are willing to spend time !
     
    hillsha, Jan 21, 2011 IP
  6. bryanon

    bryanon Well-Known Member

    Messages:
    806
    Likes Received:
    29
    Best Answers:
    0
    Trophy Points:
    145
    #6
    Yeah. Mind you though - you'd need to manually redo all members, their postcounts etc. as well. Definitely not an easy task!
     
    bryanon, Jan 23, 2011 IP
  7. matrixdutch

    matrixdutch Peon

    Messages:
    3
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Hello all,

    Thank you all for your input. The majority of the content is still there. I'm actually debating whether to hire someone to assist in scraping it for me, but am not sure how much that would cost. Also I wouldn't be restoring the forum exactly as is....I'm not concerned about usernames or images. I just really care about the content.

    Any additional insights would be helpful
     
    matrixdutch, Jan 24, 2011 IP