Looking for a server-sided web crawler

Discussion in 'PHP' started by pfeiffel, Jul 20, 2010.

  1. #1
    Good evening,

    for a couple of months now I am looking for a web spider. However, I couldn't find what I was looking for.

    Basically, it should be similar to Xenu Link Sleuth, however, it should be programmed in a server-sided language such as PHP.

    At a minimum I am looking for these features:

    * crawling a whole list of websites and checking the http header status of all external links
    * exporting the result (URL+http header code) in a csv or writing it in a mysql database
    * multi-threading

    If anyone of you has an idea where I could look or which existing (PHP) script would be easy to tweak I am more than grateful!

    Thanks a lot.

    Mark
     
    pfeiffel, Jul 20, 2010 IP
  2. sarahk

    sarahk iTamer Staff

    Messages:
    28,901
    Likes Received:
    4,555
    Best Answers:
    123
    Trophy Points:
    665
    #2
    sarahk, Jul 20, 2010 IP
  3. bencummins

    bencummins Peon

    Messages:
    55
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    #3
    You may be able to write a front end to Axel, you can find it on freshmeat
     
    bencummins, Jul 20, 2010 IP