Search Engine Spiders?

Discussion in 'Programming' started by tjsocr22, Jun 4, 2010.

  1. #1
    Hey guys,

    Just wondering how search engine spiders work? And does it take a lot of effort to make one?

    Thanks!
     
    tjsocr22, Jun 4, 2010 IP
  2. Blue Star Ent.

    Blue Star Ent. Well-Known Member

    Messages:
    1,989
    Likes Received:
    31
    Best Answers:
    0
    Trophy Points:
    160
    #2
    They follow hyperlinks. Use Java. Here is the link: LINK

    Let me know how you do!
     
    Blue Star Ent., Jun 14, 2010 IP
  3. Lever

    Lever Deep Thought

    Messages:
    1,823
    Likes Received:
    94
    Best Answers:
    0
    Trophy Points:
    145
    #3
    Lever, Jun 14, 2010 IP
  4. stephan2307

    stephan2307 Well-Known Member

    Messages:
    1,277
    Likes Received:
    33
    Best Answers:
    7
    Trophy Points:
    150
    #4
    It is not too difficult to write one in PHP. You can do that in a couple of hours.

    However, for a proper spider you also need to handle robots.txt correctly.
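
To illustrate the robots.txt point, here is a minimal sketch in Python rather than PHP, using the standard library's `urllib.robotparser`. The helper name `make_robots_checker`, the `ExampleSpider` user agent, and the sample rules are assumptions made up for the example. A real spider would first download each site's own robots.txt.

```python
from urllib.robotparser import RobotFileParser


def make_robots_checker(robots_txt, user_agent):
    """Build a function that says whether user_agent may fetch a given URL.

    robots_txt is the text of a robots.txt file that has already been
    downloaded; nothing is fetched from the network here.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return lambda url: parser.can_fetch(user_agent, url)


# Hypothetical rules, standing in for a site's real robots.txt.
rules = """User-agent: *
Disallow: /private/
"""

allowed = make_robots_checker(rules, "ExampleSpider")
print(allowed("http://example.com/index.html"))         # True
print(allowed("http://example.com/private/data.html"))  # False
```

The spider would call the checker before every request and skip any URL it rejects.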

    To write one, all you need is a database, cURL and preg_match_all.
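
As a rough sketch of the link-extraction step, here is a Python analogue of the preg_match_all idea. It pulls `href` values out of a page with a regular expression and resolves them against the page's URL. The function name `extract_links`, the pattern, and the sample page are assumptions for illustration. Fetching the page (the cURL part) and storing results in a database are left out, and a real HTML parser would cope with messy markup better than a regex.

```python
import re
from urllib.parse import urljoin, urldefrag

# Rough pattern for the href attribute of <a> tags with quoted values.
HREF_RE = re.compile(r'<a\s[^>]*?href\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE)


def extract_links(html, base_url):
    """Return absolute http(s) URLs linked from html, in order, without duplicates."""
    links = []
    for raw in HREF_RE.findall(html):
        # Resolve relative links and drop any #fragment part.
        absolute, _fragment = urldefrag(urljoin(base_url, raw))
        if absolute.startswith(("http://", "https://")) and absolute not in links:
            links.append(absolute)
    return links


page = (
    '<a href="/about">About</a> '
    '<a class="x" href="http://example.org/">External</a> '
    '<a href="mailto:a@b.c">Mail</a>'
)
print(extract_links(page, "http://example.com/index.html"))
# ['http://example.com/about', 'http://example.org/']
```

The `mailto:` link is dropped because only http and https URLs are worth queuing for a crawl.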
     
    stephan2307, Jun 15, 2010 IP
  5. ttyler333

    ttyler333 Member

    Messages:
    62
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    43
    #5
    I would just download Sphider and read the code to learn how it works.
     
    ttyler333, Jun 18, 2010 IP
    Lever likes this.
  6. Lever

    Lever Deep Thought

    #6
    Lever, Jun 19, 2010 IP
  7. ttyler333

    ttyler333 Member

    #7
    Hey, thanks for gathering that link. I guess I forgot to do that. Anyway, hopefully that helps people ;)
     
    ttyler333, Jun 19, 2010 IP
  8. Lever

    Lever Deep Thought

    #8
    No problem at all, always happy to help out :)

    And thanks for pointing out Sphider, I'll have a proper look at that later myself.
     
    Lever, Jun 19, 2010 IP
  9. ttyler333

    ttyler333 Member

    #9
    Not a problem either. I tried using an existing spider for a site, but it did not do what I wanted, so I created my own. The spider I created only crawls the links found on the starting page. I was working on making it follow the crawled links as well, but anyway. Not bad for a first PHP spider attempt.
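
The difference between crawling only the links on one page and also following the crawled links comes down to a depth limit. Here is a minimal Python sketch of a breadth-first crawl with such a limit. It is not the PHP spider described above. The `crawl` function, the `SITE` dictionary and its paths are hypothetical stand-ins, so the example runs without any network access.

```python
from collections import deque


def crawl(start, get_links, max_depth):
    """Breadth-first crawl from start, following links at most max_depth hops away."""
    visited = [start]
    queue = deque([(start, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth == max_depth:
            continue  # pages at the depth limit are recorded but not expanded
        for link in get_links(url):
            if link not in visited:
                visited.append(link)
                queue.append((link, depth + 1))
    return visited


# Hypothetical in-memory "site": each page maps to the links found on it.
SITE = {
    "/": ["/a", "/b"],
    "/a": ["/c"],
    "/b": ["/", "/c"],
    "/c": [],
}


def get_links(url):
    return SITE.get(url, [])


print(crawl("/", get_links, 1))  # ['/', '/a', '/b']
print(crawl("/", get_links, 2))  # ['/', '/a', '/b', '/c']
```

With `max_depth=1` the crawl stops at the links on the start page. Raising the limit makes it follow the crawled links too, and the `visited` list keeps it from looping when pages link back to each other.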
     
    ttyler333, Jun 20, 2010 IP