Data Crawling for company size

Discussion in 'Programming' started by JoshJD, Aug 18, 2013.

  1. #1
    Hey guys,

    I need to profile a bunch of music company sites for the number of artist on their roster, to get an idea of the size of the company. What would be the most accurate & effective way of doing this? I was thinking maybe by having software count the number of artist profile shots or names, on the "Artist" or "Roster" page of each website?

    many thanks,
    j
     
    JoshJD, Aug 18, 2013 IP
  2. EmmanuelFlossie

    EmmanuelFlossie Active Member

    Messages:
    159
    Likes Received:
    11
    Best Answers:
    2
    Trophy Points:
    65
    #2
    If your intentions are to crawl pages and fetch data the easiest php option would be to use simple html dom parser, just google the highlighted text
     
    EmmanuelFlossie, Aug 18, 2013 IP
  3. kutchbhi

    kutchbhi Active Member

    Messages:
    130
    Likes Received:
    4
    Best Answers:
    2
    Trophy Points:
    70
    #3
    Simple HTML DOM is an option. But IMHO a superior option is querypath (its jquery like )
    Also for crawling you can use php/curl
     
    kutchbhi, Aug 20, 2013 IP