Hey guys, I need to profile a bunch of music company sites for the number of artist on their roster, to get an idea of the size of the company. What would be the most accurate & effective way of doing this? I was thinking maybe by having software count the number of artist profile shots or names, on the "Artist" or "Roster" page of each website? many thanks, j
If your intentions are to crawl pages and fetch data the easiest php option would be to use simple html dom parser, just google the highlighted text
Simple HTML DOM is an option. But IMHO a superior option is querypath (its jquery like ) Also for crawling you can use php/curl