How do big local directories get their listing data? The local Canadian one, weblocal.ca, has 13,000,000 pages indexed. Is it automatic, i.e. do they send out spiders? How about Yelp? Is it all user-generated?
Yes, if they have over 13,000,000 pages indexed, they're almost certainly using web spiders to retrieve that massive amount of data.
Doesn't that make it illegal to take the content? Or do they have a third-party content provider?
They are using a scraper script. Google for it and you'll find a whole bunch of them. Some companies will also sell you a ready-to-go country, state, city, etc. database.
They definitely use scrapers, or pay someone to provide the data (who is then using a scraper for them).
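For anyone wondering what a "scraper script" actually does, here is a rough sketch of the extraction step in Python, using only the standard library. The markup, the class names (`listing`, `name`, `phone`), and the helper `extract_listings` are all made up for illustration; every real directory site has its own page structure, and this is not how weblocal.ca or any particular provider is known to work. Fetching the pages is left out on purpose.

```python
from html.parser import HTMLParser

# Hypothetical markup: an assumption for this sketch, not any real site's HTML.
SAMPLE_HTML = """
<div class="listing"><span class="name">Joe's Pizza</span><span class="phone">555-0100</span></div>
<div class="listing"><span class="name">Main St Dental</span><span class="phone">555-0101</span></div>
"""


class ListingParser(HTMLParser):
    """Collects one dict per <div class="listing"> block."""

    def __init__(self):
        super().__init__()
        self.listings = []    # finished records
        self._current = None  # record being built, or None outside a listing
        self._field = None    # field currently being read, or None

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class")
        if tag == "div" and cls == "listing":
            self._current = {}
        elif tag == "span" and self._current is not None and cls in ("name", "phone"):
            self._field = cls

    def handle_data(self, data):
        # Only keep text that sits inside a recognised field.
        if self._field is not None:
            self._current[self._field] = self._current.get(self._field, "") + data

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None
        elif tag == "div" and self._current is not None:
            self.listings.append(self._current)
            self._current = None


def extract_listings(html):
    """Return a list of {'name': ..., 'phone': ...} dicts found in the HTML."""
    parser = ListingParser()
    parser.feed(html)
    parser.close()
    return parser.listings


if __name__ == "__main__":
    for row in extract_listings(SAMPLE_HTML):
        print(row["name"], row["phone"])
```

A spider is basically this step run in a loop over many pages, with the results written to a database.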
I read in their terms that they pay a third party. Who are these third parties? Do they scrape, or do they get the data some other way?