Hi I am going to build a plagiarisms checker like copyscape, but for this I need Indexing like Google and Yahoo does. But both search engines does not allow unlimited searches. I want to know, if there is any company which provides Indexing, and provides API access like Google search does. So i can send them text and get matching results. We need this to start initially, if any guy helps me out about this. I will share my work on this project with him/her. Any clue and hint will be warmly appreciated. Many Thanks
I know this won't solve the issue. But i looked at something else that needed the Google api and one developer suggested that ifyou had a members login, you could have it so that it used their api key and not the sites a pi key. If it worked it would give each member 100uses per day. Compared to the standard of 100 a pi uses site wide. With the membership system its then easy to offer more searches (paid)
Check out "Web Crawl." You can use Amazon to query this cache of a good chunk of the internet. The cache may not be the most up-to-date though. Other than that, you'll have to do your own crawl of the internet.