Alrite, Time for another job opening. Here's the deal, I have a directory 'x' on my folder with many sub-folders and in them are many text files with articles in them. Now I want an application built that looks for these articles and search the internet to find if there is any duplicate content in them. If there is a copy of the said article anywhere it'd generate a file that shows the link where the said copy is located. In a gist, am looking for a duplicate content finder much like copyscape and dupecop but on a larger scale. There are several other minutes, but I think this gives you a bright enough idea. So if you think, this is feasible and you "can" do it, I'd appreciate a Private Message. Abhishek
Rather than an MSN, could one of you message me with a brief on how you're going to do it. Some API or something similar. I've had proposals to spider the internet and then look for duplicate copy, but that's pretty much not possible. Please let me know. Thanks
I can do it in 12 hours, What is the config of your server? Do you have an idea its going to crawl the Search Databases by passing strings? So it will obviously be high on server resources. I will do it in $1.5k and i will provide support for a week, It will require no overhead job, But i will take care for a week to make sure you are running fine. EDIt : Python is my preferred scripting, But can do it in PHP as well.
@D'Godown, How do you think will it search databases? Will there be a specific AI that'd be created? Or would it work on some existent API viz. Copyscape ? Abhishek