Need a Programmer to do the following

Discussion in 'Programming' started by -Abhishek-, Jul 5, 2007.

  1. #1
    Alrite,

    Time for another job opening.

    Here's the deal,

    I have a directory 'x' on my folder with many sub-folders and in them are many text files with articles in them.

    Now I want an application built that looks for these articles and search the internet to find if there is any duplicate content in them.

    If there is a copy of the said article anywhere it'd generate a file that shows the link where the said copy is located.

    In a gist, am looking for a duplicate content finder much like copyscape and dupecop but on a larger scale.

    There are several other minutes, but I think this gives you a bright enough idea.

    So if you think, this is feasible and you "can" do it, I'd appreciate a Private Message.

    Abhishek
     
    -Abhishek-, Jul 5, 2007 IP
  2. Scoding

    Scoding Well-Known Member

    Messages:
    1,091
    Likes Received:
    18
    Best Answers:
    0
    Trophy Points:
    155
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #2
    pm me your msn, able to do it!
     
    Scoding, Jul 5, 2007 IP
  3. -Abhishek-

    -Abhishek- Regaining my Momentum!

    Messages:
    2,109
    Likes Received:
    302
    Best Answers:
    0
    Trophy Points:
    0
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #3
    Rather than an MSN, could one of you message me with a brief on how you're going to do it. Some API or something similar.

    I've had proposals to spider the internet and then look for duplicate copy, but that's pretty much not possible.

    Please let me know.

    Thanks
     
    -Abhishek-, Jul 13, 2007 IP
  4. ratman87

    ratman87 Guest

    Messages:
    47
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    0
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #4
    how much are you planning to spend on this? :)
     
    ratman87, Jul 13, 2007 IP
  5. -Abhishek-

    -Abhishek- Regaining my Momentum!

    Messages:
    2,109
    Likes Received:
    302
    Best Answers:
    0
    Trophy Points:
    0
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #5
    Depends on how much my Programmers wants me to spend.
     
    -Abhishek-, Jul 13, 2007 IP
  6. D'Godown

    D'Godown Well-Known Member

    Messages:
    1,093
    Likes Received:
    25
    Best Answers:
    0
    Trophy Points:
    140
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #6
    I can do it in 12 hours, What is the config of your server?
    Do you have an idea its going to crawl the Search Databases by passing strings? So it will obviously be high on server resources.

    I will do it in $1.5k and i will provide support for a week, It will require no overhead job, But i will take care for a week to make sure you are running fine.

    EDIt : Python is my preferred scripting, But can do it in PHP as well.
     
    D'Godown, Jul 26, 2007 IP
  7. JLEville

    JLEville Peon

    Messages:
    147
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #7
    This thread died on July 13th....14 days ago. W/e
     
    JLEville, Jul 26, 2007 IP
  8. -Abhishek-

    -Abhishek- Regaining my Momentum!

    Messages:
    2,109
    Likes Received:
    302
    Best Answers:
    0
    Trophy Points:
    0
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #8
    @D'Godown,

    How do you think will it search databases? Will there be a specific AI that'd be created? Or would it work on some existent API viz. Copyscape ?

    Abhishek
     
    -Abhishek-, Jul 26, 2007 IP