Duplicate Content deletion [Is there something like this]

Discussion in 'General Chat' started by -Abhishek-, Jun 4, 2007.

  1. #1
    Hello guys,

    Alright, I have two different files (both text and Excel): one contains around 2000 domains and the other around 360 domains. Some of these domains appear in both files. I want the duplicates deleted and the originals retained. I need to import this list somewhere!

    Is there some application that does that? Or some Excel feature or whatever?

    Any help appreciated!

    Abhishek
     
    -Abhishek-, Jun 4, 2007 IP
  2. sawz

    sawz Prominent Member

    #2
    sawz, Jun 4, 2007 IP
  3. -Abhishek-

    -Abhishek- Regaining my Momentum!

    #3
    errr... dude! I don't think you really got what I'm after.
     
    -Abhishek-, Jun 4, 2007 IP
  4. SumitBahl

    SumitBahl Reign of Chaos

    #4
    Send me the files, I will weed out the duplicates.

    Catch me on IM if the files contain sensitive data.
     
    SumitBahl, Jun 4, 2007 IP
  5. sachin410

    sachin410 Illustrious Member

    #5
    You can use MS Excel.

    Select the entire domain name column.
    Go to Data > Filter > Advanced Filter.
    Select "Copy to another location" and check "Unique records only".
    Specify a new column and press "OK".
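
    For anyone who would rather script it, the "unique records only" idea above can be sketched roughly like this in Python. This is only a minimal sketch: the function name `unique_domains` and the sample lists are made up for illustration, and it assumes domains should be compared case-insensitively with surrounding whitespace ignored.

```python
def unique_domains(*lists):
    """Merge several domain lists, keeping the first occurrence of each domain."""
    seen = set()
    result = []
    for domains in lists:
        for raw in domains:
            # Assumption: "Example.com " and "example.com" are the same domain.
            domain = raw.strip().lower()
            if domain and domain not in seen:
                seen.add(domain)
                result.append(domain)
    return result


# Hypothetical sample data standing in for the two files.
big_list = ["example.com", "Example.org", "example.net"]
small_list = ["example.org", "example.info"]

print(unique_domains(big_list, small_list))
```

    Note that this keeps one copy of every domain from both lists, the same as the Excel filter does.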
     
    sachin410, Jun 4, 2007 IP
  6. mymaldives

    mymaldives Well-Known Member

    #6
    Paste the 360-domain list (Excel) below the 2000-domain list. Go to Data > Filter > Advanced Filter > Unique records only (Excel 2007).

    That's it.

    Edit: I was a bit slow :)
     
    mymaldives, Jun 4, 2007 IP
  7. Indian

    Indian Peon

    #7
    Yes, that will work. Another option is to use the duplicate remover from ablebits.com.
     
    Indian, Jun 4, 2007 IP
  8. -Abhishek-

    -Abhishek- Regaining my Momentum!

    #8
    Thanks for the help guys, but this is not exactly what I was looking for!

    Let me give you an example,

    We have two datasets: Dataset A is the supplied task, whereas Dataset B is the completed task. What I want is Dataset A minus Dataset B, so that the pending tasks are retained and the duplicated (already completed) tasks are deleted.

    Just wanting to know if something like this exists! It would save me the trouble of writing a script for this!

    Abhishek

    Edit - I think Ablebits will do what I am looking for!
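
    In case a script ends up being needed after all, "Dataset A minus Dataset B" is a plain set difference. Here is a minimal sketch in Python; the function name `pending_domains` and the sample data are hypothetical, and it assumes domains match case-insensitively once surrounding whitespace is stripped.

```python
def pending_domains(supplied, completed):
    """Return the entries of `supplied` (Dataset A) that are not in `completed` (Dataset B)."""
    # Assumption: comparison ignores case and surrounding whitespace.
    done = {d.strip().lower() for d in completed if d.strip()}
    seen = set()
    pending = []
    for raw in supplied:
        domain = raw.strip().lower()
        # Keep the original order of A, and drop repeats inside A as well.
        if domain and domain not in done and domain not in seen:
            seen.add(domain)
            pending.append(domain)
    return pending


# Hypothetical sample data standing in for the two files.
dataset_a = ["example.com", "example.org", "example.net"]
dataset_b = ["Example.org"]

print(pending_domains(dataset_a, dataset_b))
```

    Unlike the unique-records filter, this drops everything that appears in B, so only the pending entries from A are left.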
     
    -Abhishek-, Jun 4, 2007 IP