big URL list

Discussion in 'Programming' started by babynl, Sep 19, 2009.

  1. #1
    Dear DP members,

    For something I am creating i need a huge URL list, it does not matter what kinds of urls are in the list as long as they are all unique.
    by this i mean not


    bladiebla.com
    bladiebla.com/bla
    bladiebla.com/bla2

    etc.

    but
    bla.com
    bladie.com
    bladiebla.com

    any list would help a great deal in my project.

    thanks in advance.

    john.
     
    babynl, Sep 19, 2009 IP
  2. ccoonen

    ccoonen Well-Known Member

    Messages:
    1,606
    Likes Received:
    71
    Best Answers:
    0
    Trophy Points:
    160
    #2
    why not built a quick app or script - that builds it in a brute force manner? As in add "a.com" to the list, have script verify it exists, then "b.com", verify it exists, and so on - this should build a monstrous list quick without leaving any out (depending on how long you let it run) :)
     
    ccoonen, Sep 19, 2009 IP
  3. babynl

    babynl Member

    Messages:
    357
    Likes Received:
    1
    Best Answers:
    1
    Trophy Points:
    40
    #3
    omg that i did not think of that... lol....

    thank you for the idea :D!!
     
    babynl, Sep 20, 2009 IP
  4. ccoonen

    ccoonen Well-Known Member

    Messages:
    1,606
    Likes Received:
    71
    Best Answers:
    0
    Trophy Points:
    160
    #4
    Glad it helped... you could probably do it pretty quick too if you just verified the domain was registered instead of screen scrape the page to verify it exists and has content.
     
    ccoonen, Sep 20, 2009 IP
  5. babynl

    babynl Member

    Messages:
    357
    Likes Received:
    1
    Best Answers:
    1
    Trophy Points:
    40
    #5
    i could just let it read the first line of html code and by that check if its a working page or not :D
     
    babynl, Sep 20, 2009 IP
  6. EpicServices

    EpicServices Peon

    Messages:
    111
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #6
    A script is more efficient & hella easier.
     
    EpicServices, Sep 22, 2009 IP
  7. phprightnow

    phprightnow Peon

    Messages:
    296
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    0
    #7
    I've read on several places all 4 letter domain names are pretty much took up, there's not many left out there. So if you do a brute force of numbers and letters on at least 4 letter level, that's about 10,000+ domains easy.
     
    phprightnow, Sep 22, 2009 IP
  8. babynl

    babynl Member

    Messages:
    357
    Likes Received:
    1
    Best Answers:
    1
    Trophy Points:
    40
    #8
    yea true im writing the script for it.
    the possibilities are pretty big
     
    babynl, Sep 23, 2009 IP