I need a text file with every known computer character/symbol in it

Discussion in 'Programming' started by joshvelco, May 25, 2010.

  1. #1
    Hi everyone. I recently found a really interesting piece of software which allows the user to encrypt and decrypt files and text. I saw great potential in the concept, but the porduct i was left with didn't really do what I wanted. So, I have decided to build my own piece of software (it basically will encrypt text, and the contents of a file using a substitution cipher). The program will take every plain text character e.g 0123456789 abcdefghijklmnopqrstuvwxyz ABCDEFGHIJKLMNOPQRSTUVWXYZ !"£$€^&*()-+=?:;@'[]{}~#/
    etc. Basically all characters that you can type noramlly using an english keyboard; then substitute each character with a random string. the user will be able to define the length of the string e.g 8 characters or whatever. The only problem is, when each character is converted to the string, at the moment, I can only use plain text characters within the string itself, and a few alt+ keys that I know. I was wondering if anyone had a complete list of characters: plaintext, special symbols, and foreign language symbols e.g russian, hini-urdu, chinese, japanese, korean etc. If anyone does have one please reply here so that anyone else wishing to make use of the resource can also do so. If it is a .txt file all the better so that I don't have to copy and paste each character individually into my text editor.

    Thanks very much, Josh.
     
    joshvelco, May 25, 2010 IP
  2. plog

    plog Peon

    Messages:
    298
    Likes Received:
    11
    Best Answers:
    1
    Trophy Points:
    0
    #2
    Here's a really quick answer--you can't really do what you want to do. Every character is stored as a number and when it gets displayed that number gets run through a character encoding scheme to see what to actually show. If you run 68 through the ASCII encoding scheme 'D' gets displayed, in another 68 may display something else. That means you have to pick one encoding scheme and stick with it because your data is just numbers that map back to the specific encoding scheme you are using.

    Google 'character encoding' to learn more about it. For now, here's an overview and list of common encoding schemes from wikipedia:

    http://en.wikipedia.org/wiki/Character_encoding

    Here's an in-depth article that explains it better:

    http://www.cs.tut.fi/~jkorpela/chars.html
     
    plog, May 25, 2010 IP