ascii and unicode
clark@hushmail.com | updates closed
ascii

ascii is an acronym for "american standard code for information interchange". used for storing/displaying text, simple formatting and a few other control characters. below is the ascii table, giving the decimal value, the hexidecimal value, the glyph and the binary value for each. the binary value is how its stored on a disk.

if you are interested in html codes; click here
000 00 00000000
001 01 00000001
002 02 00000010
003 03 00000011
004 04 00000100
005 05 00000101
006 06 00000110
007 07 00000111
008 08 00001000
009 09 00001001
010 0A 00001010
011 0B 00001011
012 0C 00001100
013 0D 00001101
014 0E 00001110
015 0F 00001111
016 10 00010000
017 11 00010001
018 12 00010010
019 13 00010011
020 14 00010100
021 15 00010101
022 16 00010110
023 17 00010111
024 18 00011000
025 19 00011001
026 1A 00011010
027 1B 00011011
028 1C 00011100
029 1D 00011101
030 1E 00011110
031 1F 00011111
032 20 00100000
033 21 00100001
034 22 00100010
035 23 00100011
036 24 00100100
037 25 00100101
038 26 00100110
039 27 00100111
040 28 00101000
041 29 00101001
042 2A 00101010
043 2B 00101011
044 2C 00101100
045 2D 00101101
046 2E 00101110
047 2F 00101111
048 30 00110000
049 31 00110001
050 32 00110010
051 33 00110011
052 34 00110100
053 35 00110101
054 36 00110110
055 37 00110111
056 38 00111000
057 39 00111001
058 3A 00111010
059 3B 00111011
060 3C 00111100
061 3D 00111101
062 3E 00111110
063 3F 00111111
064 40 01000000
065 41 01000001
066 42 01000010
067 43 01000011
068 44 01000100
069 45 01000101
070 46 01000110
071 47 01000111
072 48 01001000
073 49 01001001
074 4A 01001010
075 4B 01001011
076 4C 01001100
077 4D 01001101
078 4E 01001110
079 4F 01001111
080 50 01010000
081 51 01010001
082 52 01010010
083 53 01010011
084 54 01010100
085 55 01010101
086 56 01010110
087 57 01010111
088 58 01011000
089 59 01011001
090 5A 01011010
091 5B 01011011
092 5C 01011100
093 5D 01011101
094 5E 01011110
095 5F 01011111
096 60 01100000
097 61 01100001
098 62 01100010
099 63 01100011
100 64 01100100
101 65 01100101
102 66 01100110
103 67 01100111
104 68 01101000
105 69 01101001
106 6A 01101010
107 6B 01101011
108 6C 01101100
109 6D 01101101
110 6E 01101110
111 6F 01101111
112 70 01110000
113 71 01110001
114 72 01110010
115 73 01110011
116 74 01110100
117 75 01110101
118 76 01110110
119 77 01110111
120 78 01111000
121 79 01111001
122 7A 01111010
123 7B 01111011
124 7C 01111100
125 7D 01111101
126 7E 01111110
127 7F 01111111
0-31 and 127 are control codes. the glyph is an onscreen representation
000 NUL - null
001 SOH - start of header
002 STX - start of text
003 ETX - end of text
004 EOT - end of transmission
005 ENQ - enquire
006 ACK - acknowledge
007 BEL - bell
008 BS  - backspace
009 HT  - horizontal tabulation
010 LF  - line feed
011 VT  - vertical tabulation
012 FF  - form feed
013 CR  - carriage return
014 SO  - shift out
015 SI  - shift in
016 DLE - data link escape
017 DC1 - device control one
018 DC2 - device control two
019 DC3 - device control three
020 DC4 - device control four
021 NAK - negative acknowledge
022 SYN - synchronous idle
023 ETB - end of transmission block
024 CAN - cancel
025 EM  - end of medium
026 SIB - substitute
027 ESC - escape
028 FS  - file separator
029 GS  - group separator
030 RS  - record separator
031 US  - unit separator

127 DEL - delete
128 to 255 vary; from dos or notepad (font == terminal)
128 80 10000000
129 81 10000001
130 82 10000010
131 83 10000011
132 84 10000100
133 85 10000101
134 86 10000110
135 87 10000111
136 88 10001000
137 89 10001001
138 8A 10001010
139 8B 10001011
140 8C 10001100
141 8D 10001101
142 8E 10001110
143 8F 10001111
144 90 10010000
145 91 10010001
146 92 10010010
147 93 10010011
148 94 10010100
149 95 10010101
150 96 10010110
151 97 10010111
152 98 10011000
153 99 10011001
154 9A 10011010
155 9B 10011011
156 9C 10011100
157 9D 10011101
158 9E 10011110
159 9F 10011111
160 A0 10100000
161 A1 10100001
162 A2 10100010
163 A3 10100011
164 A4 10100100
165 A5 10100101
166 A6 10100110
167 A7 10100111
168 A8 10101000
169 A9 10101001
170 AA 10101010
171 AB 10101011
172 AC 10101100
173 AD 10101101
174 AE 10101110
175 AF 10101111
176 B0 10110000
177 B1 10110001
178 B2 10110010
179 B3 10110011
180 B4 10110100
181 B5 10110101
182 B6 10110110
183 B7 10110111
184 B8 10111000
185 B9 10111001
186 BA 10111010
187 BB 10111011
188 BC 10111100
189 BD 10111101
190 BE 10111110
191 BF 10111111
192 C0 11000000
193 C1 11000001
194 C2 11000010
195 C3 11000011
196 C4 11000100
197 C5 11000101
198 C6 11000110
199 C7 11000111
200 C8 11001000
201 C9 11001001
202 CA 11001010
203 CB 11001011
204 CC 11001100
205 CD 11001101
206 CE 11001110
207 CF 11001111
208 D0 11010000
209 D1 11010001
210 D2 11010010
211 D3 11010011
212 D4 11010100
213 D5 11010101
214 D6 11010110
215 D7 11010111
216 D8 11011000
217 D9 11011001
218 DA 11011010
219 DB 11011011
220 DC 11011100
221 DD 11011101
222 DE 11011110
223 DF 11011111
224 E0 11100000
225 E1 11100001
226 E2 11100010
227 E3 11100011
228 E4 11100100
229 E5 11100101
230 E6 11100110
231 E7 11100111
232 E8 11101000
233 E9 11101001
234 EA 11101010
235 EB 11101011
236 EC 11101100
237 ED 11101101
238 EE 11101110
239 EF 11101111
240 F0 11110000
241 F1 11110001
242 F2 11110010
243 F3 11110011
244 F4 11110100
245 F5 11110101
246 F6 11110110
247 F7 11110111
248 F8 11111000
249 F9 11111001
250 FA 11111010
251 FB 11111011
252 FC 11111100
253 FD 11111101
254 FE 11111110
255 FF 11111111
unicode

ascii has been superseded by unicode, a double byte character system designed to store and display a much wider range of letters.(65,536) the extra include foreign languages and mathematical/scientific symbols, plus space for future expansion.

all ascii characters still exist in unicode, either prefixed or suffixed by a null. the difference being big-endian or little-endian respectfully. for more on endians check http://www.noveltheory.com/TechPapers/endian.asp. unicode text documents have a two byte header to inform the system of which endian is used.

                     68,65,6c,6c,6f = hello (ascii)
fe,ff,00,68,00,65,00,6c,00,6c,00,6f = hello (big-endian)
ff,fe,68,00,65,00,6c,00,6c,00,6f,00 = hello (little-endian)

unicode is still under development - check http://www.unicode.org for details.

to see how unicode is currently displayed on your system or within a particular program. download ascii and unicode files (1.07mb)

opening the unicode files may take a lot of processor power. you may want to add other foreign languages. in nt see regional settings, you will need the setup cd.
you must get permission from the respective author before reproduction