The text encoding used is a part of UTF-8. The part used is compatible with commonly used western text encodings: ASCII, ISO/IEC 8859-1 and Windows-1252.
Characters
The only allowed characters are 9, 10, 13 and 32-126, inclusive. Each element of the file must fit one character. The elements can be bytes, but need not be. Program files with element values other than these are invalid. All valid files are valid UTF-8 files.
Decimal | Hex | Character |
---|---|---|
9 | 0x9 | Horizontal Tab, \t |
10 | 0xA | Line Feed, \n |
13 | 0xD | Carriage Return*, \r |
32 | 0x20 | Space |
33 | 0x21 | ! |
34 | 0x22 | “ |
35 | 0x23 | # |
36 | 0x24 | $ |
37 | 0x25 | % |
38 | 0x26 | & |
39 | 0x27 | ‘ |
40 | 0x28 | ( |
41 | 0x29 | ) |
42 | 0x2A | * |
43 | 0x2B | + |
44 | 0x2C | , |
45 | 0x2D | – |
46 | 0x2E | . |
47 | 0x2F | / |
48 | 0x30 | 0 |
49 | 0x31 | 1 |
50 | 0x32 | 2 |
51 | 0x33 | 3 |
52 | 0x34 | 4 |
53 | 0x35 | 5 |
54 | 0x36 | 6 |
55 | 0x37 | 7 |
56 | 0x38 | 8 |
57 | 0x39 | 9 |
58 | 0x3A | : |
59 | 0x3B | ; |
60 | 0x3C | < |
61 | 0x3D | = |
62 | 0x3E | > |
63 | 0x3F | ? |
64 | 0x40 | @ |
65 | 0x41 | A |
66 | 0x42 | B |
67 | 0x43 | C |
68 | 0x44 | D |
69 | 0x45 | E |
70 | 0x46 | F |
71 | 0x47 | G |
72 | 0x48 | H |
73 | 0x49 | I |
74 | 0x4A | J |
75 | 0x4B | K |
76 | 0x4C | L |
77 | 0x4D | M |
78 | 0x4E | N |
79 | 0x4F | O |
80 | 0x50 | P |
81 | 0x51 | Q |
82 | 0x52 | R |
83 | 0x53 | S |
84 | 0x54 | T |
85 | 0x55 | U |
86 | 0x56 | V |
87 | 0x57 | W |
88 | 0x58 | X |
89 | 0x59 | Y |
90 | 0x5A | Z |
91 | 0x5B | [ |
92 | 0x5C | \ |
93 | 0x5D | ] |
94 | 0x5E | ^ |
95 | 0x5F | _ |
96 | 0x60 | ` |
97 | 0x61 | a |
98 | 0x62 | b |
99 | 0x63 | c |
100 | 0x64 | d |
101 | 0x65 | e |
102 | 0x66 | f |
103 | 0x67 | g |
104 | 0x68 | h |
105 | 0x69 | i |
106 | 0x6A | j |
107 | 0x6B | k |
108 | 0x6C | l |
109 | 0x6D | m |
110 | 0x6E | n |
111 | 0x6F | o |
112 | 0x70 | p |
113 | 0x71 | q |
114 | 0x72 | r |
115 | 0x73 | s |
116 | 0x74 | t |
117 | 0x75 | u |
118 | 0x76 | v |
119 | 0x77 | w |
120 | 0x78 | x |
121 | 0x79 | y |
122 | 0x7A | z |
123 | 0x7B | { |
124 | 0x7C | | |
125 | 0x7D | } |
126 | 0x7E | ~ |
*
Se notes on newlines.
Notes on newlines
Files must use a newline scheme consistently. A file must either:
- Use only
\n
, which is read as is. - Use only
\r\n
, which is read as\n
. - Use only
\r
, which is read as\n
.