Text

The text encoding used is a part of UTF-8. The part used is compatible with commonly used western text encodings: ASCII, ISO/IEC 8859-1 and Windows-1252.

Characters

The only allowed characters are 9, 10, 13 and 32-126, inclusive. Each element of the file must fit one character. The elements can be bytes, but need not be. Program files with element values other than these are invalid. All valid files are valid UTF-8 files.

Decimal Hex Character
9 0x9 Horizontal Tab, \t
10 0xA Line Feed, \n
13 0xD Carriage Return*, \r
32 0x20 Space
33 0x21 !
34 0x22
35 0x23 #
36 0x24 $
37 0x25 %
38 0x26 &
39 0x27
40 0x28 (
41 0x29 )
42 0x2A *
43 0x2B +
44 0x2C ,
45 0x2D
46 0x2E .
47 0x2F /
48 0x30 0
49 0x31 1
50 0x32 2
51 0x33 3
52 0x34 4
53 0x35 5
54 0x36 6
55 0x37 7
56 0x38 8
57 0x39 9
58 0x3A :
59 0x3B ;
60 0x3C <
61 0x3D =
62 0x3E >
63 0x3F ?
64 0x40 @
65 0x41 A
66 0x42 B
67 0x43 C
68 0x44 D
69 0x45 E
70 0x46 F
71 0x47 G
72 0x48 H
73 0x49 I
74 0x4A J
75 0x4B K
76 0x4C L
77 0x4D M
78 0x4E N
79 0x4F O
80 0x50 P
81 0x51 Q
82 0x52 R
83 0x53 S
84 0x54 T
85 0x55 U
86 0x56 V
87 0x57 W
88 0x58 X
89 0x59 Y
90 0x5A Z
91 0x5B [
92 0x5C |
93 0x5D ]
94 0x5E ^
95 0x5F _
96 0x60 `
97 0x61 a
98 0x62 b
99 0x63 c
100 0x64 d
101 0x65 e
102 0x66 f
103 0x67 g
104 0x68 h
105 0x69 i
106 0x6A j
107 0x6B k
108 0x6C l
109 0x6D m
110 0x6E n
111 0x6F o
112 0x70 p
113 0x71 q
114 0x72 r
115 0x73 s
116 0x74 t
117 0x75 u
118 0x76 v
119 0x77 w
120 0x78 x
121 0x79 y
122 0x7A z
123 0x7B {
124 0x7C |
125 0x7D }
126 0x7E ~

* Se notes on newlines.

Notes on newlines

Files must use a newline scheme consistently. A file must either:

  • Use only \n, which is read as is.
  • Use only \r\n, which is read as \n.
  • Use only \r, which is read as \n.