跳转至

附录 D(规范性)字符集和编码

Annex D (normative) Character Sets and Encodings

D.1 概述

D.1 General

本附录列出了任何兼容阅读器中应预先定义的字符集和编码。这里介绍了包含拉丁文本和一些符号的简单字体。有关CID键字体的预定义CMaps列表,请参见9.7.5.2,“预定义CMaps”。

D.2,“拉丁字符集和编码”,描述了Adobe标准拉丁文本字体的整个字符集。该字符集应由标准14种预定义字体中的Times、Helvetica和Courier字体系列支持;请参见9.6.2.2,“标准Type 1字体(标准14字体)”。对于每个命名字符,为四种不同的编码定义了一个八进制字符代码:StandardEncodingMacRomanEncodingWinAnsiEncodingPDFDocEncoding(请参见表D.1)。未编码的字符用连字符(—)表示。

D.3,“PDFDocEncoding字符集”,描述了可以使用PDFDocEncoding表示的所有字符的完整集合。它按数字顺序呈现这些字符,并描述了每个字符的Unicode表示形式。此表与D.2,“拉丁字符集和编码”中呈现的八进制字符代码信息有重叠。

D.4,“专家集和MacExpertEncoding”,描述了“专家”字符集,其中包含对复杂排版有用的额外字符,如小型大写字母、连字和分数。对于每个命名字符,在MacExpertEncoding中给出了一个八进制字符代码。

注意

专家字体程序中的内置编码可能与MacExpertEncoding不同。

D.5,“符号集和编码”,以及D.6,“ZapfDingbats集和编码”,描述了Symbol和ZapfDingbats(ITC Zapf Dingbats)字体程序的字符集和内置编码,这些字体应是标准14种预定义字体之一。这些字体具有各自独特的编码。由于该字体的名称没有意义,因此ZapfDingbats的字符按代码而非名称排序。

表D.1 – 拉丁文本编码
编码 描述
StandardEncoding Adobe标准拉丁文本编码。这是Type 1拉丁文本字体程序中定义的内置编码(但通常不在TrueType字体程序中定义)。兼容阅读器不应有名为StandardEncoding的预定义编码。然而,有必要描述这种编码,因为字体的内置编码可以用作编码字典中指定差异的基础编码。
MacRomanEncoding Mac OS用于西文书写系统中拉丁文本的标准编码。兼容阅读器应有名为MacRomanEncoding的预定义编码,该编码可用于Type 1和TrueType字体。
WinAnsiEncoding Windows代码页1252,通常称为“Windows ANSI”编码。这是西文书写系统中拉丁文本的标准Windows编码。兼容阅读器应有名为WinAnsiEncoding的预定义编码,该编码可用于Type 1和TrueType字体。
PDFDocEncoding 用于PDF文档内容流之外文本字符串的编码。这是可用于表示文本字符串的两种编码之一(另一种是Unicode);请参见7.9.2.2,“文本字符串类型”。PDF没有名为PDFDocEncoding的预定义编码;通常不使用这种编码来显示字体中的文本。
MacExpertEncoding 用于专家字体(包含专家字符集的字体)的编码。兼容阅读器应有名为MacExpertEncoding的预定义编码。尽管名称如此,但它不是特定于平台的编码;然而,只有某些字体具有适合与此编码一起使用的字符集。标准14种预定义字体中没有此类字体。

This annex lists the character sets and encodings that shall be predefined in any conforming reader. Simple fonts, encompassing Latin text and some symbols, are described here. See 9.7.5.2, "Predefined CMaps" for a list of predefined CMaps for CID-keyed fonts.

D.2, "Latin Character Set and Encodings", describes the entire character set for the Adobe standard Latin-text fonts. This character set shall be supported by the Times, Helvetica, and Courier font families, which are among the standard 14 predefined fonts; see 9.6.2.2, "Standard Type 1 Fonts (Standard 14 Fonts)". For each named character, an octal character code is defined for four different encodings: StandardEncoding, MacRomanEncoding, WinAnsiEncoding, and PDFDocEncoding (see Table D.1). Unencoded characters are indicated by a dash (—).

D.3, "PDFDocEncoding Character Set", describes the entire set of characters that can be represented using PDFDocEncoding. It presents these characters in numerical order and it describes the Unicode representation of each character. This table overlaps the information presented in D.2, "Latin Character Set and Encodings", with respect to the presented octal character codes.

D.4, "Expert Set and MacExpertEncoding", describes the “expert” character set, which contains additional characters useful for sophisticated typography, such as small capitals, ligatures, and fractions. For each named character, an octal character code is given in MacExpertEncoding.

NOTE

The built-in encoding in an expert font program may be different from MacExpertEncoding.

D.5, "Symbol Set and Encoding", and D.6, "ZapfDingbats Set and Encoding", describe the character sets and built-in encodings for the Symbol and ZapfDingbats (ITC Zapf Dingbats) font programs, which shall be among the standard 14 predefined fonts. These fonts have built-in encodings that are unique to each font. The characters for ZapfDingbats are ordered by code instead of by name, since the names in that font are meaningless.

Table D.1 – Latin-text encodings
Encoding Description
StandardEncoding Adobe standard Latin-text encoding. This is the built-in encoding defined in Type 1 Latin-text font programs (but generally not in TrueType font programs). Conforming readers shall not have a predefined encoding named StandardEncoding. However, it is necessary to describe this encoding, since a font’s built-in encoding can be used as the base encoding from which differences may be specified in an encoding dictionary.
MacRomanEncoding Mac OS standard encoding for Latin text in Western writing systems. Conforming readers shall have a predefined encoding named MacRomanEncoding that may be used with both Type 1 and TrueType fonts.
WinAnsiEncoding Windows Code Page 1252, often called the “Windows ANSI” encoding. This is the standard Windows encoding for Latin text in Western writing systems. Conforming readers shall have a predefined encoding named WinAnsiEncoding that may be used with both Type 1 and TrueType fonts.
PDFDocEncoding Encoding for text strings in a PDF document outside the document’s content streams. This is one of two encodings (the other being Unicode) that may be used to represent text strings; see 7.9.2.2, "Text String Type". PDF does not have a predefined encoding named PDFDocEncoding; it is not customary to use this encoding to show text from fonts.
MacExpertEncoding An encoding for use with expert fonts—ones containing the expert character set. Conforming readers shall have a predefined encoding named MacExpertEncoding. Despite its name, it is not a platform- specific encoding; however, only certain fonts have the appropriate character set for use with this encoding. No such fonts are among the standard 14 predefined fonts.

D.2 拉丁字符集和编码

D.2 Latin Character Set and Encodings

CHAR NAME CHAR CODE (OCTAL)
STD MAC WIN PDF
A A 101 101101101
Æ AE 341 256306306
Á Aacute 347301301
 Acircumflex 345302302
Ä Adieresis 200304304
À Agrave 313300300
Å Aring 201305305
à Atilde 314303303
B B 102 102102102
C C 103 103103103
Ç Ccedilla 202307307
D D 104 104104104
E E 105 105105105
É Eacute 203311311
Ê Ecircumflex 346312312
Ë Edieresis 350313313
È Egrave 351310310
Ð Eth 320320
Euro 1 200240
F F 106 106106106
G G 107 107107107
H H 110 110110110
I I 111 111111111
Í Iacute 352315315
Î Icircumflex 353316316
Ï Idieresis 354317317
Ì Igrave 355314314
J J 112 112112112
K K 113 113113113
L L 114 114114114
Ł Lslash 350 225
M M 115 115115115
N N 116 116116116
Ñ Ntilde 204321321
O O 117 117117117
ŒOE352316214226
ÓOacute356323323
ÔOcircumflex357324324
ÖOdieresis205326326
ÒOgrave361322322
ØOslash351257330330
ÕOtilde315325325
PP120120120120
QQ121121121121
RR122122122122
SS123123123123
ŠScaron212227
TT124124124124
ÞThorn336336
UU125125125125
ÚUacute362332332
ÛUcircumflex363333333
ÜUdieresis206334334
ÙUgrave364331331
VV126126126126
WW127127127127
XX130130130130
YY131131131131
ÝYacute335335
ŸYdieresis331237230
ZZ132132132132
ŽZcaron 2216231
aa141141141141
áaacute207341341
âacircumflex211342342
´acute302253264264
äadieresis212344344
æae361276346346
àagrave210340340
&ampersand046046046046
åaring214345345
^asciicircum136136136136
~asciitilde176176176176
*asterisk052052052052
@at100100100100
ãatilde213343343
bb142142142142
\backslash134134134134
|bar174174174174
{braceleft173173173173
}braceright175175175175
[bracketleft133133133133
]bracketright135135135135
˘breve306371030
¦brokenbar246246
bullet 3267245225200
cc143143143143
ˇcaron317377031
çccedilla215347347
¸cedilla313374270270
¢cent242242242242
ˆcircumflex303366210032
:colon072072072072
,comma054054054054
©copyright251251251
¤currency 1250333244244
dd144144144144
dagger262240206201
daggerdbl263340207202
°degree241260260
¨dieresis310254250250
÷divide326367367
$dollar044044044044
˙dotaccent307372033
ıdotlessi365365232
ee145145145145
éeacute216351351
êecircumflex220352352
ëedieresis221353353
èegrave217350350
8eight 070070070070
ellipsis 274311205203
emdash 320321227204
endash 261320226205
=equal 075075075075
ðeth360360
!exclam 041041041041
¡exclamdown 241301241241
ff 146146146146
fi 256336223
5five 065065065065
fl 257337224
ƒflorin 246304203206
4four 064064064064
fraction 244332207
gg 147147147147
ßgermandbls 373247337337
`grave 301140140140
>greater 076076076076
«guillemotleft4 253307253253
»guillemotright4 273310273273
guilsinglleft 254334213210
guilsinglright 255335233211
hh 150150150150
˝hungarumlaut 315375034
-hyphen5 055055055055
ii 151151151151
íiacute222355355
îicircumflex224356356
ïidieresis225357357
ìigrave223354354
jj 152152152152
kk 153153153153
ll 154154154154
<less074074074074
¬logicalnot302254254
łlslash370233
mm155155155155
¯macron305370257257
minus212
μmu265265265
×multiply327327
nn156156156156
9nine071071071071
ñntilde226361361
#numbersign043043043043
oo157157157157
óoacute227363363
ôocircumflex231364364
öodieresis232366366
œoe372317234234
˛ogonek316376035
òograve230362362
1one061061061061
½onehalf275275
¼onequarter274274
¹onesuperior271271
ªordfeminine343273252252
ºordmasculine353274272272
øoslash371277370370
õotilde233365365
pp160160160160
paragraph266246266266
(parenleft050050050050
)parenright051051051051
%percent045045045045
.period056056056056
·periodcentered264341267267
perthousand275344211213
+plus053053053053
±plusminus261261261
qq161161161161
?question077077077077
¿questiondown277300277277
"quotedbl042042042042
quotedblbase271343204214
quotedblleft252322223215
quotedblright272323224216
quoteleft140324221217
quoteright047325222220
quotesinglbase270342202221
'quotesingle251047047047
rr162162162162
®registered250256256
˚ring312373036
ss163163163163
šscaron232235
§section247244247247
;semicolon073073073073
7seven067067067067
6six066066066066
/slash057057057057
 space6040040040040
£sterling243243243243
tt164164164164
þthorn376376
3three063063063063
¾threequarters276276
³threesuperior263263
˜tilde304367230037
trademark252231222
2two062062062062
²twosuperior262262
uu165165165165
úuacute234372372
ûucircumflex236373373
üudieresis237374374
ùugrave235371371
_underscore137137137137
vv166166166166
ww167167167167
xx170170170170
yy171171171171
ýyacute375375
ÿydieresis330377377
¥yen245264245245
zz172172172172
žzcaron2236236
0zero060060060060
  1. 在PDF 1.3中,欧元字符被添加到Adobe标准拉丁字符集中。它在WinAnsiEncoding中被编码为200,在PDFDocEncoding中被编码为240,分配了之前未使用的代码。苹果公司将Mac OS拉丁文本编码中代码333从货币字符更改为欧元字符。然而,这种不兼容的更改并未反映在PDF的MacRomanEncoding中,该编码应继续将代码333映射为货币字符。如果需要欧元字符,可以使用编码字典来指定与MacRomanEncoding的这一单个差异。

  2. 在PDF 1.3中,现有的Zcaron和zcaron字符作为之前未使用的代码216和236被添加到WinAnsiEncoding中。

  3. 在WinAnsiEncoding中,所有大于40的未使用代码都映射到项目符号字符。然而,只有代码225被专门指定为项目符号字符;其他代码可能会在未来重新分配。

  4. 字符名称guillemotleft和guillemotright拼写错误。这个标点符号的正确拼写是guillemet。然而,实际包含这些字符的字体和编码中使用的是拼写错误的名称。

  5. 连字符在WinAnsiEncoding中也被编码为255。这个重复代码的含义应为“软连字符”,但在排版上应与连字符相同。

  6. SPACE字符在MacRomanEncoding中也被编码为312,在WinAnsiEncoding中被编码为240。这个重复代码表示不换行空格;在排版上应与(U+003A) SPACE相同 。

  1. In PDF 1.3, the euro character was added to the Adobe standard Latin character set. It shall be en- coded as 200 in WinAnsiEncoding and 240 in PDFDocEncoding, assigning codes that were pre- viously unused. Apple changed the Mac OS Latin-text encoding for code 333 from the currency character to the euro character. However, this incompatible change has not been reflected in PDF’s MacRomanEncoding, which shall continue to map code 333 to currency. If the euro character is desired, an encoding dictionary may be used to specify this single difference from MacRomanEn-coding.
  2. In PDF 1.3, the existing Zcaron and zcaron characters were added to WinAnsiEncoding as the previously unused codes 216 and 236.
  3. In WinAnsiEncoding, all unused codes greater than 40 map to the bullet character. However, only code 225 shall be specifically assigned to the bullet character; other codes are subject to future re- assignment.
  4. The character names guillemotleft and guillemotright are misspelled. The correct spelling for this punctuation character is guillemet. However, the misspelled names are the ones actually used in the fonts and encodings containing these characters.
  5. The hyphen character is also encoded as 255 in WinAnsiEncoding. The meaning of this duplicate code shall be “soft hyphen,” but it shall be typographically the same as hyphen.
  6. The SPACE character shall also be encoded as 312 in MacRomanEncoding and as 240 in WinAnsiEncoding. This duplicate code shall signify a nonbreaking space; it shall be typographi- cally the same as (U+003A) SPACE.

D.3 PDFDocEncoding 字符集

D.3 PDFDocEncoding Character Set

标题为注释的列使用以下缩写:

U   PDFDocEncoding 中的 未定义 代码点

SR   Unicode 代码点,在某些情况下可能需要在 XML 中使用 特殊表示

The column titled Notes uses the following abbreviations:

U   Undefined code point in PDFDocEncoding

SR   Unicode codepoint that may require special representation in XML in some contexts.

Table D.2 – PDFDocEncoding Character Set
Character Dec Hex Octal Unicode Unicode character name or
(alternative alias)
Notes
^@ 0 0x00 0000 U+0000 (NULL) U
^A 1 0x01 0001 U+0001 (START OF HEADING) U
^B 2 0x02 0002 U+0002 (START OF TEXT) U
^C 3 0x03 0003 U+0003 (END OF TEXT) U
^D 4 0x04 0004 U+0004 (END OF TEXT) U
^E 5 0x05 0005 U+0005 (END OF TRANSMISSION) U
^F 6 0x06 0006 U+0006 (ACKNOWLEDGE) U
^G 7 0x07 0007 U+0007 (BELL) U
^H 8 0x08 0010 U+0008 (BACKSPACE) U
^I 9 0x09 0011 U+0009 (CHARACTER TABULATION) SR
^J 10 0x0a 0012 U+000A (LINE FEED) SR
^K 11 0x0b 0013 U+000B (LINE TABULATION) U
^L 12 0x0c 0014 U+000C (FORM FEED) U
^M 13 0x0d 0015 U+000D (CARRIAGE RETURN) SR
^N 14 0x0e 0016 U+000E (SHIFT OUT) U
^O 15 0x0f 0017 U+000F (SHIFT IN) U
^P 16 0x10 0020 U+0010 (DATA LINK ESCAPE) U
^Q 17 0x11 0021 U+0011 (DEVICE CONTROL ONE) U
^R 18 0x12 0022 U+0012 (DEVICE CONTROL TWO) U
^S 19 0x13 0023 U+0013 (DEVICE CONTROL THREE) U
^T 20 0x14 0024 U+0014 (DEVICE CONTROL FOUR) U
^U 21 0x15 0025 U+0015 (NEGATIVE ACKNOWLEDGE) U
^V 22 0x16 0026 U+0017 (SYNCRONOUS IDLE) U
^W 23 0x17 0027 U+0017 (END OF TRANSMISSION BLOCK) U
u 24 0x18 0030 U+02D8 BREVE
v 25 0x19 0031 U+02C7 CARON
^ 26 0x1a 0032 U+02C6 MODIFIER LETTER CIRCUMFLEX ACCENT
· 27 0x1b 0033 U+02D9 DOT ABOVE
28 0x1c 0034 U+02DD DOUBLE ACUTE ACCENT
, 29 0x1d 0035 U+02DB OGONEK
° 30 0x1e 0036 U+02DA RING ABOVE
~ 31 0x1f 0037 U+02DC SMALL TILDE
32 0x20 0040 U+0020 SPACE ( )
! 33 0x21 0041 U+0021 EXCLAMATION MARK SR
" 34 0x22 0042 U+0022 QUOTATION MARK (") SR
# 35 0x23 0043 U+0023 NUMBER SIGN
$ 36 0x24 0044 U+0024 DOLLAR SIGN
% 37 0x25 0045 U+0025 PERCENT SIGN
& 38 0x26 0046 U+0026 AMPERSAND (&)
' 39 0x27 0047 U+0027 APOSTROPHE (')
( 40 0x28 0050 U+0028 LEFT PARENTHESIS
) 41 0x29 0051 U+0029 RIGHT PARENTHESIS
* 42 0x2a 0052 U+002A ASTERISK
+ 43 0x2b 0053 U+002B PLUS SIGN
, 44 0x2c 0054 U+002C COMMA
- 45 0x2d 0055 U+002D HYPHEN-MINUS
. 46 0x2e 0056 U+002E FULL STOP (PERIOD)
/ 47 0x2f 0057 U+002F SOLIDUS (slash)
0 48 0x30 0060 U+0030 DIGIT ZERO
1 49 0x31 0061 U+0031 DIGIT ONE
2 50 0x32 0062 U+0032 DIGIT TWO
3 51 0x33 0063 U+0033 DIGIT THREE
4 52 0x34 0064 U+0034 DIGIT FOUR
5 53 0x35 0065 U+0035 DIGIT FIVE
6 54 0x36 0066 U+0036 DIGIT SIX
7 55 0x37 0067 U+0037 DIGIT SEVEN
8 56 0x38 0070 U+0038 DIGIT EIGJT
9 57 0x39 0071 U+0039 DIGIT NINE
: 58 0x3a 0072 U+003A COLON
; 59 0x3b 0073 U+003B SEMICOLON
< 60 0x3c 0074 U+003C LESS THAN SIGN (<) SR
= 61 0x3d 0075 U+003D EQUALS SIGN
> 62 0x3e 0076 U+003E GREATER THAN SIGN (>)
? 63 0x3f 0077 U+003F QUESTION MARK
@ 64 0x40 0100 U+0040 COMMERCIAL AT
A 65 0x41 0101 U+0041
B 66 0x42 0102 U+0042
C 67 0x43 0103 U+0043
D 68 0x44 0104 U+0044
E 69 0x45 0105 U+0045
F 70 0x46 0106 U+0046
G 71 0x47 0107 U+0047
H 72 0x48 0110 U+0048
I 73 0x49 0111 U+0049
J 74 0x4a 0112 U+004A
K 75 0x4b 0113 U+004B
L 76 0x4c 0114 U+004C
M 77 0x4d 0115 U+004D
N 78 0x4e 0116 U+004E
O 79 0x4f 0117 U+004F
P 80 0x50 0120 U+0050
Q 81 0x51 0121 U+0051
R 82 0x52 0122 U+0052
S 83 0x53 0123 U+0053
T 84 0x54 0124 U+0054
U 85 0x55 0125 U+0055
V 86 0x56 0126 U+0056
W 87 0x57 0127 U+0057
X 88 0x58 0130 U+0058
Y 89 0x59 0131 U+0059
Z 90 0x5a 0132 U+005A
[ 91 0x5b 0133 U+005 BLEFT SQUARE BRACKET
\ 92 0x5c 0134 U+005 CREVERSE SOLIDUS (backslash)
] 93 0x5d 0135 U+005 DRIGHT SQUARE BRACKET
^ 94 0x5e 0136 U+005 ECIRCUMFLEX ACCENT (hat)
_ 95 0x5f 0137 U+005 FLOW LINE (SPACING UNDERSCORE)
` 96 0x60 0140 U+006 0GRAVE ACCENT
a 97 0x61 0141 U+0061
b 98 0x62 0142 U+0062
c 99 0x63 0143 U+0063
d 100 0x64 0144 U+0064
e 101 0x65 0145 U+0065
f 102 0x66 0146 U+0066
g 103 0x67 0147 U+0067
h 104 0x68 0150 U+0068
i 105 0x69 0151 U+0069
j 106 0x6a 0152 U+006A
k 107 0x6b 0153 U+006B
l 108 0x6c 0154 U+006C
m 109 0x6d 0155 U+006D
n 110 0x6e 0156 U+006E
o 111 0x6f 0157 U+006F
p 112 0x70 0160 U+0070
q 113 0x71 0161 U+0071
r 114 0x72 0162 U+0072
s 115 0x73 0163 U+0073
t 116 0x74 0164 U+0074
u 117 0x75 0165 U+0075
v 118 0x76 0166 U+0076
w 119 0x77 0167 U+0077
x 120 0x78 0170 U+0078
y 121 0x79 0171 U+0079
z 122 0x7a 0172 U+007A
{ 123 0x7b 0173 U+007B LEFT CURLY BRACKET
| 124 0x7c 0174 U+007C VERTICAL LINE
} 125 0x7d 0175 U+007D RIGHT CURLY BRACKET
~ 126 0x7e 0176 U+007E TILDE
127 0x7f 0177 Undefined U
128 0x80 0200 U+2022 BULLET
129 0x81 0201 U+2020 DAGGER
130 0x82 0202 U+2021 DOUBLE DAGGER
131 0x83 0203 U+2026 HORIZONTAL ELLIPSIS
132 0x84 0204 U+2014 EM DASH
133 0x85 0205 U+2013 EN DASH
ƒ 134 0x86 0206 U+0192
135 0x87 0207 U+2044 FRACTION SLASH (solidus)
136 0x88 0210 U+2039 SINGLE LEFT-POINTING ANGLE QUOTATION MARK
137 0x89 0211 U+203A SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
Š 138 0x8a 0212 U+2212
139 0x8b 0213 U+2030 PER MILLE SIGN
140 0x8c 0214 U+201E DOUBLE LOW-9 QUOTATION MARK (quotedblbase)
141 0x8d 0215 U+201C LEFT DOUBLE QUOTATION MARK (doublequote left)
142 0x8e 0216 U+201D RIGHT DOUBLE QUOTATION MARK (quotedblright)
143 0x8f 0217 U+2018 LEFT SINGLE QUOTATION MARK (quoteleft)
144 0x90 0220 U+2019 RIGHT SINGLE QUOTATION MARK (quoteright)
145 0x91 0221 U+201A SINGLE LOW-9 QUOTATION MARK (quotesinglbase)
146 0x92 0222 U+2122 TRADE MARK SIGN
fi 147 0x93 0223 U+FB01 LATIN SMALL LIGATURE FI
fl 148 0x94 0224 U+FB02 LATIN SMALL LIGATURE FL
149 0x95 0225 U+0141 LATIN CAPITAL LETTER L WITH STROKE
OE 150 0x96 0226 U+0152 LATIN CAPITAL LIGATURE OE
Š 151 0x97 0227 U+0160 LATIN CAPITAL LETTER S WITH CARON
Ÿ 152 0x98 0230 U+0178 LATIN CAPITAL LETTER Y WITH DIAERESIS
Z hat 153 0x99 0231 U+017 DLATIN CAPITAL LETTER Z WITH CARON
i 154 0x9a 0232 U+0131 LATIN SMALL LETTER DOTLESS I
l/ 155 0x9b 0233 U+0142 LATIN SMALL LETTER L WITH STROKE
œ 156 0x9c 0234 U+0153 LATIN SMALL LIGATURE OE
š 157 0x9d 0235 U+0161 LATIN SMALL LETTER S WITH CARON
ž 158 0x9e 0236 U+017E LATIN SMALL LETTER Z WITH CARON
159 0x9f 0237 Undefined U
160 0xa0 0240 U+20AC EURO SIGN
¡ 161 0xa1 0241 U+00A1 INVERTED EXCLAMATION MARK
¢ 162 0xa2 0242 U+00A2 CENT SIGN
£ 163 0xa3 0243 U+00A3 POUND SIGN (sterling)
¤ 164 0xa4 0244 U+00A4 CURRENCY SIGN
¥ 165 0xa5 0245 U+00A5 YEN SIGN
¦ 166 0xa6 0246 U+00A6 BROKEN BAR
§ 167 0xa7 0247 U+00A7 SECTION SIGN
¨ 168 0xa8 0250 U+00A8 DIAERESIS
© 169 0xa9 0251 U+00A9 COPYRIGHT SIGN
ª 170 0xaa 0252 U+00AA FEMININE ORDINAL INDICATOR
« 171 0xab 0253 U+00AB LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
¬ 172 0xac 0254 U+00AC NOT SIGN
173 0xad 0255 Undefined U
® 174 0xae 0256 U+00AE REGISTERED SIGN
¯ 175 0xaf 0257 U+00AF MACRON
° 176 0xb0 0260 U+00B0 DEGREE SIGN
± 177 0xb1 0261 U+00B1 PLUS-MINUS SIGN
² 178 0xb2 0262 U+00B2 SUPERSCRIPT TWO
³ 179 0xb3 0263 U+00B3 SUPERSCRIPT THREE
´ 180 0xb4 0264 U+00B4 ACUTE ACCENT
µ 181 0xb5 0265 U+00B5 MICRO SIGN
182 0xb6 0266 U+00B6 PILCROW SIGN
· 183 0xb7 0267 U+00B7 MIDDLE DOT
¸ 184 0xb8 0270 U+00B8 CEDILLA
¹ 185 0xb9 0271 U+00B9 SUPERSCRIPT ONE
º 186 0xba 0272 U+00BA MASCULINE ORDINAL INDICATOR
» 187 0xbb 0273 U+00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
¼ 188 0xbc 0274 U+00BC VULGAR FRACTION ONE QUARTER
½ 189 0xbd 0275 U+00BD VULGAR FRACTION ONE HALF
¾ 190 0xbe 0276 U+00BE VULGAR FRACTION THREE QUARTERS
¿ 191 0xbf 0277 U+00BF INVERTED QUESTION MARK
À 192 0xc0 0300 U+00C0
Á 193 0xc1 0301 U+00C1
 194 0xc2 0302 U+00C2
à 195 0xc3 0303 U+00C3
Ä 196 0xc4 0304 U+00C4
Å 197 0xc5 0305 U+00C5
Æ 198 0xc6 0306 U+00C6
Ç 199 0xc7 0307 U+00C7
È 200 0xc8 0310 U+00C8
É 201 0xc9 0311 U+00C9
Ê 202 0xca 0312 U+00CA
Ë 203 0xcb 0313 U+00CB
Ì 204 0xcc 0314 U+00CC
Í 205 0xcd 0315 U+00CD
Î 206 0xce 0316 U+00CE
Ï 207 0xcf 0317 U+00CF
Ð 208 0xd0 0320 U+00D0
Ñ 209 0xd1 0321 U+00D1
Ò 210 0xd2 0322 U+00D2
Ó 211 0xd3 0323 U+00D3
Ô 212 0xd4 0324 U+00D4
Õ 213 0xd5 0325 U+00D5
Ö 214 0xd6 0326 U+00D6
× 215 0xd7 0327 U+00D7
Ø 216 0xd8 0330 U+00D8
Ù 217 0xd9 0331 U+00D9
Ú 218 0xda 0332 U+00DA
Û 219 0xdb 0333 U+00DB
Ü 220 0xdc 0334 U+00DC
Ý 221 0xdd 0335 U+00DD
Þ 222 0xde 0336 U+00DE
ß 223 0xdf 0337 U+00DF
à 224 0xe0 0340 U+00E0
á 225 0xe1 0341 U+00E1
â 226 0xe2 0342 U+00E2
ã 227 0xe3 0343 U+00E3
ä 228 0xe4 0344 U+00E4
å 229 0xe5 0345 U+00E5
æ 230 0xe6 0346 U+00E6
ç 231 0xe7 0347 U+00E7
è 232 0xe8 0350 U+00E8
é 233 0xe9 0351 U+00E9
ê 234 0xea 0352 U+00EA
ë 235 0xeb 0353 U+00EB
ì 236 0xec 0354 U+00EC
í 237 0xed 0355 U+00ED
î 238 0xee 0356 U+00EE
ï 239 0xef 0357 U+00EF
ð 240 0xf0 0360 U+00F0
ñ 241 0xf1 0361 U+00F1
ò 242 0xf2 0362 U+00F2
ó 243 0xf3 0363 U+00F3
ô 244 0xf4 0364 U+00F4
õ 245 0xf5 0365 U+00F5
ö 246 0xf6 0366 U+00F6
÷ 247 0xf7 0367 U+00F7
ø 248 0xf8 0370 U+00F8
ù 249 0xf9 0371 U+00F9
ú 250 0xfa 0372 U+00FA
û 251 0xfb 0373 U+00FB
ü 252 0xfc 0374 U+00FC
ý 253 0xfd 0375 U+00FD
þ 254 0xfe 0376 U+00FE
ÿ 255 0xff 0377 U+00FF

D.4 专家集和 MacExpertEncoding

D.4 Expert Set and MacExpertEncoding

CHAR NAME CODE CHAR NAME CODE
æ AEsmall 276 j Jsmall 152
á Aacutesmall 207 k Ksmall 153
â Acircumflexsmall 211 ł Lslashsmall 302
´ Acutesmall 047 l Lsmall 154
ä Adieresissmall 212 ¯ Macronsmall 364
à Agravesmall 210 m Msmall 155
å Aringsmall 214 n Nsmall 156
a Asmall 141 ñ Ntildesmall 226
ã Atildesmall 213 œ OEsmall 317
˘ Brevesmall 363 ó Oacutesmall 227
b Bsmall 142 ô Ocircumflexsmall 231
ˇ Caronsmall 256 ö Odieresissmall 232
ç Ccedillasmall 215 ˛ Ogoneksmall 362
¸ Cedillasmall 311 ò Ogravesmall 230
ˆ Circumflexsmall 136 ø Oslashsmall 277
c Csmall 143 o Osmall 157
¨ Dieresissmall 254 õ Otildesmall 233
˙ Dotaccentsmall 372 p Psmall 160
d Dsmall 144 q Qsmall 161
é Eacutesmall 216 ˚ Ringsmall 373
ê Ecircumflexsmall 220 r Rsmall 162
ë Edieresissmall 221 š Scaronsmall 247
è Egravesmall 217 s Ssmall 163
e Esmall 145 þ Thornsmall 271
ð Ethsmall 104 ˜ Tildesmall 176
f Fsmall 146 t Tsmall 164
` Gravesmall 140 ú Uacutesmall 234
g Gsmall 147 û Ucircumflexsmall 236
h Hsmall 150 ü Udieresissmall 237
˝ Hungarumlautsmall 042 ù Ugravesmall 235
í Iacutesmall 222 u Usmall 165
î Icircumflexsmall 224 v Vsmall 166
ï Idieresissmall 225 w Wsmall 167
ì Igravesmall 223 x Xsmall 170
i Ismall 151 ý Yacutesmall 264
ÿ Ydieresissmall 330 4 fouroldstyle 064
y Ysmall 171 foursuperior 335
ž Zcaronsmall 275 fraction 057
z Zsmall 172 - hyphen 055
& ampersandsmall 046 - hypheninferior 137
a asuperior 201 - hyphensuperior 321
b bsuperior 365 i isuperior 351
¢ centinferior 251 l lsuperior 361
¢ centoldstyle 043 m msuperior 367
¢ centsuperior 202 nineinferior 273
: colon 072 9 nineoldstyle 071
colonmonetary 173 ninesuperior 341
, comma 054 nsuperior 366
, commainferior 262 onedotenleader 053
, commasuperior 370 oneeighth 112
\(</td> <td>dollarinferior</td> <td> 266</td> <td> 1</td> <td>onefitted</td> <td>174</td> </tr> <tr><td>\) dollaroldstyle 044 ½ onehalf 110
$ dollarsuperior 045 oneinferior 301
d dsuperior 353 1 oneoldstyle 061
eightinferior 245 ¼ onequarter 107
8 eightoldstyle 070 ¹ onesuperior 332
eightsuperior 241 onethird 116
e esuperior 344 o osuperior 257
¡ exclamdownsmall 326 parenleftinferior 133
! exclamsmall 041 parenleftsuperior 050
ff 126 parenrightinferior 135
ffi 131 parenrightsuperior 051
ffl 132 . period 056
fi 127 . periodinferior 263
figuredash 320 . periodsuperior 371
fiveeighths 114 ¿ questiondownsmall 300
fiveinferior 260 ? questionsmall 077
5 fiveoldstyle 065 r rsuperior 345
fivesuperior 336 Rp rupiah 175
fl 130 ; semicolon 073
fourinferior 242 seveneighths 115
seveninferior 246 threequartersemdash 075
7 sevenoldstyle 067 ³ threesuperior 334
sevensuperior 340 t tsuperior 346
sixinferior 244 twodotenleader 052
6 sixoldstyle 066 twoinferior 252
sixsuperior 337 2 twooldstyle 062
space 040 ² twosuperior 333
s ssuperior 352 twothirds 117
threeeighths 113 zeroinferior 274
threeinferior 243 0 zerooldstyle 060
3 threeoldstyle 063 zerosuperior 342
¾ threequarters 111

D.5 符号集和编码

D.5 Symbol Set and Encoding

123

123

123

D.6 ZapfDingbats 集和编码

D.6 ZapfDingbats Set and Encoding

123

123