HTML 编码(字符集)
要正确显示 HTML 页面,Web 浏览器必须知道要使用哪个字符集。
HTML 字符集属性
字符集在标记中指定:<meta>
1 | <meta charset="UTF-8"> |
HTML5 规范鼓励 Web 开发人员使用 UTF-8 字符集。
UTF-8 几乎涵盖了世界上所有的字符和符号!
[
]
[完整的 UTF-8 参考]
ASCII 字符集
ASCII 是 Web 的第一个字符编码标准。 它定义了 128 个可以在互联网上使用的不同字符:
- 英文字母 (A-Z)
- 数字 (0-9)
- 特殊字符,如 !$ + - ( ) @ < >.
ANSI 字符集
ANSI (Windows-1252) 是原始的 Windows 字符集:
- 与前 127 个字符的 ASCII 相同
- 从 128 到 159 的特殊字符
- 与 UTF-8 相同,从 160 到 255
1 | <meta charset="Windows-1252"> |
ISO-8859-1 字符集
ISO-8859-1 是 HTML 4 的默认字符集。此字符集 支持 256 种不同的字符代码。HTML 4 还支持 UTF-8。
- 与前 127 个字符的 ASCII 相同
- 不使用 128 到 159 之间的字符
- 与 ANSI 和 UTF-8 相同,从 160 到 255
1 | ### HTML 4 示例 |
UTF-8 字符集
- 对于从 0 到 127 的值,与 ASCII 相同
- 不使用 128 到 159 之间的字符
- 与 ANSI 和 8859-1 相同,从 160 到 255
- 从值 256 继续到 10 000 个字符
1 | <meta charset="UTF-8"> |
[完整的 HTML 字符集参考]
字符集之间的差异
下表显示了上述字符集之间的差异:
| Numb | ASCII | ANSI | 8859 | UTF‑8 | Description |
|---|---|---|---|---|---|
| 32 | space | ||||
| 33 | ! |
! |
! |
! |
exclamation mark |
| 34 | " |
" |
" |
" |
quotation mark |
| 35 | # |
# |
# |
# |
number sign |
| 36 | $ |
$ |
$ |
$ |
dollar sign |
| 37 | % |
% |
% |
% |
percent sign |
| 38 | & |
& |
& |
& |
ampersand |
| 39 | ' |
' |
' |
' |
apostrophe |
| 40 | ( |
( |
( |
( |
left parenthesis |
| 41 | ) |
) |
) |
) |
right parenthesis |
| 42 | * |
* |
* |
* |
asterisk |
| 43 | + |
+ |
+ |
+ |
plus sign |
| 44 | , |
, |
, |
, |
comma |
| 45 | - |
- |
- |
- |
hyphen-minus |
| 46 | . |
. |
. |
. |
full stop |
| 47 | / |
/ |
/ |
/ |
solidus |
| 48 | 0 | 0 | 0 | 0 | digit zero |
| 49 | 1 | 1 | 1 | 1 | digit one |
| 50 | 2 | 2 | 2 | 2 | digit two |
| 51 | 3 | 3 | 3 | 3 | digit three |
| 52 | 4 | 4 | 4 | 4 | digit four |
| 53 | 5 | 5 | 5 | 5 | digit five |
| 54 | 6 | 6 | 6 | 6 | digit six |
| 55 | 7 | 7 | 7 | 7 | digit seven |
| 56 | 8 | 8 | 8 | 8 | digit eight |
| 57 | 9 | 9 | 9 | 9 | digit nine |
| 58 | : |
: |
: |
: |
colon |
| 59 | ; |
; |
; |
; |
semicolon |
| 60 | < |
< |
< |
< |
less than |
| 61 | = |
= |
= |
= |
equals sign |
| 62 | > |
> |
> |
> |
greater than |
| 63 | ? |
? |
? |
? |
question mark |
| 64 | @ |
@ |
@ |
@ |
commercial at |
| 65 | A | A | A | A | Latin A |
| 66 | B | B | B | B | Latin B |
| 67 | C | C | C | C | Latin C |
| 68 | D | D | D | D | Latin D |
| 69 | E | E | E | E | Latin E |
| 70 | F | F | F | F | Latin F |
| 71 | G | G | G | G | Latin G |
| 72 | H | H | H | H | Latin H |
| 73 | I | I | I | I | Latin I |
| 74 | J | J | J | J | Latin J |
| 75 | K | K | K | K | Latin K |
| 76 | L | L | L | L | Latin L |
| 77 | M | M | M | M | Latin M |
| 78 | N | N | N | N | Latin N |
| 79 | O | O | O | O | Latin O |
| 80 | P | P | P | P | Latin P |
| 81 | Q | Q | Q | Q | Latin Q |
| 82 | R | R | R | R | Latin R |
| 83 | S | S | S | S | Latin S |
| 84 | T | T | T | T | Latin T |
| 85 | U | U | U | U | Latin U |
| 86 | V | V | V | V | Latin V |
| 87 | W | W | W | W | Latin W |
| 88 | X | X | X | X | Latin X |
| 89 | Y | Y | Y | Y | Latin Y |
| 90 | Z | Z | Z | Z | Latin Z |
| 91 | [ |
[ |
[ |
[ |
left square bracket |
| 92 | ``` | ``` | ``` | ``` | reverse solidus |
| 93 | ] |
] |
] |
] |
right square bracket |
| 94 | ^ |
^ |
^ |
^ |
circumflex accent |
| 95 | _ |
_ |
_ |
_ |
low line |
| 96 | ``` | ``` | ``` | ``` | grave accent |
| 97 | a | a | a | a | Latin small a |
| 98 | b | b | b | b | Latin small b |
| 99 | c | c | c | c | Latin small c |
| 100 | d | d | d | d | Latin small d |
| 101 | e | e | e | e | Latin small e |
| 102 | f | f | f | f | Latin small f |
| 103 | g | g | g | g | Latin small g |
| 104 | h | h | h | h | Latin small h |
| 105 | i | i | i | i | Latin small i |
| 106 | j | j | j | j | Latin small j |
| 107 | k | k | k | k | Latin small k |
| 108 | l | l | l | l | Latin small l |
| 109 | m | m | m | m | Latin small m |
| 110 | n | n | n | n | Latin small n |
| 111 | o | o | o | o | Latin small o |
| 112 | p | p | p | p | Latin small p |
| 113 | q | q | q | q | Latin small q |
| 114 | r | r | r | r | Latin small r |
| 115 | s | s | s | s | Latin small s |
| 116 | t | t | t | t | Latin small t |
| 117 | u | u | u | u | Latin small u |
| 118 | v | v | v | v | Latin small v |
| 119 | w | w | w | w | Latin small w |
| 120 | x | x | x | x | Latin small x |
| 121 | y | y | y | y | Latin small y |
| 122 | z | z | z | z | Latin small z |
| 123 | { |
{ |
{ |
{ |
left curly bracket |
| 124 | | | | | | | | | vertical line |
| 125 | } |
} |
} |
} |
right curly bracket |
| 126 | ~ |
~ |
~ |
~ |
tilde |
| 127 | DEL |
||||
| 128 | € |
euro sign | |||
| 129 | | | | NOT USED | |
| 130 | ‚ |
single low-9 quotation mark | |||
| 131 | ƒ |
Latin small f with hook | |||
| 132 | „ |
double low-9 quotation mark | |||
| 133 | … |
horizontal ellipsis | |||
| 134 | † |
dagger | |||
| 135 | ‡ |
double dagger | |||
| 136 | ˆ |
modifier letter circumflex accent | |||
| 137 | ‰ |
per mille sign | |||
| 138 | Š |
Latin S with caron | |||
| 139 | ‹ |
single left-pointing angle quotation mark | |||
| 140 | Œ |
Latin capital ligature OE | |||
| 141 | | | | NOT USED | |
| 142 | Ž |
Latin Z with caron | |||
| 143 | | | | NOT USED | |
| 144 | | | | NOT USED | |
| 145 | ‘ |
left single quotation mark | |||
| 146 | ’ |
right single quotation mark | |||
| 147 | “ |
left double quotation mark | |||
| 148 | ” |
right double quotation mark | |||
| 149 | • |
bullet | |||
| 150 | – |
en dash | |||
| 151 | — |
em dash | |||
| 152 | ˜ |
small tilde | |||
| 153 | ™ |
trade mark sign | |||
| 154 | š |
Latin small s with caron | |||
| 155 | › |
single right-pointing angle quotation mark | |||
| 156 | œ |
Latin small ligature oe | |||
| 157 | | | | NOT USED | |
| 158 | ž |
Latin small z with caron | |||
| 159 | Ÿ |
Latin Y with diaeresis | |||
| 160 | no-break space | ||||
| 161 | ¡ |
¡ |
¡ |
inverted exclamation mark | |
| 162 | ¢ |
¢ |
¢ |
cent sign | |
| 163 | £ |
£ |
£ |
pound sign | |
| 164 | ¤ |
¤ |
¤ |
currency sign | |
| 165 | ¥ |
¥ |
¥ |
yen sign | |
| 166 | `¦ | `¦ | ¦ |
broken bar | |
| 167 | § |
§ |
§ |
section sign | |
| 168 | ¨ |
¨ |
¨ |
diaeresis | |
| 169 | © |
© |
© |
copyright sign | |
| 170 | ª |
ª |
ª |
feminine ordinal indicator | |
| 171 | « |
« |
« |
left-pointing double angle quotation mark | |
| 172 | ¬ |
¬ |
¬ |
not sign | |
| 173 | | | | soft hyphen | |
| 174 | ® |
® |
® |
registered sign | |
| 175 | ¯ |
¯ |
¯ |
macron | |
| 176 | ° |
° |
° |
degree sign | |
| 177 | ± |
± |
± |
plus-minus sign | |
| 178 | ² |
² |
² |
superscript two | |
| 179 | ³ |
³ |
³ |
superscript three | |
| 180 | ´ |
´ |
´ |
acute accent | |
| 181 | µ |
µ |
µ |
micro sign | |
| 182 | ¶ |
¶ |
¶ |
pilcrow sign | |
| 183 | · |
· |
· |
middle dot | |
| 184 | ¸ |
¸ |
¸ |
cedilla | |
| 185 | ¹ |
¹ |
¹ |
superscript one | |
| 186 | º |
`º | º |
masculine ordinal indicator | |
| 187 | » |
» |
» |
right-pointing double angle quotation mark | |
| 188 | ¼ | ¼ | ¼ | vulgar fraction one quarter | |
| 189 | ½ | ½ | ½ | vulgar fraction one half | |
| 190 | ¾ | ¾ | ¾ | vulgar fraction three quarters | |
| 191 | ¿ | ¿ | ¿ | inverted question mark | |
| 192 | À | À | À | Latin A with grave | |
| 193 | Á | Á | Á | Latin A with acute | |
| 194 | Â | Â | Â | Latin A with circumflex | |
| 195 | Ã | Ã | Ã | Latin A with tilde | |
| 196 | Ä | Ä | Ä | Latin A with diaeresis | |
| 197 | Å | Å | Å | Latin A with ring above | |
| 198 | Æ | Æ | Æ | Latin AE | |
| 199 | Ç | Ç | Ç | Latin C with cedilla | |
| 200 | È | È | È | Latin E with grave | |
| 201 | É | É | É | Latin E with acute | |
| 202 | Ê | Ê | Ê | Latin E with circumflex | |
| 203 | Ë | Ë | Ë | Latin E with diaeresis | |
| 204 | Ì | Ì | Ì | Latin I with grave | |
| 205 | Í | Í | Í | Latin I with acute | |
| 206 | Î | Î | Î | Latin I with circumflex | |
| 207 | Ï | Ï | Ï | Latin I with diaeresis | |
| 208 | Ð | Ð | Ð | Latin Eth | |
| 209 | Ñ | Ñ | Ñ | Latin N with tilde | |
| 210 | Ò | Ò | Ò | Latin O with grave | |
| 211 | Ó | Ó | Ó | Latin O with acute | |
| 212 | Ô | Ô | Ô | Latin O with circumflex | |
| 213 | Õ | Õ | Õ | Latin O with tilde | |
| 214 | Ö | Ö | Ö | Latin O with diaeresis | |
| 215 | × | × | × | multiplication sign | |
| 216 | Ø | Ø | Ø | Latin O with stroke | |
| 217 | Ù | Ù | Ù | Latin U with grave | |
| 218 | Ú | Ú | Ú | Latin U with acute | |
| 219 | Û | Û | Û | Latin U with circumflex | |
| 220 | Ü | Ü | Ü | Latin U with diaeresis | |
| 221 | Ý | Ý | Ý | Latin Y with acute | |
| 222 | Þ | Þ | Þ | Latin Thorn | |
| 223 | ß | ß | ß | Latin small sharp s | |
| 224 | à | à | à | Latin small a with grave | |
| 225 | á | á | á | Latin small a with acute | |
| 226 | â | â | â | Latin small a with circumflex | |
| 227 | ã | ã | ã | Latin small a with tilde | |
| 228 | ä | ä | ä | Latin small a with diaeresis | |
| 229 | å | å | å | Latin small a with ring above | |
| 230 | æ | æ | æ | Latin small ae | |
| 231 | ç | ç | ç | Latin small c with cedilla | |
| 232 | è | è | è | Latin small e with grave | |
| 233 | é | é | é | Latin small e with acute | |
| 234 | ê | ê | ê | Latin small e with circumflex | |
| 235 | ë | ë | ë | Latin small e with diaeresis | |
| 236 | ì | ì | ì | Latin small i with grave | |
| 237 | í | í | í | Latin small i with acute | |
| 238 | î | î | î | Latin small i with circumflex | |
| 239 | ï | ï | ï | Latin small i with diaeresis | |
| 240 | ð | ð | ð | Latin small eth | |
| 241 | ñ | ñ | ñ | Latin small n with tilde | |
| 242 | ò | ò | ò | Latin small o with grave | |
| 243 | ó | ó | ó | Latin small o with acute | |
| 244 | ô | ô | ô | Latin small o with circumflex | |
| 245 | õ | õ | õ | Latin small o with tilde | |
| 246 | ö | ö | ö | Latin small o with diaeresis | |
| 247 | ÷ | ÷ | ÷ | division sign | |
| 248 | ø | ø | ø | Latin small o with stroke | |
| 249 | ù | ù | ù | Latin small u with grave | |
| 250 | ú | ú | ú | Latin small u with acute | |
| 251 | û | û | û | Latin small with circumflex | |
| 252 | ü | ü | ü | Latin small u with diaeresis | |
| 253 | ý | ý | ý | Latin small y with acute | |
| 254 | þ | þ | þ | Latin small thorn | |
| 255 | ÿ | ÿ | ÿ | Latin small y with diaeresis |
搜索
阅读量:
收藏数:
点赞数:


0评论