This page describes what character encodings are and how they should be used. |
Example of a Japanese text encoded in Shift_JIS (requires appropriate fonts available): | |
▶ | ひらがなカタカナ漢字 |
The byte sequence generated from the previous example in Shift_JIS: | |
▶ | 82 D0 82 E7 82 AA 82 C8 83 4A 83 5E 83 4A 83 69 8A BF 8E 9A |
What happens if the byte sequence is misinterpreted as being ISO 8859-1 (Latin-1) instead of Shift_JIS: | |
▶ | ‚Ђ炪‚ȃJƒ^ƒJƒiŠ¿Žš |
Encoding | Hexadecimal representation |
---|---|
UTF-8 | EF BB BF |
UTF-16 (big endian) | FE FF |
UTF-16 (little endian) | FF FE |
UTF-32 (big endian) | 00 00 FE FF |
UTF-32 (little endian) | FF FE 00 00 |