Min Hex Hex Max Binary Byte Sequence 00000000 0000007F 0vvvvvvv 00000080 000007FF 110vvvvv 10vvvvvv 00000600 0000FFFF 1110vvvv 10vvvvvv 10vvvvvv 00010000 001FFFFF 11110vvv 10vvvvvv 10vvvvvv 10vvvvvv 00200000 03FFFFFF 111110vv 10vvvvvv 10vvvvvv 10vvvvvv 10vvvvvv 04000000 7FFFFFFF 1111110v 10vvvvvv 10vvvvvv 10vvvvvv 10vvvvvv 10vvvvvvFor example, the greek character s (sigma) has the hex Unicode code 03C3, which is 0000 0011 1100 0011 in binary. 03C3 is between 0080 and 07FF, so in UTF-8, this becomes 11001111 10000011 = 1100 1111 1000 0011 = CF 83 hex. The red bits are from the unicode code 011 1100 0011 and the green bits are the special UTF-8 designations.
UTF-8 | EF BB BF | UTF-32 Big Endian | 00 00 FE FF | |
UTF-16 Big Endian | FE FF | UTF-32 Little Endian | FF FE 00 00 | |
UTF-16 Little Endian | FF FE |