Encoding Explorer
Explore how text is represented in Unicode and in different encodings. All byte values are shown in hex.
Unicode Code Points
This shows the number assigned by the Unicode standard to each character. Learn more
UTF-8
Encodes each code point into a sequence of one to four 8-bit units. ASCII characters are represented using a single byte, which makes it pretty efficient for typical text. Learn more
UTF-16 (Big-Endian)
Encodes each code point into a sequence of one to two 16-bit units. Characters outside the Basic Multilingual Plane require a "surrogate pair" of two units (4 bytes total). Learn more
UTF-32 (Big-Endian)
A fixed-width encoding that uses exactly 4 bytes for every character. Learn more