Unicode Character Inspector

Paste any text to see each character's Unicode code point, HTML entity, and UTF-8 and UTF-16 byte encodings. Great for debugging odd characters.

Each row is one Unicode code point (so an emoji made of several code points shows as several rows). The code point is the U+ value; the HTML entity works in web pages; UTF-8 and UTF-16 are the byte sequences used to store the character. Character names are not shown (that needs a large database).

Frequently asked questions

What does this show for each character?

For every Unicode code point in your text it shows the character itself, its code point (the U+ value), the decimal number, the numeric HTML entity, and the raw bytes in UTF-8 and UTF-16. It is handy for debugging stray or invisible characters.

Why does one emoji sometimes show several rows?

Many emoji are sequences of several code points (for example a base emoji plus a skin-tone modifier, joined by a zero-width joiner). The tool lists each code point on its own row, which is exactly how the text is encoded.

Why are character names not shown?

The official Unicode name for every character (like LATIN SMALL LETTER A) needs a large lookup database that would bloat the page. This tool focuses on code points and encodings, which run instantly with no data download.