UTF-8

From Valve Developer Community

UTF-8 is a variable-width encoding of Unicode text that uses 8-bit code units. It is backwards-compatible with ASCII: code points 0 through 127 are each encoded as a single byte with the same value as in ASCII, so any pure ASCII text is already valid UTF-8. Every code point above 127 requires two to four bytes. For example, the basic Cyrillic letters take two bytes each, and most Japanese characters take three.
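
The byte counts described above can be checked with a short Python sketch. This is an illustration only, not something taken from the Source SDK; the helper name utf8_length and the sample characters are assumptions chosen for the example.

```python
def utf8_length(text: str) -> int:
    """Return the number of bytes needed to store `text` as UTF-8.

    Hypothetical helper for illustration; it relies only on Python's
    built-in str.encode.
    """
    return len(text.encode("utf-8"))


# One sample character from each length class (illustrative choices).
samples = {
    "A": "Latin letter, code point below 128",
    "Я": "Cyrillic letter",
    "日": "Japanese kanji",
    "😀": "emoji outside the Basic Multilingual Plane",
}

for char, description in samples.items():
    print(f"U+{ord(char):04X} ({description}): {utf8_length(char)} byte(s)")

# ASCII compatibility: pure ASCII text produces identical bytes
# whether it is encoded as ASCII or as UTF-8.
assert "Hello".encode("utf-8") == "Hello".encode("ascii")
```

Because the length in bytes can differ from the length in characters, buffer sizes for UTF-8 text should be computed from the encoded bytes rather than from the character count.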

See also

Joel Spolsky's "The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)"

