UTF-8: Difference between revisions

From Valve Developer Community
Jump to navigation Jump to search
(Point people to Joel's about-Unicode page)
(Tweaks...)
Line 1: Line 1:
{{stub}}
{{stub}}
UTF-8 is a way to encode [[Unicode]] text in 8-byte chunks.
UTF-8 is a way to encode [[Unicode]] text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two, three or more bytes to encode higher Unicode characters such as Cyrillic or Japanese text.


== See Also ==
== See Also ==

Revision as of 00:50, 12 August 2005

Stub

This article or section is a stub. You can help by expanding it.

UTF-8 is a way to encode Unicode text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two, three or more bytes to encode higher Unicode characters such as Cyrillic or Japanese text.

See Also

Joel Spolsky's "The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)"