UTF-8: Difference between revisions

Revision as of 03:01, 17 June 2008

Stub

This article or section is a stub. You can help by expanding it.

UTF-8 is a way to encode Unicode text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two to four bytes to encode higher Unicode characters such as Cyrillic or Japanese text. (That is, any code point of greater value than 127 will require at least two bytes.)

@@ Line 1: / Line 1: @@
 {{stub}}
-UTF-8 is a way to encode [[Unicode]] text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two, three or more bytes to encode higher Unicode characters such as Cyrillic or Japanese text.
+UTF-8 is a way to encode [[Unicode]] text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two to four bytes to encode higher Unicode characters such as Cyrillic or Japanese text. (That is, any code point of greater value than 127 will require at least two bytes.)
 == See Also ==

UTF-8: Difference between revisions

Revision as of 03:01, 17 June 2008

See Also

Navigation menu

Search