UTF-8: Difference between revisions
Jump to navigation
Jump to search
PasouCcoch (talk | contribs) (oucorbole) |
m (Reverted edit of PasouCcoch, changed back to last version by Coldacid) |
||
Line 1: | Line 1: | ||
{{stub}} | {{stub}} | ||
UTF-8 is a way to encode [[Unicode]] text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two to four bytes to encode higher Unicode characters such as Cyrillic or Japanese text. (That is, any code point of greater value than 127 will require at least two bytes.) | UTF-8 is a way to encode [[Unicode]] text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two to four bytes to encode higher Unicode characters such as Cyrillic or Japanese text. (That is, any code point of greater value than 127 will require at least two bytes.) |
Revision as of 22:19, 3 January 2009
UTF-8 is a way to encode Unicode text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two to four bytes to encode higher Unicode characters such as Cyrillic or Japanese text. (That is, any code point of greater value than 127 will require at least two bytes.)