UTF-8: Difference between revisions
Cargo Cult (talk | contribs) (Tweaks...) |
Thunder4ik (talk | contribs) m (Unicodifying, replaced: See Also → See also, added underlinked tag) |
(3 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
{{ | {{Underlinked|date=January 2024}} | ||
UTF-8 is a way to encode [[Unicode]] text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two | |||
'''UTF-8''' is a way to encode [[Unicode]] text in 8-bit chunks. It has the advantage of being backwards-compatible with ASCII, but will use two to four bytes to encode higher Unicode characters such as Cyrillic or Japanese text. (That is, any code point of greater value than 127 will require at least two bytes.) | |||
== See | == See also == | ||
[http://www.joelonsoftware.com/articles/Unicode.html Joel Spolsky's "The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)"] | [http://www.joelonsoftware.com/articles/Unicode.html Joel Spolsky's "The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)"] | ||
[[ | [[Category:Glossary]] | ||
{{stub}} |
Latest revision as of 15:06, 7 January 2024

UTF-8 is a way to encode Unicode text in 8-bit chunks. It is backwards-compatible with ASCII: code points 0 through 127 are each encoded as a single byte identical to the ASCII byte. Any code point above 127 requires two to four bytes, which is the case for Cyrillic or Japanese text.
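The byte counts described above can be checked with a short Python sketch. The helper name `utf8_length` and the sample characters are chosen here purely for illustration; the only library call used is the built-in `str.encode`.

```python
def utf8_length(ch: str) -> int:
    """Return the number of bytes UTF-8 uses to encode the string ch."""
    return len(ch.encode("utf-8"))


# Illustrative samples: an ASCII letter, a Cyrillic letter,
# a Japanese kanji, and an emoji outside the Basic Multilingual Plane.
samples = ["A", "Я", "日", "😀"]

lengths = {ch: utf8_length(ch) for ch in samples}

for ch, n in lengths.items():
    # ord() gives the code point; values above 127 need more than one byte.
    print(f"U+{ord(ch):04X} {ch!r}: {n} byte(s)")

# ASCII text is byte-for-byte identical in UTF-8.
ascii_roundtrip = "Hello".encode("utf-8") == "Hello".encode("ascii")
```

With these samples the lengths come out as 1, 2, 3 and 4 bytes respectively, and `ascii_roundtrip` is `True`, which is what backwards compatibility with ASCII means in practice.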
See also