Comparison of Unicode encodings: Difference between revisions

No edit summary
Sapphaline (talk | contribs)
In detail: hatnote
 
Line 46:
 
== In detail ==
{{hatnote|The tables below list numbers of bytes per code point, not per user-visible "character" (or "grapheme cluster"). It can take multiple code points to describe a single grapheme cluster, so even in UTF-32, care must be taken when splitting or concatenating strings.}}
 
The tables below list the number of bytes per code point for different Unicode ranges. Any additional comments needed are included in the table. The figures assume that overheads at the start and end of the block of text are negligible.
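
As a rough illustration of the per-code-point figures, the following Python sketch measures how many bytes one code point from each of several ranges occupies in UTF-8, UTF-16 and UTF-32. The helper name <code>bytes_per_code_point</code> and the choice of sample code points are assumptions made for this example only. The explicit little-endian codecs are used so that no byte order mark is counted, which matches the assumption that overheads at the start of the text are negligible.

```python
def bytes_per_code_point(ch: str) -> dict[str, int]:
    """Return the encoded size in bytes of a single code point."""
    return {
        "UTF-8": len(ch.encode("utf-8")),
        # The "-le" codecs fix the byte order, so no BOM is prepended.
        "UTF-16": len(ch.encode("utf-16-le")),
        "UTF-32": len(ch.encode("utf-32-le")),
    }


# One sample per range: U+0041, U+00E9, U+20AC, U+1F600 (illustrative picks).
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]
sizes = {ch: bytes_per_code_point(ch) for ch in samples}

# The generic "utf-16" codec prepends a BOM; this is the start-of-text
# overhead that the tables treat as negligible.
bom_overhead = len("A".encode("utf-16")) - len("A".encode("utf-16-le"))

for ch, row in sizes.items():
    print(f"U+{ord(ch):04X}", row)
print("UTF-16 BOM overhead in bytes:", bom_overhead)
```

In this sketch only the UTF-32 size is the same for every sample; the UTF-8 size grows with the code point value, and UTF-16 switches from one 16-bit unit to two for code points above U+FFFF.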
<blockquote>
'''N.B.''' The tables below list numbers of bytes per ''code point'', ''not'' per user-visible "character" (or "grapheme cluster"). It can take multiple code points to describe a single grapheme cluster, so even in UTF-32, care must be taken when splitting or concatenating strings.
</blockquote>
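
The distinction between code points and grapheme clusters can be sketched in Python, whose <code>str</code> type is indexed by code point. The variable names below are assumptions for illustration. The Python standard library does not provide grapheme cluster segmentation, so the sketch only shows how a naive split goes wrong and does not show a correct splitter.

```python
import unicodedata

# One user-visible character built from two code points:
# U+0065 LATIN SMALL LETTER E + U+0301 COMBINING ACUTE ACCENT.
cluster = "e\u0301"
code_points = len(cluster)                       # counts code points
utf32_bytes = len(cluster.encode("utf-32-le"))   # 4 bytes per code point

# Splitting after the first code point silently drops the accent.
truncated = cluster[:1]

# NFC normalization happens to merge this pair into U+00E9, but many
# clusters have no precomposed form.
composed = unicodedata.normalize("NFC", cluster)

# A flag is two regional indicator code points (here U+1F1EB U+1F1F7)
# and stays two code points under NFC.
flag = "\U0001F1EB\U0001F1F7"
flag_nfc_len = len(unicodedata.normalize("NFC", flag))

print(code_points, utf32_bytes, repr(truncated), len(composed), flag_nfc_len)
```

Even with a fixed-width encoding such as UTF-32, the byte offset of a code point boundary is therefore not necessarily a safe place to cut a string.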
 
=== Eight-bit environments ===