Comparison of Unicode encodings: Difference between revisions

8 bit environments: saving work in progress
Line 19:
|010000 - 10FFFF||4||4||4||4
|}
 
<!--
===7 bit environments===
{| {{prettytable}}
|code range (hexadecimal)||[[UTF-7]]||[[UTF-8]] [[quoted printable]]||UTF-8 [[base64]]||[[UTF-16]] quoted printable||UTF-16 base64||[[UTF-32]] quoted printable||UTF-32 base64||[[GB18030]] quoted printable||[[GB18030]] base64
|-
|000000 - 00001F||same as 000080-00FFFF||3||1&#x2153;||6||2&#x2154;||12||5⅓||3||1&#x2153;
|-
|000020 - 00007F||Mostly 1, depending partly on whether the encoder chooses to use the "optional direct characters"; 2 for +; otherwise same as 000080-00FFFF||1||1&#x2153;||4||2&#x2154;||10||5⅓||1||1&#x2153;
|-
|000080 - 0007FF||rowspan=2|5 for an isolated character inside a run of single-byte characters. For runs, 2&#x2154; per character, plus padding to make it a whole number of bytes, plus two to start and finish the run||6||2&#x2154;||rowspan=2|2-6 depending on whether the byte values need to be escaped||2⅔||rowspan=3|8-12 depending on whether the final two byte values need to be escaped||5⅓||rowspan=2|4-6 for characters inherited from [[GB2312]]/[[GBK]] (e.g.<br>most Chinese characters), 6-10 for everything else||rowspan=2|2&#x2154; for characters inherited from [[GB2312]]/[[GBK]] (e.g.<br>most Chinese characters), 5⅓ for everything else
|-
|000800 - 00FFFF||9||4||2⅔||5⅓
|-
|010000 - 10FFFF||same as two characters from above||12||5⅓||8-12 depending on whether the low bytes of the surrogates need to be escaped||5⅓||5⅓||6-10||5⅓
|}
-->
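
The per-character figures in the draft table can be checked with a short calculation. The following is a minimal sketch rather than a reference implementation. It assumes a simplified quoted-printable model in which printable ASCII other than "=" stays literal and every other byte costs three characters, and it treats base64 as a flat 4/3 ratio with padding ignored. The function names <code>qp_size</code>, <code>base64_size</code> and <code>costs</code> are hypothetical, chosen only for this illustration, and UTF-7 is left out because its cost depends on run length.

```python
from fractions import Fraction


def qp_size(raw: bytes) -> int:
    """Size of raw under a simplified quoted-printable model.

    Assumption: bytes 0x20-0x7E other than '=' (0x3D) are written
    literally; every other byte becomes a three-character =XX escape.
    Line-length limits and trailing-space rules are ignored.
    """
    return sum(1 if 0x20 <= b <= 0x7E and b != 0x3D else 3 for b in raw)


def base64_size(raw: bytes) -> Fraction:
    """Average base64 cost: 4 output characters per 3 input bytes.

    Padding is ignored, so the result can be fractional.
    """
    return Fraction(4, 3) * len(raw)


def costs(char: str) -> dict:
    """Map each codec name to (quoted-printable size, base64 size)."""
    table = {}
    for codec in ("utf-8", "utf-16-be", "utf-32-be", "gb18030"):
        raw = char.encode(codec)
        table[codec] = (qp_size(raw), base64_size(raw))
    return table


if __name__ == "__main__":
    # One sample character from each code range in the table.
    for ch in ("A", "\u00e9", "\u20ac", "\U00010348"):
        print(f"U+{ord(ch):06X}", costs(ch))
```

Under these assumptions, "A" costs 1, 4 and 10 bytes in quoted-printable UTF-8, UTF-16 and UTF-32 respectively, which agrees with the ASCII row of the table.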