Revision as of 01:07, 5 August 2024 edit Padgriffin (talk \| contribs) Extended confirmed users, Page movers, New page reviewers, Pending changes reviewers, Rollbackers 20,935 edits →Compatibility issues: Copyedit extremely confusing sentence Tags: Mobile edit Mobile web edit Advanced mobile edit ← Previous edit		Revision as of 00:36, 15 August 2024 edit undo Polygnotus (talk \| contribs) Extended confirmed users, File movers, Rollbackers 33,730 edits howver → however Next edit →
Line 27: ===Processing time=== Text with variable-length encoding such as UTF-8 or UTF-16 is harder to process if there is a need to work with individual code units as opposed to working with code points. Searching is unaffected by whether the characters are variably sized since a search for a sequence of code units does not care about the divisions. ~~Howver~~However, it does require that the encoding be [[self-synchronizing code\|self-synchronizing]], which both UTF-8 and UTF-16 are. A common misconception is that there is a need to "find the ''n''th character" and that this requires a fixed-length encoding; however, in real use the number ''n'' is only derived from examining the {{nowrap\|''n−1''}} characters, thus sequential access is needed anyway.{{Citation needed\|date=October 2013}} Efficiently using character sequences in one [[endianness\|endian order]] loaded onto a machine with a different endian order requires extra processing. Characters may either be converted before use or processed with two distinct systems. Byte-based encodings such as UTF-8 do not have this problem.{{why\|date=July 2024}} [[UTF-16BE]] and [[UTF-32BE]] are [[endianness\|big-endian]], [[UTF-16LE]] and [[UTF-32LE]] are [[endianness\|little-endian]].

Comparison of Unicode encodings: Difference between revisions