Windows code page: Difference between revisions

Content deleted Content added
mNo edit summary
History: Fix plurality
Line 61:
Initially, computer systems and system programming languages did not make a distinction between [[character (computing)|character]]s and [[byte]]s: for the [[List of writing systems#Segmental script|segmental scripts]] used in most of Africa, the Americas, southern and south-east Asia, the Middle East and Europe, a character needs just one byte, but two or more bytes are needed for the [[ideographic]] sets used in the rest of the world. This subsequently led to much confusion. Microsoft software and systems prior to the [[Windows NT]] line are examples of this, because they use the OEM and ANSI code pages that do not make the distinction.
 
Since the late 1990s, software and systems have adopted [[Unicode]] as their preferred storage format; this trend has been improved by the widespread adoption of [[XML]] which defaultdefaults to [[UTF-8]] but also provides a mechanism for labelling the encoding used.<ref>{{cite web
|url=http://www.w3.org/TR/xml11/#charencoding
|title=Extensible Markup Language (XML) 1.1 (Second Edition): Character encodings