Unicode and HTML: Difference between revisions

Content deleted Content added
m typo(s) fixed: For example → For example, (2) using AWB
Encoding defaults: code pages are for alphabets, not languages/locates
Line 49:
 
===Encoding defaults===
An encoding default applies when there is no external or internal encoding declaration and also no Byte order mark. While the encoding default for HTML pages served as XML is required to be UTF-8, the encoding default for a regular Web page (that is: for HTML pages serialized as <code>text/html</code>) varies depending on the localization of the browser. For a system set up mainly for Western European languages, it will generally be [[ISO 8859-1#Windows-1252|Windows-1252]]. For theCyrillic Russianalphabet localelocales, the default is typically [[Windows-1251]]. For a browser from a ___location where ''legacy'' multi-byte character encodings are prevalent, some form of auto-detection is likely to be applied.
 
===Encoding trends===