Revision as of 03:10, 28 December 2016 edit Wavelength (talk \| contribs) Extended confirmed users, Pending changes reviewers 179,502 edits removing 1 hyphen: —> "newly created"—WP:HYPHEN, sub-subsection 3, point 4 ← Previous edit		Revision as of 17:37, 6 February 2017 edit undo 119.30.39.13 (talk) No edit summary Tags: Mobile edit Mobile web edit Next edit →
Line 10: In RFC 1866, the initial HTML 2.0 standard, the document character set was defined as ISO-8859-1. It was extended to [[ISO 10646]] (which is basically equivalent to Unicode) by RFC 2070. It does not vary between documents of different languages or created on different platforms. The external character encoding is chosen by the author of the document (or the software the author uses to create the document) and determines how the bytes used to store and/or transmit the document map to characters from the document character set. Characters not present in the chosen external character encoding may be represented by character entity references. The relationship between [[Unicode]] and HTML tends to be a difficult topic for many computer professionals, document authors, and [[World Wide Web\|web]] users alike. The accurate representation of text in [[web page]]s from different [[natural language]]s and [[writing system]]s is complicated by the details of [[character encoding]], [[markup language]] syntax, [[Computer font\|font]], and varying levels of support by [[web browser]]s. == HTML document characters ==

Unicode and HTML: Difference between revisions