Numeric character reference: Difference between revisions

Content deleted Content added
Compatibility issues: I don't understand much in this paragraph, but I do know that the left double quotation mark is “ (“), not " ("). More knowledgeable editors may find other errors.
Atighot (talk | contribs)
Restrictions: Added reference
Line 278:
While the syntax of SGML does not prohibit references to invalid or unassigned code points, such as <code>&amp;#xFFFF;</code>, SGML-derived markup languages such as HTML and XML can, and often do, restrict numeric character references to only those code points that are assigned to characters.
 
Restrictions may also apply for other reasons. For example, in HTML 4, <code>&amp;#12;</code>, which is a reference to a non-printing "form feed" control character, is allowed because a form feed character is allowed. But in XML, the form feed character cannot be used, not even by reference.<ref>{{cite web |title=HTML 5.2: 8. The HTML syntax |url=https://www.w3.org/TR/2017/WD-html52-20170228/syntax.html |website=www.w3.org}}</ref>{{Citation needed|date=May 2013}} As another example, <code>&amp;#128;</code>, which is a reference to another control character, is not allowed to be used or referenced in either HTML or XML, but when used in HTML, it is usually not flagged as an error by web browsers – some of which interpret it as a reference to the character represented by code value 128 in the [[Windows-1252]] encoding for compatibility reasons. This character, "€", has to be represented as <code>&amp;#8364;</code> in a standard-compliant HTML code. As a further example, prior to the publication of XML 1.0 Second Edition on October 6, 2000, XML 1.0 was based on an older version of ISO 10646 and prohibited using characters above U+FFFD, except in character data, thus making a reference like <code>&amp;#65536;</code> (U+10000) illegal. In XML 1.1 and newer editions of XML 1.0, such a reference is allowed, because the available character repertoire was explicitly extended.
 
Markup languages also place restrictions on where character references can occur.