Numeric character reference: Difference between revisions

Content deleted Content added
minor additions to intro, to make a better summary
m Restrictions: wiki linking
Line 37:
== Restrictions ==
 
[[ISO 10646]] (theThe Universal Character Set) defined by ISO 10646 is the "document character set" of SGML, HTML 4, so by default, any character in such a document, and any character ''referenced'' in such a document, must be in the UCS.
 
While the syntax of SGML does not prohibit unassigned code points such as  from being referenced, SGML-derived markup languages such as HTML and XML can, and often do, restrict numeric character references to reference only those code points that have not been assigned to characters (rather, code points not permanently unassigned).