Character encodings in HTML: Difference between revisions

Content deleted Content added
Dataphile (talk | contribs)
m spot caps, avoid redirect, wikify
Shlomital (talk | contribs)
Note on CERs in XML
Line 32:
 
Note that unnecessary use of HTML character references may significantly reduce the readability of HTML. If the character encoding for a web page is chosen appropriately then HTML character references are usually only required for a few special characters. The characters '''&''' and '''<''' always need to be encoded, as noted above.
 
In [[XML]] there are five built-in character entity references:
 
* & = & (ampersand, U+0026)
* < = < (left angle bracket, less-than sign, U+003C)
* > = > (right angle bracket, greater-than sign, U+003E)
* " = " (quotation mark, U+0022)
* ' = ' (apostrophe, U+0027)
 
All other character entity references have to be defined before they can be used. For example, use of é (which gives é, Latin small letter E with acute, U+0039, in HTML) in an XML document will generate an error unless the entity has already been defined.
 
==External link==