Unicode control characters: Difference between revisions

Content deleted Content added
Category "Cc" control codes (C0 and C1): unichar template is broken for control characters
Unicode introduced separators: remove duplicate links and useless html
Line 20:
 
== Unicode introduced separators ==
In an attempt to simplify the several [[newline]] characters used in legacy text{{citation needed|date=November 2014}}, Unicode introduces its own newline characters to separate either lines or paragraphs: {{unichar|2028|line separator|html=}} (abbreviated {{ctrl|LS}} or {{ctrl|LSEP}}) and {{unichar|2029|paragraph separator|html=}} (abbreviated {{ctrl|PS}} or {{ctrl|PSEP}}).
 
Like CR and LF, LS and PS are effectors for text formatting; unlike CR and LF, they are not treated as "control codes" for [[ECMA-35]]/[[ECMA-48]] purposes (category {{code|Cc}}), rather having semantics defined entirely by Unicode itself. They are assigned to ''[[sui generis]]'' [[Unicode character property#General Category|Unicode categories]] {{code|Zl}} and {{code|Zp}} respectively, under the major category {{code|Z}} (separator) used for certain [[whitespace character]]s.