Content deleted Content added
→A with ring above example: it was a cockup, not a conspiracy Tags: Mobile edit Mobile web edit Advanced mobile edit Reply |
|||
(11 intermediate revisions by 9 users not shown) | |||
Line 1:
{{WikiProject banner shell|class=C|
{{WikiProject Typography
|importance = Mid
}}
{{WikiProject Computing
|importance=Mid
}}
}}
Line 50:
I suggest to use this information in a way to improve the article, without making wikipedia article any «how to use wikipedia». [[Special:Contributions/86.75.160.141|86.75.160.141]] ([[User talk:86.75.160.141|talk]]) 19:14, 11 October 2012 (UTC)
I found this article looking for why 𝓌𝒾𝓀𝒾𝓅𝓮𝒹𝒾𝒶.org seemed to land me on wikipedia.org - might be a good illustrating example. [[Special:Contributions/155.94.127.118|155.94.127.118]] ([[User talk:155.94.127.118|talk]]) 23:36, 4 September 2019 (UTC)
==Well-formedness==
"Well-formedness" refers to whether the sequences of 8-bit, 16-bit or 32-bit storage units properly define a sequence of characters (technically, 'scalar values'). Having combining characters without base characters makes a string 'defective'. There are other faults in a well-formed string that have no name, such as broken Hangul syllable blocks, characters in the wrong order (not all scripts have been set up so that canonical equivalence will 'eliminate' ambiguities), and variation selectors in the wrong places. [[User:RichardW57|RichardW57]] ([[User talk:RichardW57|talk]]) 00:49, 17 June 2014 (UTC)
== External links modified ==
Hello fellow Wikipedians,
I have just added archive links to {{plural:1|one external link|1 external links}} on [[Unicode equivalence]]. Please take a moment to review [https://en.wikipedia.org/w/index.php?diff=prev&oldid=700471573 my edit]. If necessary, add {{tlx|cbignore}} after the link to keep me from modifying it. Alternatively, you can add {{tlx|nobots|deny{{=}}InternetArchiveBot}} to keep me off the page altogether. I made the following changes:
*Added archive https://web.archive.org/20100109162824/http://forums.macosxhints.com:80/archive/index.php/t-99344.html to http://forums.macosxhints.com/archive/index.php/t-99344.html
When you have finished reviewing my changes, please set the ''checked'' parameter below to '''true''' to let others know.
{{sourcecheck|checked=false}}
Cheers.—[[User:Cyberbot II|<sup style="color:green;font-family:Courier;">cyberbot II</sup>]]<small><sub style="margin-left:-14.9ex;color:green;font-family:Comic Sans MS;">[[User talk:Cyberbot II|<span style="color:green;">Talk to my owner</span>]]:Online</sub></small> 18:31, 18 January 2016 (UTC)
== Canonicality ==
Currently the article states (under [[Unicode equivalence#Combining and precomposed characters|Combining and precomposed characters]]) that "In general, precomposed characters are defined to be canonically equivalent to the sequence of their base letter and subsequent combining diacritic marks, in whatever order these may occur", but also (under [[Unicode equivalence#Canonical ordering|Canonical ordering]]) that the canonical decomposition (of U+1EBF) U+0065 U+0302 U+0301 "is not equivalent with U+0065 U+0301 U+0302". Either this is a contradiction and should be fixed, or some further clarification would help.
== German Article ==
There seems to be a German version, which is not linked: https://de.wikipedia.org/wiki/Normalisierung_(Unicode) [[User:Skillabstinenz|Skillabstinenz]] ([[User talk:Skillabstinenz|talk]]) 20:22, 23 June 2020 (UTC)
== Naming ==
: Why do we have this article named as «'''''Unicode equivalence'''''» instead of «'''''Unicode normalization forms'''''»? It's confusing.
<span style="font-weight: bold" >[[User:Alexander_Davronov|<span style="color:#a8a8a8;">AXO</span><span style="color:#000">NOV</span>]] [[User talk:Alexander_Davronov|(talk)]] [[Special:Contributions/Alexander_Davronov|⚑]]</span> 10:43, 21 January 2022 (UTC)
== A with ring above example ==
The text has:
<blockquote>For example, the distinct Unicode strings "U+212B" (the angstrom sign "Å") and "U+00C5" (the Swedish letter "Å")</blockquote>
But the two "A with ring above" characters appear to be the same Unicode character. This defeats the effectiveness of the example, for those who care to inspect the characters carefully (eg by copy-and-paste into some Unicode inspector tool). I would have to look into the article history in detail to see when these two characters were made the same. [[User:Cmcqueen1975|Cmcqueen1975]] ([[User talk:Cmcqueen1975|talk]]) 06:53, 29 October 2024 (UTC)
:I didn't write the original text but have rewritten the section to clarify. Does that respond to your concern?
:I think the intention was to convey that it doesn't matter which of the two codepoints you use, since they are canonically equivalent. (Which is a face-saving way of admitting that someone boobed way back. A codepoint for angstrom sign should never have been created but what's done is done.) [[User:JMF|𝕁𝕄𝔽]] ([[User talk:JMF|talk]]) 08:01, 29 October 2024 (UTC)
|