Talk:Unicode/Archive 7: Difference between revisions

Content deleted Content added
m Archiving 1 discussion(s) from Talk:Unicode) (bot
m Archiving 1 discussion(s) from Talk:Unicode) (bot
Line 468:
::About the ellipsis, the cited [[MOS:ELLIPSIS]] calls for the use of a a non-breaking space before an ellipsis. [[User:Chatul|Shmuel (Seymour J.) Metz Username:Chatul]] ([[User talk:Chatul|talk]]) 21:45, 7 May 2020 (UTC)
:::My interpretation of [[MOS:TM]] is that reliable, independent sources can be used to determine styling (like Ipad vs. iPad) but that ® and ™ are to be avoided except as needed for disambiguation. (Though sources independent of the trademark holder rarely use the trademark symbols.) [[MOS:CONFORMTITLE]] says that titles of works should be altered to conform to Wikipedia house style. -- [[User:Beland|Beland]] ([[User talk:Beland|talk]]) 12:12, 12 May 2020 (UTC)
 
== Number of valid characters ==
 
Article reports 143,859 valid characters ; a short python script that runs chr(value) with all possible values of length 1-4 bytes will report 2,294,016 valid characters (and 4,309,516,288 (!) invalid characters) in the byterange. How come there is a factor ~20 between the two data? <!-- Template:Unsigned IP --><small class="autosigned">—&nbsp;Preceding [[Wikipedia:Signatures|unsigned]] comment added by [[Special:Contributions/85.0.37.33|85.0.37.33]] ([[User talk:85.0.37.33#top|talk]]) 18:44, 25 May 2020 (UTC)</small> <!--Autosigned by SineBot-->
 
:No, that is not what the article reports. The article does have the text "there is a repertoire of 143,859 characters,"; note that the text neither uses the term valid nor refers to strings of 1-4 octets; it refers to characters that have been assigned code points in the range 0000&ndash;10FFFF by the [[Unicode Consortium]]. The only text that uses the term ''valid'' is distinguishing surrogate pairs from other code points. [[User:Chatul|Shmuel (Seymour J.) Metz Username:Chatul]] ([[User talk:Chatul|talk]]) 20:23, 25 May 2020 (UTC)