Module talk:Unicode chart: Difference between revisions

Content deleted Content added
No edit summary
Existing charts: new section
Line 56:
**It was because the [[Module:Unicode data/scripts]].ranges table skips certain chars, including the Ⴧ and Ⴭ in Georgian. Added a workaround. ―[[special:contributions/cobaltcigs|cobaltcigs]] 22:10, 9 September 2019 (UTC)
―[[special:contributions/cobaltcigs|cobaltcigs]] 20:49, 9 September 2019 (UTC)
 
== Existing charts ==
 
Interesting approach to create the Unicode code charts dynamically but I have many questions. Most only apply if this module is intended to replace the existing chart templates...
# What problem is this new approach solving? Is it just duplicating/replacing the existing templates? If not, what will this module be used for?
# Do the charts get created every time they're displayed? If so, do we care about the extra processing incurred?
# How to handle fonts? I saw the post at [[Template talk:Script#Module:Unicode chart]] and the notes above so I know this is a known issue.
# How to handle a varying number of reserved characters? The current charts leave off the "Gray areas" notice if there are no non-assigned code points because having the "gray areas" notice for those blocks would be confusing. And the wording changes if there is only one non-assigned code point.
# How to handle charts with additional footnotes? For example, [[Template:Unicode chart Arabic]]. And for the existing charts, the notes are indeed valuable.
# How to handle non-characters? For example, U+FDD0-FDEF in [[Template:Unicode chart Arabic Presentation Forms-A]].
# How to handle combining marks (which are referenced above)? Some charts have special additions for some combining characters. For example, U+A980 in [[Template:Unicode chart Javanese]] uses a dotted circle. Other combining marks, like U+1D242 in [[Template:Unicode chart Ancient Greek Musical Notation]] use a non-breaking space. Some combining marks use no additional character at all.
# How to handle characters with dashed boxes? For example, U+0600-0605, 061C, and 06DD in the [[Template:Unicode chart Arabic]] chart.
# How to handle control(ish) characters where we don't want the actual character in the chart? For example, U+061C in the [[Template:Unicode chart Arabic]] chart, and more obviously, control characters in [[Template:Unicode chart C0 Controls and Basic Latin]] and [[Template:Unicode chart C1 Controls and Latin-1 Supplement]].
# How to create character name aliases? See U+061C in [[Template:Unicode chart Arabic]] and the control characters in [[Template:Unicode chart C0 Controls and Basic Latin]] and [[Template:Unicode chart C1 Controls and Latin-1 Supplement]].
# How to handle block-specific formatting? For example [[Template:Unicode chart Javanese]] has a specific height and some of the characters in [[Template:Unicode chart Control Pictures]] use a different font size.
# How to handle character links? Like {{ping|BabelStone}}, I'm not a fan of linking specific characters (but others are). It looks like your code, optionally, will link every character if an article exists, but this could increase the number of linked characters. And many characters aren't linked to the character itself, like U+2245 in [[Template:Unicode chart Mathematical Operators]]. Some link to wikt, like U+0x2105 in [[Template:Unicode chart Letterlike Symbols]] and all the characters in [[Template:Unicode chart CJK Unified Ideographs Extension A]].
# Some blocks have special parameters that need to be taken into account: [[Template:Unicode chart Alphabetic Presentation Forms]], [[Template:Unicode chart Enclosed Alphanumeric Supplement]], [[Template:Unicode chart Enclosed CJK Letters and Months]], [[Template:Unicode chart Halfwidth and Fullwidth Forms]], [[Template:Unicode chart Miscellaneous Symbols]], and [[Template:Unicode chart Supplemental Symbols and Pictographs]]. As with most of these questions, this only only applies if you're replacing existing chart templates.
# How to determine the chart name? Most charts use the block name for the title but some don't. For example, "C0 Controls and Basic Latin" is the chart name for the "Basic Latin" block.
# How to determine what to link the chart name to. For example, the [[Template:Unicode chart Kangxi Radicals]] chart links to "Kangxi radical#Unicode". Most either link to the block name itself or the block name with "(Unicode block)" appended.
# Will the new approach be used for the list charts that make up [[List of CJK Unified Ideographs, part 1 of 4]] and [[List of CJK Unified Ideographs Extension B (Part 1 of 7)]]?
[[User:Drmccreedy|DRMcCreedy]] ([[User talk:Drmccreedy|talk]]) 04:51, 10 September 2019 (UTC)