Unicode block: Difference between revisions

Content deleted Content added
Undid revision 1175171100 by 93.57.249.193 (talk) Revert. Range delimiter should be consistent. Additionally, I would suggest an en dash (–) for ranges if you don't want to use the double dots used by Unicode.
Design and implementation: … those I've seen …
Line 7:
Unicode blocks are identified by unique names, which use only ASCII characters and are usually descriptive of the nature of the symbols, in [[English language|English]]; such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters, and ignore any whitespace, hyphens, and underbars; so the last name is equivalent to "supplemental_arrows__a" and "SUPPLEMENTALARROWSA".<ref name=uniblocks>{{Cite web|url=https://www.unicode.org/Public/UNIDATA/Blocks.txt|title=Unicode Blocks data file, Unicode version 15.1|publisher=Unicode Consortium|access-date=2023-09-12}}</ref>
 
Blocks are [[intersection (set theory)|pairwise disjoint]]; that is, they do not overlap. The starting code point and the size (number of code points) of each block are always multiples of 16; therefore, in the [[hexadecimal notation]], the starting (smallest) point is U+''xxx''0 and the ending (largest) point is U+''yyy''F, where ''xxx'' and ''yyy'' are three or more hexadecimal digits. (These constraints are intended to simplify the display of glyphs in Unicode Consortium documents, as tables with 16 columnsrows labeled with the last hexadecimal digit of the code point.<ref name=uniblocks/>) The size of a block may range from the minimum of 16 to a maximum of 65,536 code points.
 
Every assigned code point has a glyph property called "Block", whose value is a character string naming the unique block that owns that point.<ref>{{Cite web |title=Glossary |url=https://www.unicode.org/glossary/#B |access-date=2022-08-07 |website=www.unicode.org}}</ref> However, a block may also contain unassigned code points, usually reserved for future additions of characters that "logically" should belong to that block. Code points not belonging to any of the named blocks, e.g. in the unassigned [[Plane (Unicode)|planes]] 4–13, have the value block="No_Block".<ref name=uniblocks/>