Revision as of 15:05, 22 March 2018 edit 62.77.47.65 (talk) →General layout ← Previous edit		Revision as of 23:39, 20 July 2018 edit undo Drmccreedy (talk \| contribs) Extended confirmed users, Template editors 26,286 edits m code point Next edit →
Line 321: Class names in Java, when fully qualified, are traditionally dot-separated, such as "java.lang.Object". However within the low-level Class reference constants, an internal form appears which uses slashes instead, such as "java/lang/Object". The Unicode strings, despite the moniker "UTF-8 string", are not actually encoded according to the Unicode standard, although it is similar. There are two differences (see [[UTF-8]] for a complete discussion). The first is that the ~~codepoint~~code point U+0000 is encoded as the two-byte sequence <code>C0 80</code> (in hex) instead of the standard single-byte encoding <code>00</code>. The second difference is that supplementary characters (those outside the [[Basic Multilingual Plane\|BMP]] at U+10000 and above) are encoded using a surrogate-pair construction similar to [[UTF-16]] rather than being directly encoded using UTF-8. In this case each of the two surrogates is encoded separately in UTF-8. For example, U+1D11E is encoded as the 6-byte sequence <code>ED A0 B4 ED B4 9E</code>, rather than the correct 4-byte UTF-8 encoding of <code>F0 9D 84 9E</code>. ==See also==

Java class file: Difference between revisions