Esperanto orthography

This is an old revision of this page, as edited by LittleDan (talk | contribs) at 16:20, 8 June 2003 (adding pronunciations of all letters). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Esperanto is written in a phonetic alphabet of twenty-eight letters. Twenty-two of these are identical in form to letters of the English alphabet (q, w, x, and y being omitted). The remaining six are accented letters, which appear as follows: ĉ, ĝ, ĥ, ĵ, ŝ (c, g, h, j, and s with circumflex), and ŭ (u with breve). The full alphabet appears as follows:-

a b c ĉ d e f g ĝ h ĥ i j ĵ k l m n o p r s &#349 t u &#365 v z

Pronunciation

Below is a list of all of the letters in the Esperanto alphabet and how to pronounce them translierated into English and SAMPA

  • a: ah, [a]
  • b: b, [b]
  • c: ts [ts]
  • ĉ: ch, [tS]
  • d: d, [d]
  • e: eh, [e]
  • f: f, [f]
  • g: hard g (as in go), [g]
  • ĝ: j, [dJ]
  • h: h, [h]
  • ĥ: kh (doesn't apear in English), [kx]
  • i: ee, [i]
  • j: y, [j]
  • ĵ: zh [Z]
  • k: k, [k]
  • l: l, [l, L, or 5]
  • m: m, [m]
  • n: n, [n]
  • o: oh, [o]
  • p: p, [p]
  • r: r, [4, r, r\`, R]
  • s: unvoiced s, [s]
  • ŝ: sh, [S]
  • t: t, [t]
  • u: oo, [u]
  • ŭ: w, [w]
  • v: v, [v]
  • z: z, [z]

The pronunciation of dipthongs such as aj (pronounced like eye) can be found by convertibg the letters to SAMPA and reading from that.

In handwritten Esperanto, the accented letters cause no problems. However, since none of them appear on standard alphanumeric keyboards, various different methods have been devised for representing them in printed and typed text using more standard characters. The original method was what is now referred to as the "h-system", but this has now largely been superceded by the so-called "x-system". With the advent of Unicode, the need for such systems is lessening.

The h-system

The original method of representing accented letters is due to the initiator of Esperanto, L. L. Zamenhof, who recommended the use of the letter h following a letter to indicate that the letter is to be interpreted as an accented one. For example, the single consonant ŝ is represented as sh, as in the words shi (ŝi, meaning she) and shanco (ŝanco, meaning chance).

Unfortunately this method suffers from two problems:-

  1. h is already a consonant in the language, so its use for another purpose would make the pronunciation and sometimes the meaning of words ambiguous.
  2. Simplistic ASCII-based rules for sorting English words fail badly for sorting Esperanto ones, because lexicographically words starting with ĉ should follow words starting with c and precede words staring with d. For example ĉu should be sorted after ci lexicographically, but written in the h-system, chu would be incorrectly sorted before ci.

The x-system

The most common system for typing in Esperanto today is the "x-system", which is the same as the h-system, but with the letter x used in place of h. For example, the single consonant ŝ is represented as sx, as in the words sxi (ŝi) and sxanco (ŝanco).

This method solves both of the problems inherent in the h-system:-

  1. x is not a consonant in the language, so its use introduces no ambiguity into the pronunciation or meaning.
  2. Words starting with cx now correctly follow words starting with c. Similarly, other accented letters are sorted after their unaccented friends. The sorting only fails when a word with cz or similar is encountered, but such words are relatively uncommon.

One problem with the x-system is when it is used alongside French text, because many French words end in ux.

Use of the caret

Another, les popular, system is the use of the caret character (^) to represent the accents, either before or after the letter to be accented. For example, ŝanco becomes ^sanco or s^anco. This shares the advantage of unambiguity with the x-system, and also has the advantage that the character itself resembles a circumflex accent, so that people unfamiliar with the system are likely to grasp what is meant. However, the system has not caught on in many places.

Exceptions to the rules

Some people do not bother to mark the accent on the letter u in the x-system, because ux is almost always part of the diphthong aux (like the ow in cow, without the rounding of the lips) or eux (as in Spanish Europa), whereas the letter combinations au and eu almost never occurs. Hence, morgau ("tomorrow") instead of morgaux, Europo ("Europe") instead of Euxropo. This is more common when using other systems.

Unicode

The entire Esperanto alphabet is part of the Latin-3 and Unicode character sets, so the above systems are no longer necessary on web pages, though the x-system remains common on Usenet and in e-mail where encoding support is rare and the limited availability of keyboard configurations makes it difficult for many to comfortably type.

See also