[This text] includes characters that require UTF-8 (Unicode) file encoding:
Ē ā ē ī ō ū (vowel with macron or “long” mark)
Ă Ĕ Ĭ ă ĕ ĭ ŏ (vowel with breve or “short” mark)
Ś ś ć (s, c with “acute”: mainly in Recording Indian Languages article)
ⁿ (small raised n, representing nasalized vowel)
ɔ ʇ ʞ (inverted letters)
‖ (double vertical line
There are also a handful of Greek words; transliterations are given in mouse-hover popups. Some compromises were made to accommodate font availability:
The ordinary “cents” sign ¢ was used in place of the correct form ȼ, and bracketed [¢] represents the capital letter Ȼ.
Turned (rotated) c is represented by ɔ (technically an open o).