Multilingual computing


Unicode

The Unicode specification includes a huge number of different letters, symbols, characters, mathematical and musical symbols, dingbats, etc (known collectively as 'glyphs'). Each glyph is given a unique number and any piece of software that supports Unicode can, in theory, display any of the glyphs. In practice, this only happens if you have Unicode enabled fonts containing the relevant glyphs.

Unicode enabled fonts generally include only a subset of the total Unicode specification, for example Latin, Greek, Cyrillic and Thai, though there are a few fonts, such as Arial Unicode MS, which include a large proportion of the specification. Arial Unicode MS is included with Microsoft Office 2000.

Unicode specifies over 110,000 glyphs, and includes the following scripts:

Arabic, Armenian, Balinese, Bamum, Bassa Vah, Batak, Bengali, Bopomofo, Brahmi, Braille, Buginese, Buhid, Burmese, Canadian Aboriginal Syllabics, Caucasian Albanian, Cherokee, Chinese characters, Coptic, Cyrillic, Deseret, Devanagari, Duployan shorthand, Elbasan, Ethiopic, Georgian, Glagolitic, Gothic, Grantha, Greek, Gujarati, Gurmukhi, Hanunoo, Hiragana, Latin, Limbu, Mandaic, Malayalam, Mongolian, Hebrew, Kannada, Katakana, Kharosthi, Khojki, Khudawadi, Korean, Lao, Linear A, Linear B, Mahajani, Mandaic, Manichaean, Mende, Modi, Mro, Nabataean, New Tai Lue, N'Ko, Ogham, Old Italic, Old North Arabian, Old Permic, Old Persian, Oriya, Osmanya (Somali), Phags-pa, Pahawh Hmong, Palmyrene, Pau Cin Hau, Phoenician, Psalter Pahlavi, Runic, Shavian, Siiddham, Sinhala, Sumero-Akkadian Cuneiform, Syloti Nagri, Syriac, Tagbanwa, Tagalog, Tai Le, Tamil, Telugu, Thai, Thaana, Tibetan, Tifinagh, Tirhuta, Ugaritic, Varang Kshiti / Warang Citi, Yi

The Unicode specification also includes mathmatical symbols, musical notation and other assorted miscellaneous symbols.

Note: the Unicode specification is regularly up-dated, so new scripts may have been added since the above list was compiled. If you know of any additions that are not listed, please contact me.

You can use Unicode to add text in just about any language to webpages and even to mix languages/scripts on the same page. You can see examples of this on the following pages:

Links

Information about Unicode
http://www.unicode.org
http://www.alanwood.net/unicode/
http://www.fontspace.com/unicode/

List of applications that support unicode
http://www.unicode.org/unicode/onlinedat/products.html

Information about character sets and internationalisation
http://www.w3.org/International

Unicode text editors
http://www.unipad.org
http://www.yudit.org
https://notepad-plus-plus.org/
http://www.babelstone.co.uk/Software/BabelPad.html

Computing with Accents, Symbols & Foreign Scripts
http://symbolcodes.tlt.psu.edu

Unicode fonts
http://www.wazu.jp
http://www.alanwood.net/unicode/fonts.html
http://www.slovo.info/unifonts.htm

[top]


Green Web Hosting - Kualo

You can support this site by Buying Me A Coffee, and if you like what you see on this page, you can use the buttons below to share it with people you know.

 

Learn a Language with gymglish

If you like this site and find it useful, you can support it by making a donation via PayPal or Patreon, or by contributing in other ways. Omniglot is how I make my living.

 

Note: all links on this site to Amazon.com, Amazon.co.uk and Amazon.fr are affiliate links. This means I earn a commission if you click on any of them and buy something. So by clicking on these links you can help to support this site.

[top]

iVisa.com