Wikipedia:Font

Wikipedia မှ
ဤနေရာသို့သွားရန် - အ​ညွှန်း​, ရှာ​ဖွေ​ရန်​
Info talk.png ဤစာမျက်နှာသည် ရည်ရွယ်ချက်ရှိလို့ အင်္ဂလိပ်စာနှင့်ရေးထားသည်။ ဘာသာမပြန်ပါနှင့်။
This article is written in English for a specific purpose. Please do not translate it.

This page is about the encoding and fonts that are used on the Myanmar Wikipedia. To get started right away, just skip to the external links to download a Unicode 5.1 font, install it and join the community.

For more help, consult here: Wikipedia:Myanmarsar Help.

Why Unicode?[ပြင်​ဆင်​ရန်​]

Unicode is a standard that defines how text should be saved into data and how it is read and written. Almost every script in the world, including Burmese, is defined in the Unicode Standard. Carefully designed by experts of the field around the world, this international standard is supported by virtually all the latest platforms--not only in major operating systems like Windows, Mac, and Linux but also in OSes of mobile phones and many electronic devices. The Unicode Standard is the most well-known encoding scheme to support the largest number of scripts/languages in a single page of code. More information is available at Unicode.org. See discussion on Unicode usage in Wikipedia. All Wikipedias, including the English language Wikipedia, use Unicode as it is Wikimedia Foundation policy.

The following table shows the differences in language support between Zawgyi-One and Unicode fonts. Zawgyi-One and other pseudo-Unicode fonts require nearly twice as many characters as Unicode fonts to support just one language: Burmese. Unlike Zawgyi-One and other pseudo-Unicode fonts, Unicode fonts use intelligent rendering to stack consonants and combine diacritics.

Unicode versus Zawgyi-One
Unicode Zawgyi-One
Unicode character map (color).png Zawgyi character map (color).png
Bigger version here. Uncolored version here.

     Myanmar      Shan      Mon      Sanskrit and Pali

     S'gaw Karen      Western Pwo Karen      Eastern Pwo Karen      Geba Karen

     Kayah      Rumai Palaung      No character

Bigger version here. Uncolored version here.

     Myanmar      No character for pseudo-Unicode      No character

Why use Unicode?[ပြင်​ဆင်​ရန်​]

  • It is the international accepted standard by the World Wide Web Consortium, the main international standards organization for the World Wide Web.
  • The fonts necessary to view and edit are freely available.
  • Search is seamless with Unicode.
  • Unicode makes it extremely easy to translate the Wikipedia's interface.
  • Unicode fonts support 11 languages that use the Myanmar script:
    • Burmese
    • 2 liturgical languages: Pali and Sanskrit
    • 8 minority languages: Mon, Shan, Kayah, four Karen languages and Rumai Palaung

Why not Zawgyi?[ပြင်​ဆင်​ရန်​]

Although Zawgyi is currently the most widely used solution for input in Myanmar-language websites, it is not Unicode-compliant and does not meet the standards set by the Unicode Standard (nor does it meet the W3C standards, which define the standards for the World Wide Web). Also, there are multiple ways of storing text, which makes search challenging.

However, for those who use Zawgyi and would like to contribute to the Myanmar Wikipedia, several converters (Zawgyi → Unicode 5.1) are available below:

An online typewriter that supports Zawgyi as well as Unicode fonts is available at Burglish System tester

  • Those who have Zawgyi or pseudo-Unicode fonts in their computers find their computers may display incorrect text and might affect other Unicode-supported applications.
  • There are serious problems with encoding and typing in Zawgyi.
    • When users type in Zawgyi, typing in different ordering will create different words. These different words, even though they may render the same, are not stored the same (so search is impossible if words are typed in different ordering). On the other hand, in Unicode, there is only correct way to store text, and incorrect ordering in typing is visible as you type (this means that searching for text is more difficult with Zawgyi).
      Unicode stores text in only one order and render correctly: Unicode rendering of mou.png
      Zawgyi can store text in several ways but superficially appear correct: Zawgyi rendering of mou.png
    • Many Zawgyi users type the number "zero" when they intend to input the consonant "wa." This creates a problem, because converters cannot intelligently find the instances where "zero" is intended and "wa" is intended. When using Unicode, "wa" and "zero" are separate (intelligent rendering shows you if you have typed "zero" instead of "wa").
      Unicode can intelligently tell apart wa and zero: Unicode rendering of 0, wa.png
      Zawgyi cannot intelligently render zero and wa separately: Zawgyi rendering of 0, wa.png
  • Zawgyi uses extra code points which are reserved for minority languages of Myanmar (Shan, Mon, Pwo Karen, S'gaw Karen, Geba Karen, Kayah and Rumai Palaung, as well as Sanskrit and Pali). This creates a conflict, because Unicode has reserved those points to support minority languages in Myanmar, not for Burmese characters that should be intelligently rendered by Unicode engines.
  • Zawgyi uses reserved code points for stacking Burmese consonants and creating combinations of diacritics and medials, whereas Unicode smartly renders these, so fewer code points are required for one language.
    Unicode can intelligently render yayit according to letter width with just one symbol: Unicode rendering of yayit.png
    Zawgyi needs to use two different symbols (total 8 different symbols for short head and short leg forms) for yayit: Zawgyi rendering of yayit.png

External links[ပြင်​ဆင်​ရန်​]

Unicode 5.2 Myanmar/Burmese fonts[ပြင်​ဆင်​ရန်​]

These are links for fonts that conform to the Burmese script encoded in Unicode 5.1 (all of them are available for free) -

Other sites[ပြင်​ဆင်​ရန်​]