Syllabification

Morphological and phonetic syllable break for Burmese text. Syllable break with zawgyi text will not be accurate / reliable. You probably should convert it to unicode before processing.

myanmar.language.MorphoSyllableBreak(text, encoding)[source]

Return an iterable of morphological / visual syllables in text.

>>> from myanmar.encodings import UnicodeEncoding
>>> slb = list(MorphoSyllableBreak("အကြွေးပေး", UnicodeEncoding()))
>>> list(s['syllable'] for s in slb)
['အ', 'ကြွေး', 'ပေး']
>>> slb[2]
{'syllable': 'ပေး', 'consonant': 'ပ', 'eVowel': 'ေ', 'visarga': 'း'}
myanmar.language.PhonemicSyllableBreak(text, encoding)[source]

Return an iterable of phonemic syllables in text.

>>> from myanmar.encodings import UnicodeEncoding
>>> slb = list(PhonemicSyllableBreak("သီးပင်အိုင်", UnicodeEncoding()))
>>> list(s['syllable'] for s in slb)
['သီး', 'ပင်', 'အိုင်']
>>> slb[0]
{'syllable': 'သီး', 'consonant': 'သ', 'iVowel': 'ီ', 'visarga': 'း'}