Syllabification¶
Morphological and phonetic syllable break for Burmese text. Syllable break with zawgyi text will not be accurate / reliable. You probably should convert it to unicode before processing.
-
myanmar.language.
MorphoSyllableBreak
(text, encoding)[source]¶ Return an iterable of morphological / visual syllables in text.
>>> from myanmar.encodings import UnicodeEncoding >>> slb = list(MorphoSyllableBreak("အကြွေးပေး", UnicodeEncoding())) >>> list(s['syllable'] for s in slb) ['အ', 'ကြွေး', 'ပေး'] >>> slb[2] {'syllable': 'ပေး', 'consonant': 'ပ', 'eVowel': 'ေ', 'visarga': 'း'}
-
myanmar.language.
PhonemicSyllableBreak
(text, encoding)[source]¶ Return an iterable of phonemic syllables in text.
>>> from myanmar.encodings import UnicodeEncoding >>> slb = list(PhonemicSyllableBreak("သီးပင်အိုင်", UnicodeEncoding())) >>> list(s['syllable'] for s in slb) ['သီး', 'ပင်', 'အိုင်'] >>> slb[0] {'syllable': 'သီး', 'consonant': 'သ', 'iVowel': 'ီ', 'visarga': 'း'}