Syllable-aware BPE tokenizer for the Amharic language (አማርኛ) – fast, accurate, trainable.
-
Updated
Oct 20, 2025 - Python
Syllable-aware BPE tokenizer for the Amharic language (አማርኛ) – fast, accurate, trainable.
🪿 Han-solo: Thai syllable segmenter
Tool for syllabificating (dividing words into syllables) Dutch or English words. Employs recent high-performance algorithms.
Program that reads Ukrainian text using eSpeak and SoX.
A library to work with the basic Ukrainian phonetics and syllable segmentation. Rewritten from the mmsyn6ukr and mmsyn7s packages. Comparing to the ukrainian-phonetics-basic package, all the vector-related functionality removed, it also removed from the dependencies and the mmsyn2 is changed to mmsyn2-array.
Syllabize and phonemize Swahili words!
Shows a sorted list of the Ukrainian sounds representations that can be used by mmsyn7 series of programs. A program and a library that show a sorted list of the Ukrainian sounds representations that can be used by mmsyn7 series of programs
Hangeul character syllable decomposing/composing library
Add a description, image, and links to the syllable-segmentation topic page so that developers can more easily learn about it.
To associate your repository with the syllable-segmentation topic, visit your repo's landing page and select "manage topics."