Unicode Standard tokenization routines and orthography profile segmentation
★41 · Mar 7, 2026 · Updated last month
Alternatives and similar repositories for segments
Users that are interested in segments are comparing it to the libraries listed below.
- Converts Mandarin Chinese pinyin notation to IPA (International Phonetic Alphabet) notation · ★18 · Nov 28, 2023 · Updated 2 years ago
- 🗣️ Convert between phonetic alphabets · ★11 · Feb 7, 2022 · Updated 4 years ago
- A compact audio-to-phoneme aligner for singing voice · ★12 · Jan 17, 2024 · Updated 2 years ago
- Collection of small Lua modules · ★10 · Feb 15, 2026 · Updated 2 months ago
- A programming language · ★14 · Jan 24, 2015 · Updated 11 years ago
- A minimal modern (Lua)TeX distribution · ★15 · May 12, 2024 · Updated last year
- A TeX implementation in a single C++11 class. · ★19 · Sep 19, 2020 · Updated 5 years ago
- Breaks a word into syllables using an LSTM-based neural network. · ★20 · Aug 14, 2023 · Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages · ★151 · Apr 5, 2024 · Updated 2 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023 · ★27 · Apr 27, 2023 · Updated 3 years ago
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model. · ★22 · Jan 22, 2024 · Updated 2 years ago
- 24-hour Automatic Speech Recognition · ★27 · Jun 4, 2021 · Updated 4 years ago
- Massively multilingual pronunciation mining · ★365 · Apr 21, 2026 · Updated last week
- Simple text to phonemes converter for multiple languages · ★20 · Nov 21, 2022 · Updated 3 years ago
- ★19 · Mar 22, 2024 · Updated 2 years ago
- A lexicon compiler for non-suffixational morphologies · ★13 · Jan 29, 2026 · Updated 3 months ago
- ★21 · May 12, 2012 · Updated 13 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System · ★35 · Aug 31, 2020 · Updated 5 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) · ★83 · Nov 13, 2021 · Updated 4 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio… · ★36 · Apr 25, 2025 · Updated last year
- Tools for working with the CMU Pronunciation Dictionary · ★36 · Sep 5, 2017 · Updated 8 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION · ★46 · May 12, 2023 · Updated 2 years ago
- Mutable string support for Lua. · ★26 · Mar 20, 2015 · Updated 11 years ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model · ★36 · Apr 29, 2025 · Updated last year
- ★14 · Apr 13, 2026 · Updated 2 weeks ago
- Cross-Linguistic Transcription Systems · ★17 · Mar 20, 2026 · Updated last month
- Data processing tools for preparing speech and labels for training TTS voices · ★29 · Aug 13, 2020 · Updated 5 years ago
- ★20 · Jul 16, 2023 · Updated 2 years ago
- LDWizard: A generic framework for simplifying the creation of linked data. Supported by the PLDN community. · ★18 · May 27, 2024 · Updated last year
- A database of number names for 186 languages, locales, and scripts · ★67 · Mar 3, 2023 · Updated 3 years ago
- speakr: A Wrapper for the Phonetic Software Praat · ★27 · Feb 28, 2026 · Updated 2 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) · ★100 · Nov 20, 2023 · Updated 2 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo… · ★24 · Dec 8, 2019 · Updated 6 years ago
- A web interface for viewing ELAN and FLEx files · ★19 · Feb 16, 2024 · Updated 2 years ago
- Implementation of Android's TextToSpeechService that provides Estonian text-to-speech · ★17 · Jan 19, 2019 · Updated 7 years ago
- Multilingual and monolingual corpora focused on languages of the Caucasus, for Natural Language Processing (NLP) · ★36 · Nov 29, 2024 · Updated last year
- Text-to-Speech tutorial at SLTU 2016 · ★35 · May 10, 2016 · Updated 9 years ago
- Fast Lua string operations · ★22 · Mar 21, 2020 · Updated 6 years ago
- A very simple vocal tract model (a few-tube model) that generates vowel sounds · ★18 · Jul 9, 2023 · Updated 2 years ago