cldf / segments
Unicode Standard tokenization routines and orthography profile segmentation
☆33Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for segments
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Labeled data for homograph disambiguation☆53Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆30Updated 9 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆135Updated this week
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago
- phone inventory library☆15Updated last year
- Multilingual grapheme-to-phoneme conversion☆19Updated 6 years ago
- ☆40Updated 2 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Trainable algorithm for automatic measurement of voice onset time☆62Updated last year
- Second SIGMORPHON Shared Task on Grapheme-to-Phoneme Conversions☆22Updated 3 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆48Updated 2 months ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- asr2k☆48Updated 5 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- A phoneme-allophone database for many languages☆48Updated 4 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- ☆30Updated 5 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆22Updated this week
- simple textgrid to csv converter☆25Updated 3 years ago
- Breaks a word into syllables using an LSTM-based neural network.☆19Updated last year
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆81Updated 6 months ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- ☆32Updated 2 months ago
- Cross-Linguistic Transcription Systems☆14Updated 7 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆80Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆37Updated last year
- Code for AccentDB.☆19Updated 3 years ago
- Workflow for forced alignment between languages☆17Updated 9 months ago