cainesap / syllabify
Automatically convert plain text into phonemes (US English pronunciation) and syllabify
☆27Updated 7 years ago
Alternatives and similar repositories for syllabify:
Users that are interested in syllabify are comparing it to the libraries listed below
- Python module for syllabifying English ARPABET transcriptions☆65Updated 6 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆40Updated last year
- Read, write, and manipulate Praat TextGrid files with Python☆126Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆131Updated 10 months ago
- Labeled data for homograph disambiguation☆55Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago
- multilingual speech aligner☆72Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆147Updated 3 weeks ago
- ☆40Updated 3 years ago
- A phoneme-allophone database for many languages☆48Updated 4 years ago
- Alignment files of LibriTTS.☆61Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆74Updated 3 years ago
- Grapheme to phoneme model for PyTorch☆42Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆34Updated 7 months ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- ☆79Updated 8 months ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆153Updated last year
- phone inventory library☆16Updated last year
- A Toolkit for ToBI Labeling with Python Data Structures☆24Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆240Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆72Updated 4 years ago
- ☆34Updated 5 months ago
- Trainable algorithm for automatic measurement of voice onset time☆64Updated last year
- ☆62Updated 9 months ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆72Updated 3 years ago
- ☆37Updated 3 years ago
- CMU multilingual speech repository☆31Updated 2 years ago