cainesap / syllabify
Automatically convert plain text into phonemes (US English pronunciation) and syllabify
☆26Updated 7 years ago
Alternatives and similar repositories for syllabify:
Users that are interested in syllabify are comparing it to the libraries listed below
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆39Updated last year
- Labeled data for homograph disambiguation☆54Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆131Updated 9 months ago
- ☆40Updated 2 years ago
- Read, write, and manipulate Praat TextGrid files with Python☆126Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆31Updated 11 months ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆72Updated 4 years ago
- multilingual speech aligner☆73Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆144Updated this week
- ☆34Updated 4 months ago
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago
- Alignment files of LibriTTS.☆60Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- A phoneme-allophone database for many languages☆48Updated 4 years ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆71Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆240Updated 5 years ago
- asr2k☆48Updated 7 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆151Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆72Updated 3 years ago
- Linguistic processing for Common Voice☆52Updated last year
- ☆111Updated 2 years ago
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago