jhasegaw / phonecodes
Python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.
☆32 Updated last year
Alternatives and similar repositories for phonecodes:
Users who are interested in phonecodes are comparing it to the libraries listed below:
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆40 Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆38 Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆131 Updated 10 months ago
- Workflow for forced alignment between languages ☆17 Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆27 Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆154 Updated last year
- Phonetically-Oriented Word Error Rate ☆33 Updated 5 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆147 Updated last month
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 Updated 2 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆86 Updated last year
- ☆40 Updated 3 years ago
- Linguistic processing for Common Voice ☆53 Updated last year
- Praat textgrid manipulation in Python ☆52 Updated last year
- Read, write, and manipulate Praat TextGrid files with Python ☆126 Updated last year
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆77 Updated 10 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆34 Updated 7 months ago
- A phoneme-allophone database for many languages ☆48 Updated 4 years ago
- ☆80 Updated 8 months ago
- Forced Alignments for Common Voice ☆31 Updated 4 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset. ☆44 Updated 5 years ago
- multilingual speech aligner ☆72 Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations ☆100 Updated 4 months ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆57 Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0 ☆43 Updated 3 years ago
- ☆27 Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech ☆51 Updated 3 years ago
- Keyword spotting and forced alignment in any language ☆51 Updated 7 months ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court ☆22 Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆74 Updated 3 years ago
- Collection of pretrained models for the Montreal Forced Aligner ☆130 Updated 7 months ago