CohenPr-XPF / XPF
☆36 Updated last year
Alternatives and similar repositories for XPF
Users who are interested in XPF are comparing it to the libraries listed below.
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆43 Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆173 Updated 2 weeks ago
- Workflow for forced alignment between languages ☆19 Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆168 Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA ☆66 Updated 5 months ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones. ☆36 Updated last year
- ☆80 Updated 2 weeks ago
- Linguistic processing for Common Voice ☆57 Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆95 Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆22 Updated 11 months ago
- Keyword spotting and forced alignment in any language ☆63 Updated this week
- Convert English text from written expressions into spoken forms ☆26 Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆32 Updated 2 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆38 Updated 2 years ago
- phone inventory library ☆16 Updated 2 years ago
- Phoneme segmentation using pre-trained speech models ☆55 Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆25 Updated 5 months ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model ☆32 Updated last year
- Labeled data for homograph disambiguation ☆59 Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆76 Updated 3 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆57 Updated last year
- asr2k ☆52 Updated last year
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" ☆47 Updated 3 years ago
- Phoneme alignment representation compatible with multiple forced aligners ☆21 Updated last year
- A collection of utilities for handling IPA phones. ☆25 Updated last year
- ☆24 Updated last year
- Collection of scripts from mHuBERT-147. ☆29 Updated 9 months ago
- ☆40 Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆118 Updated 2 years ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature… ☆12 Updated 11 months ago