CohenPr-XPF / XPFLinks
☆38Updated last year
Alternatives and similar repositories for XPF
Users that are interested in XPF are comparing it to the libraries listed below
Sorting:
- Workflow for forced alignment between languages☆23Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆185Updated last week
- ☆80Updated 5 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Updated 2 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆99Updated 2 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆33Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆143Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆43Updated 4 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- Universal multilingual automatic speech transcription into IPA☆73Updated 10 months ago
- Linguistic processing for Common Voice☆58Updated last year
- Keyword spotting and forced alignment in any language☆82Updated 4 months ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- ☆40Updated 3 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Updated 3 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Updated last week
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆47Updated 3 years ago
- multilingual speech aligner☆76Updated 2 years ago
- ☆67Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 10 months ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆41Updated 2 years ago
- Various speech datasets made available to the public☆130Updated last year
- Prosodic Speech Segmentation with Transformers☆26Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- asr2k☆52Updated last year
- Phoneme alignment representation compatible with multiple forced aligners☆22Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆65Updated last year