CohenPr-XPF / XPF
☆38 Updated last year
Alternatives and similar repositories for XPF
Users who are interested in XPF are comparing it to the repositories listed below.
- Workflow for forced alignment between languages ☆21 Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆45 Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA ☆70 Updated 8 months ago
- Linguistic processing for Common Voice ☆58 Updated last year
- Phoneme segmentation using pre-trained speech models ☆55 Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆172 Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆179 Updated last week
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆95 Updated last year
- ☆80 Updated 3 months ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model ☆34 Updated 2 years ago
- Keyword spotting and forced alignment in any language ☆77 Updated 2 months ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆40 Updated 2 years ago
- Collection of scripts from mHuBERT-147. ☆32 Updated 11 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆27 Updated 8 months ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆19 Updated 4 years ago
- Phoneme alignment representation compatible with multiple forced aligners ☆20 Updated last year
- asr2k ☆52 Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆32 Updated 2 years ago
- multilingual speech aligner ☆77 Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆140 Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆23 Updated last month
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆76 Updated 4 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones. ☆41 Updated 2 months ago
- Labeled data for homograph disambiguation ☆60 Updated 2 years ago
- ☆40 Updated 3 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆22 Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆29 Updated 3 weeks ago
- ☆27 Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 Updated 2 years ago
- A package for crawling audio from YouTube ☆42 Updated 2 years ago