zouharvi / pwesuite
Suite for phonetic word embeddings, especially their evaluation and baseline models.
☆19Updated 2 months ago
Related projects: ⓘ
- Datasets for turn-taking research☆11Updated 8 months ago
- Collection of scripts from mHuBERT-147.☆21Updated 2 months ago
- A collection of utilities for handling IPA phones.☆22Updated 11 months ago
- ☆56Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆25Updated last year
- ☆32Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- Forced Alignments for Common Voice☆29Updated 3 years ago
- Official code for Wav2Seq☆95Updated 2 years ago
- phone inventory library☆14Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆16Updated 3 weeks ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 7 months ago
- ☆11Updated 2 years ago
- vad☆14Updated last year
- asr2k☆48Updated 3 months ago
- ☆75Updated 3 months ago
- ☆12Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 2 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆74Updated 2 months ago
- Audio tokenization, in the fastest way possible!☆45Updated 3 weeks ago
- ☆74Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year
- Second SIGMORPHON Shared Task on Grapheme-to-Phoneme Conversions☆22Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆26Updated last year
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆17Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆8Updated 2 years ago