zouharvi / pwesuite
Suite for phonetic word embeddings, especially their evaluation and baseline models.
☆24 · Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for pwesuite
- ☆31 · Updated 2 weeks ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆13 · Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆83 · Updated last month
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆19 · Updated 2 months ago
- ☆56 · Updated last year
- ☆33 · Updated 3 years ago
- A collection of utilities for handling IPA phones. ☆25 · Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Updated last year
- Collection of scripts from mHuBERT-147. ☆22 · Updated this week
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated 9 months ago
- Datasets for turn-taking research ☆12 · Updated 11 months ago
- Official code for Wav2Seq ☆95 · Updated 2 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M… ☆17 · Updated last year
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆75 · Updated last year
- Universal multilingual automatic speech transcription into IPA ☆55 · Updated 2 months ago
- AudioBench: A Universal Benchmark for Audio Large Language Models ☆93 · Updated last week
- ☆77 · Updated 5 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆135 · Updated this week
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆144 · Updated last year
- ☆32 · Updated 2 months ago
- A JAX library for building lattice-based speech transducer models ☆40 · Updated 3 weeks ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆12 · Updated last year
- phone inventory library ☆15 · Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model ☆30 · Updated last year
- Phoneme segmentation using pre-trained speech models ☆54 · Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆25 · Updated last year
- Prosodic Speech Segmentation with Transformers ☆23 · Updated 8 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac… ☆19 · Updated last week
- ☆19 · Updated last year
- Audio tokenization, in the fastest way possible! ☆45 · Updated 2 months ago