rudder-analytics / Goodness-of-Pronounciation
☆40 Updated last year
Alternatives and similar repositories for Goodness-of-Pronounciation
Users that are interested in Goodness-of-Pronounciation are comparing it to the libraries listed below
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". ☆180 Updated 2 years ago
- A non-native English corpus for pronunciation scoring task ☆147 Updated last year
- Text to speech alignment using CTC forced alignment ☆347 Updated 3 weeks ago
- ☆17 Updated 4 months ago
- ONNX Inference of Pyannote Segmentation ☆93 Updated 8 months ago
- Open source inference code for Rev's model ☆424 Updated 4 months ago
- This tool uses AI to evaluate your pronunciation. ☆350 Updated 2 weeks ago
- Multilingual G2P in 100 languages ☆351 Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆403 Updated last year
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching ☆534 Updated last week
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen… ☆242 Updated 3 years ago
- Deep learning based speech and pronunciation assessment API for 8 languages. ☆47 Updated last year
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level… ☆18 Updated 9 months ago
- Timething is a library for aligning text transcripts with their audio recordings. ☆122 Updated 9 months ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring ☆28 Updated last year
- Running the F5-TTS by ONNX Runtime ☆176 Updated 3 weeks ago
- A lightweight end-to-end text-to-speech model ☆119 Updated 6 months ago
- Official implementation of the TTS model Lina-Speech ☆168 Updated 7 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3 ☆419 Updated 11 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆173 Updated last week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆118 Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆337 Updated 9 months ago
- A toolkit for speaker diarization. ☆279 Updated 3 weeks ago
- We Speech Transcript based on LLM, in 300 lines of code. ☆176 Updated 2 months ago
- Collection of pretrained models for the Montreal Forced Aligner ☆161 Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆103 Updated 10 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆75 Updated 5 months ago
- Voice gender classifier using ECAPA-TDNN ☆56 Updated 7 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆155 Updated last year
- Official Implementation of StyleTTS ☆444 Updated 7 months ago