rudder-analytics / Goodness-of-PronounciationLinks
☆43Updated last year
Alternatives and similar repositories for Goodness-of-Pronounciation
Users that are interested in Goodness-of-Pronounciation are comparing it to the libraries listed below
Sorting:
- A non-native English corpus for pronunciation scoring task☆161Updated last month
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆191Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆95Updated 11 months ago
- Text to speech alignment using CTC forced alignment☆386Updated 3 months ago
- Timething is a library for aligning text transcripts with their audio recordings.☆126Updated 11 months ago
- Multilingual G2P in 100 languages☆365Updated 2 years ago
- ☆18Updated 7 months ago
- On-device voice activity detection (VAD) powered by deep learning☆233Updated last week
- Open source inference code for Rev's model☆433Updated 7 months ago
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆23Updated 11 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆100Updated 7 months ago
- A curated list of awesome voice activity detection☆69Updated last year
- Official implementation of the TTS model Lina-Speech☆175Updated 10 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆408Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆179Updated this week
- A lightweight end-to-end text-to-speech model☆123Updated 9 months ago
- Colab notebooks for Next-gen Kaldi☆30Updated last month
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆28Updated 2 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆253Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated last year
- Collection of pretrained models for the Montreal Forced Aligner☆177Updated last month
- C++ version of pyannote audio speaker diarizaiton pipeline☆22Updated last year
- A toolkit for speaker diarization.☆330Updated last week
- Finetune VITS and MMS using HuggingFace's tools☆177Updated last year
- Running the F5-TTS by ONNX Runtime☆182Updated 3 weeks ago
- Fine-Tune Whisper with Transformers and PEFT☆58Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA☆72Updated 9 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆432Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆130Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆347Updated last year