rudder-analytics / Goodness-of-PronounciationLinks
☆43Updated last year
Alternatives and similar repositories for Goodness-of-Pronounciation
Users that are interested in Goodness-of-Pronounciation are comparing it to the libraries listed below
Sorting:
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆196Updated 2 years ago
- A non-native English corpus for pronunciation scoring task☆164Updated 3 months ago
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- Text to speech alignment using CTC forced alignment☆421Updated 2 months ago
- ☆20Updated 9 months ago
- Timething is a library for aligning text transcripts with their audio recordings.☆128Updated last year
- This tool uses AI to evaluate your pronunciation.☆427Updated 5 months ago
- Multilingual G2P in 100 languages☆373Updated 2 years ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆58Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆104Updated 10 months ago
- Universal multilingual automatic speech transcription into IPA☆74Updated 11 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆186Updated this week
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- Official implementation of the TTS model Lina-Speech☆176Updated last year
- A toolkit for speaker diarization.☆384Updated 2 weeks ago
- A lightweight end-to-end text-to-speech model☆126Updated 11 months ago
- Deep learning based speech and pronunciation assessment API for 8 languages.☆57Updated last year
- Running the F5-TTS by ONNX Runtime☆191Updated 3 weeks ago
- F5-TTS 推理加速,速度提升约4倍!☆122Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆243Updated last week
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆257Updated 3 years ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆29Updated 2 years ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆183Updated 5 months ago
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆25Updated last year
- Cantonese Text to Speech with VITS implementation☆37Updated 2 years ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆86Updated last year
- ☆192Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆413Updated last year