Thiagohgl / ai-pronunciation-trainer
This tool uses AI to evaluate your pronunciation.
☆153 Updated last year
Related projects
Alternatives and complementary repositories for ai-pronunciation-trainer
- A non-native English corpus for pronunciation scoring task ☆112 Updated 4 months ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". ☆150 Updated last year
- Text to speech alignment using CTC forced alignment ☆141 Updated 3 weeks ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring ☆20 Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆262 Updated 2 months ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen… ☆207 Updated 2 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆259 Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆275 Updated last week
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆33 Updated 10 months ago
- ONNX Inference of Pyannote Segmentation ☆66 Updated 2 months ago
- Goodness of Pronunciation (GOP) for oral reading assessment. ☆46 Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆179 Updated last week
- Deep learning based speech and pronunciation assessment API for 8 languages. ☆30 Updated 5 months ago
- Live-Transcription (STT) with Whisper PoC ☆155 Updated 5 months ago
- Spoken Language assessment ☆41 Updated 4 years ago
- 📈 A forced aligner intended for synchronization of narrated text ☆85 Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription ☆132 Updated 5 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆117 Updated this week
- Grapheme to phoneme conversion with deep learning. ☆358 Updated 11 months ago
- Kaldi-based goodness of pronunciation (GOP) ☆147 Updated 3 years ago
- Official Implementation of StyleTTS ☆401 Updated 11 months ago
- A modified VITS that utilizes phoneme duration's ground truth for better robustness ☆118 Updated last year
- Python forced alignment ☆73 Updated 7 months ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement ☆317 Updated 3 weeks ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA ☆71 Updated 5 months ago
- XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023) ☆307 Updated 4 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆141 Updated 6 months ago