Thiagohgl / ai-pronunciation-trainer
This tool uses AI to evaluate your pronunciation.
☆280 Updated 2 weeks ago
Alternatives and similar repositories for ai-pronunciation-trainer
Users who are interested in ai-pronunciation-trainer are comparing it to the repositories listed below.
- ☆37 Updated last year
- A non-native English corpus for pronunciation scoring task ☆135 Updated 10 months ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". ☆172 Updated 2 years ago
- Text to speech alignment using CTC forced alignment ☆285 Updated last month
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen… ☆226 Updated 3 years ago
- ☆15 Updated last month
- Deep learning based speech and pronunciation assessment API for 8 languages. ☆42 Updated 11 months ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring ☆26 Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆307 Updated last year
- ☆138 Updated 5 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆325 Updated 6 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing. ☆243 Updated 11 months ago
- ☆227 Updated last month
- Update ASR paper everyday ☆212 Updated this week
- speech to text with self-supervised learning based on wav2vec 2.0 framework ☆383 Updated 3 years ago
- [WIP] Scripts for fine-tuning Whisper ☆220 Updated last year
- 😎 Awesome lists about Speech Emotion Recognition ☆88 Updated 4 months ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines … ☆60 Updated 3 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,… ☆301 Updated 3 years ago
- Official Implementation of StyleTTS ☆432 Updated 4 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription) ☆81 Updated 11 months ago
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools ☆452 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆94 Updated last year
- Finetune Wav2vec 2.0 For Speech Recognition ☆131 Updated 3 months ago
- React / Vanilla JS Text to Speech with highlighting the words and sentences that are being spoken using audio files, text to speech API, … ☆148 Updated this week
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages ☆628 Updated last year
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design ☆562 Updated last year
- Kaldi-based goodness of pronunciation (GOP) ☆151 Updated 4 years ago
- This is an implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆77 Updated 6 months ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 Updated last year