Thiagohgl / ai-pronunciation-trainer
This tool uses AI to evaluate your pronunciation.
☆272Updated 3 months ago
Alternatives and similar repositories for ai-pronunciation-trainer:
Users that are interested in ai-pronunciation-trainer are comparing it to the libraries listed below
- ☆35Updated last year
- A non-native English corpus for pronunciation scoring task☆131Updated 9 months ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆170Updated 2 years ago
- Text to speech alignment using CTC forced alignment☆270Updated last month
- A tool to practice English speaking☆42Updated last year
- ☆13Updated 2 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆91Updated last year
- Deep learning based speech and pronunciation assessment API for 8 languages.☆43Updated 10 months ago
- ☆130Updated 4 months ago
- On-device voice activity detection (VAD) powered by deep learning☆206Updated last week
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.☆247Updated 2 years ago
- Converts English text to IPA notation☆381Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆680Updated 4 months ago
- Open source inference code for Rev's model☆399Updated this week
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆26Updated last year
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆82Updated 10 months ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆60Updated 3 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆224Updated 2 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆302Updated last year
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆52Updated last year
- Goodness of Pronunciation (GOP) for oral reading assessment.☆50Updated 3 years ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆77Updated 10 months ago
- Simple text to phones converter for multiple languages☆1,366Updated 7 months ago
- A modified VITS that utilizes phoneme duration's ground truth for better robustness☆135Updated last year
- Command line utility for forced alignment using Kaldi☆1,452Updated last month
- A python package for deep multilingual punctuation prediction.☆119Updated 8 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆91Updated 2 years ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆242Updated 10 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆72Updated 5 months ago
- Finetune VITS and MMS using HuggingFace's tools☆145Updated last year