Thiagohgl / ai-pronunciation-trainer
This tool uses AI to evaluate your pronunciation.
☆198Updated 2 weeks ago
Alternatives and similar repositories for ai-pronunciation-trainer:
Users that are interested in ai-pronunciation-trainer are comparing it to the libraries listed below
- A non-native English corpus for pronunciation scoring task☆121Updated 6 months ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆161Updated last year
- ☆25Updated 9 months ago
- Text to speech alignment using CTC forced alignment☆206Updated last week
- ☆12Updated 5 months ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆23Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆112Updated last month
- ONNX Inference of Pyannote Segmentation☆81Updated last month
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆217Updated 2 years ago
- Deep learning based speech and pronunciation assessment API for 8 languages.☆33Updated 7 months ago
- A tool to practice English speaking☆33Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆138Updated 8 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆298Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- ☆195Updated 3 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆62Updated 7 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆278Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆121Updated 2 months ago
- Kaldi-based goodness of pronunciation (GOP)☆147Updated 3 years ago
- ☆108Updated last month
- Goodness of Pronunciation (GOP) for oral reading assessment.☆47Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆132Updated 9 months ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆120Updated last year
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.☆241Updated 2 years ago
- Python forced alignment☆78Updated 9 months ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆25Updated 5 years ago
- On-device voice activity detection (VAD) powered by deep learning☆192Updated 2 weeks ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆150Updated 6 months ago
- Universal multilingual automatic speech transcription into IPA☆57Updated 5 months ago
- Diarization scoring tools.☆233Updated last year