rudder-analytics / Goodness-of-Pronounciation
☆35 Updated last year
Alternatives and similar repositories for Goodness-of-Pronounciation:
Users interested in Goodness-of-Pronounciation are comparing it to the libraries listed below.
- ☆13 Updated last week
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". ☆170 Updated 2 years ago
- A non-native English corpus for pronunciation scoring task ☆131 Updated 9 months ago
- Goodness of Pronunciation (GOP) for oral reading assessment. ☆50 Updated 3 years ago
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring ☆26 Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆95 Updated 6 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆62 Updated 3 weeks ago
- A lightweight end-to-end text-to-speech model ☆112 Updated 2 months ago
- This tool uses AI to evaluate your pronunciation. ☆272 Updated 3 months ago
- Text to speech alignment using CTC forced alignment ☆270 Updated last month
- Official Code for ParrotTTS ☆48 Updated 6 months ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen… ☆224 Updated 2 years ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆126 Updated 5 months ago
- ONNX Inference of Pyannote Segmentation ☆85 Updated 4 months ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA ☆82 Updated 10 months ago
- ☆91 Updated 2 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines … ☆60 Updated 3 years ago
- Multilingual G2P in 100 languages ☆321 Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆206 Updated last week
- This repo is related to the INTERSPEECH 2024 paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" ☆20 Updated 5 months ago
- ☆26 Updated 2 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆158 Updated last week
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆16 Updated last year
- Timething is a library for aligning text transcripts with their audio recordings. ☆117 Updated 4 months ago
- Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC… ☆17 Updated 7 years ago
- Kaldi-based goodness of pronunciation (GOP) ☆149 Updated 4 years ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download) ☆130 Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆26 Updated last week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆112 Updated 2 years ago
- C++ version of pyannote audio speaker diarization pipeline ☆21 Updated last year