rudder-analytics / Goodness-of-Pronounciation
☆38 Updated last year
Alternatives and similar repositories for Goodness-of-Pronounciation
Users who are interested in Goodness-of-Pronounciation are comparing it to the libraries listed below
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". ☆173 Updated 2 years ago
- A non-native English corpus for the pronunciation scoring task ☆141 Updated 10 months ago
- Goodness of Pronunciation (GOP) for oral reading assessment. ☆52 Updated 3 years ago
- ☆15 Updated last month
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring ☆27 Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆163 Updated 3 weeks ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen… ☆232 Updated 3 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆62 Updated 2 months ago
- Multilingual G2P in 100 languages ☆327 Updated 2 years ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement ☆157 Updated last week
- A curated list of speech datasets (110+ datasets, 75+ easy to download) ☆132 Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA ☆65 Updated 3 months ago
- Target Speaker Extraction Toolkit ☆169 Updated 2 months ago
- Kaldi-based goodness of pronunciation (GOP) ☆151 Updated 4 years ago
- ONNX Inference of Pyannote Segmentation ☆90 Updated 5 months ago
- ☆92 Updated 2 years ago
- Deep learning based speech and pronunciation assessment API for 8 languages. ☆42 Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆98 Updated 7 months ago
- Colab notebooks for Next-gen Kaldi ☆27 Updated last month
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆114 Updated 2 years ago
- C++ version of the pyannote audio speaker diarization pipeline ☆21 Updated last year
- The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆136 Updated 3 months ago
- Fine-tune the LLM part of the Spark-TTS model ☆79 Updated 2 months ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆60 Updated 4 years ago
- Collection of pretrained models for the Montreal Forced Aligner ☆152 Updated last week
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable. ☆44 Updated last month
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆91 Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆71 Updated 9 months ago
- Charsiu: A neural phonetic aligner. ☆301 Updated 2 years ago
- A lightweight end-to-end text-to-speech model ☆115 Updated 3 months ago