spaceraccoon / accent-trainer
Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFCC), and accuracy. The accents are generated from Amazon Polly and accuracy analysis using Bing Speech API speech to text.
☆16Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for accent-trainer
- ☆25Updated 2 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆46Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆57Updated 3 years ago
- it's ASR decoder and make graph project☆32Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆42Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆33Updated 9 months ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆13Updated last year
- ☆22Updated 5 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆14Updated 2 years ago
- ☆59Updated 4 years ago
- Huawei Grad-TTS for Chinese☆45Updated last year
- Improving the Goodness of Pronunciation with DNNs and RNNs☆31Updated 6 years ago
- Chinese Text Normalization and Dataset☆81Updated 2 years ago
- E2E ASR system☆14Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆46Updated 3 years ago
- ☆86Updated 2 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆12Updated 3 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆61Updated last year
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆15Updated 2 years ago
- ☆16Updated 2 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆34Updated last year
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆113Updated 2 years ago
- Official Implementation of Mockingjay in Pytorch☆52Updated last year
- ☆41Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- Went online decode demo☆29Updated 3 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆21Updated 2 years ago
- A random forest classifier to predict the age-group and gender of a speaker from voice measurements.☆16Updated 5 years ago