spaceraccoon / accent-trainer

Flask web app/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFCC), and accuracy. The accents are generated with Amazon Polly, and accuracy is analyzed using Bing Speech API speech-to-text.

☆17

Alternatives and similar repositories for accent-trainer

Users who are interested in accent-trainer are comparing it to the libraries listed below.

tzyll / goparrot
Goodness of Pronunciation (GOP) for oral reading assessment.
☆52 · Updated 3 years ago
MarceloSancinetti / epa-gop-pykaldi
☆25 · Updated 3 years ago
lovemefan / CT-Transformer-punctuation
An enterprise-grade Chinese-English code-switching punctuator from funasr.
☆24 · Updated last year
skysbird / g2p-zh-en
Chinese and English bilingual G2P
☆21 · Updated last year
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆60 · Updated 4 years ago
swshon / lre15_siam
Language identification using Siamese network based on i-vector
☆7 · Updated 7 years ago
MaxMax2016 / Grad-TTS-Chinese
Huawei Grad-TTS for Chinese
☆50 · Updated last year
vocaliodmiku / wav2vec2mdd
End-to-End Mispronunciation Detection via wav2vec2.0
☆46 · Updated 3 years ago
xcmyz / Transformer-TTS
TTS model based on Transformer.
☆58 · Updated 5 years ago
Yaoming95 / UniPunc
The case study and multilingual performance of ICASSP submission
☆24 · Updated 2 years ago
amirharati / kaldi-alligner
Scripts to align a given waveform to its transcription using models trained with Kaldi
☆32 · Updated 5 years ago
yuwchen / MultiPA
☆15 · Updated 2 months ago
Riroaki / Chinese-Rhythm-Predictor
Chinese prosody prediction model based on random forests and conditional random fields
☆28 · Updated 11 months ago
lovemefan / fsmn-vad
An enterprise-grade voice activity detector from modelscope and funasr.
☆102 · Updated 2 years ago
atomicoo / chn_text_norm
Chinese text normalization.
☆55 · Updated 4 years ago
daanzu / wenet_stt_python
☆33 · Updated 3 years ago
hschen0712 / textgrid-parser
Parse textgrid files and convert them to json
☆9 · Updated 8 years ago
thuhcsi / LightGrad
☆65 · Updated last year
dave-fernandes / SpeakerClassifier
A random forest classifier to predict the age-group and gender of a speaker from voice measurements.
☆18 · Updated 6 years ago
Sundy1219 / eesen-for-thchs30
ASR for Chinese Mandarin
☆75 · Updated 7 years ago
AkishinoShiame / Chinese-Speech-Emotion-Datasets
Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.
☆37 · Updated 6 years ago
Hannes1 / react-native-wenet
Wenet speech to text for react native
☆10 · Updated 2 years ago
lezasantaizi / audio_cut
Audio segmentation with Python and WebRTC
☆10 · Updated 6 years ago
rhss10 / joint-apa-mdd-mtl
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆21 · Updated last year
thuhcsi / FlatTN
Chinese Text Normalization and Dataset
☆85 · Updated 3 years ago
amritkromana / disfluency_detection_from_audio
☆22 · Updated 10 months ago
yakouyang / VAD
Voice activity detection (Python version, simple and easy to use)
☆12 · Updated 8 years ago
jackyyy0228 / WFST-decoder-for-phoneme-posterior
☆22 · Updated 5 years ago
JasonWei512 / wavenet_vocoder
(Deprecated) WaveNet vocoder
☆21 · Updated 5 years ago
wangyu09 / exkaldi-rt
An online speech recognition extension toolkit of Kaldi
☆56 · Updated 4 years ago