alicex2020 / Mandarin-Tone-Classification

Deep learning using CNN for Mandarin Chinese tone classification

☆35

Alternatives and similar repositories for Mandarin-Tone-Classification

Users who are interested in Mandarin-Tone-Classification are comparing it to the repositories listed below

saber5433 / ToneNet
ToneNet: A CNN Model of Tone Classification of Mandarin Chinese
☆18 · Updated 5 years ago
petronny / g2p
Pre-trained grapheme-to-phoneme (G2P) models
☆25 · Updated 4 years ago
gwinterstein / CantoMap
An audio and transcribed corpus of contemporary Hong Kong Cantonese
☆38 · Updated 4 years ago
AkishinoShiame / Chinese-Speech-Emotion-Datasets
Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.
☆36 · Updated 7 years ago
JazminVidal / gop-dnn-epadb
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35 · Updated last year
daanzu / wenet_stt_python
☆33 · Updated 3 years ago
andi611 / ZeroSpeech-TTS-without-T
A PyTorch implementation for the ZeroSpeech 2019 challenge.
☆112 · Updated 5 years ago
tzyll / goparrot
Goodness of Pronunciation (GOP) for oral reading assessment.
☆52 · Updated 3 years ago
ronggong / interspeech2018_submission01
Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…
☆46 · Updated 7 years ago
HLTCHKUST / cantonese-asr
☆86 · Updated last year
prosodylab / prosobeast-annotation-tool
☆40 · Updated 3 years ago
pariajm / e2e-asr-and-disfluency-removal-evaluator
A new metric for evaluating end-to-end speech recognition and disfluency removal systems
☆19 · Updated 4 years ago
dipjyoti92 / SC-WaveRNN
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110 · Updated 3 years ago
topel / goodness-of-pronunciation-HTK
Phone-level evaluation of L2 speakers (GOP algorithm)
☆27 · Updated 8 years ago
KunZhou9646 / Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…
☆90 · Updated 4 years ago
tbornt / phoneme_ctc
Bidirectional dynamic RNN + CTC for phoneme recognition
☆46 · Updated 5 years ago
espnet / espnet_tts_frontend
Text frontend for ESPnet tts recipes
☆34 · Updated 4 years ago
jcsilva / multilingual-g2p
Multilingual Grapheme to Phoneme
☆50 · Updated 9 years ago
moisesveleta / GOP-LSTM
Improving the Goodness of Pronunciation with DNNs and RNNs
☆32 · Updated 6 years ago
lingjzhu / probing-TTS-models
Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf
☆32 · Updated 2 years ago
thuhcsi / NeuFA
Neural network-based forced alignment with bidirectional attention mechanism
☆77 · Updated 6 months ago
philipperemy / speaker-change-detection
Paper: https://arxiv.org/abs/1702.02285
☆64 · Updated 6 years ago
eastonYi / wav2vec
A simplified version of wav2vec (1.0, vq, 2.0) in fairseq
☆157 · Updated 4 years ago
celebrity-audio-collection / videoprocess
CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.
☆74 · Updated 5 years ago
Appen / UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
☆101 · Updated 2 years ago
azraelkuan / voice-conversion
A tutorial implementation of voice conversion using PyTorch
☆35 · Updated 7 years ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆61 · Updated 4 years ago
irebai / SpecAugment_KALDI
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆14 · Updated 5 years ago
begeekmyfriend / tacotron2
Forked from NVIDIA/tacotron2 and merged with Rayhane-mamah/Tacotron-2
☆82 · Updated 4 years ago
xinjli / kaldi-cmake
Create CMakeLists.txt for Kaldi
☆20 · Updated 5 years ago