alicex2020 / Mandarin-Tone-Classification
Deep learning using CNN for Mandarin Chinese tone classification
☆35Updated 6 years ago
Alternatives and similar repositories for Mandarin-Tone-Classification
Users who are interested in Mandarin-Tone-Classification are comparing it to the repositories listed below
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆18Updated 5 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 4 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆38Updated 4 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆36Updated 7 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- ☆33Updated 3 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Updated 3 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- ☆86Updated last year
- ☆40Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 3 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Updated 8 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 5 years ago
- Text frontend for ESPnet tts recipes☆34Updated 4 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Updated 2 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆77Updated 6 months ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- a simplified version of wav2vec(1.0, vq, 2.0) in fairseq☆157Updated 4 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆74Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- A tutorial implementation of voice conversion using PyTorch☆35Updated 7 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆61Updated 4 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 5 years ago
- Forked from NVIDIA/tacotron2 and merged with Rayhane-mamah/Tacotron-2☆82Updated 4 years ago
- create CMakeLists.txt for kaldi☆20Updated 5 years ago