BoragoCode / AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
☆67Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for AttentionBasedProsodyPrediction
- 基于随机森林和条件随机场 的中文韵律预测模型☆27Updated 3 months ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆50Updated 4 years ago
- The code for aishell-3 baseline acoustic model☆68Updated 3 years ago
- Chinese Text Normalization and Dataset☆81Updated 2 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- ☆55Updated 4 years ago
- 论文复现,使用pos标记进行中文多音字消歧☆21Updated 5 years ago
- Predict prosody labels for Chinese sentences.☆40Updated 2 years ago
- it's ASR decoder and make graph project☆32Updated 2 years ago
- ☆69Updated 3 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- The Implementation of FastSpeech2 Based on Pytorch.☆52Updated last year
- ☆74Updated 2 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆63Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆35Updated 6 years ago
- style token with tacotron2☆61Updated last year
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆75Updated 3 years ago
- Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)☆12Updated 6 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆99Updated 7 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆31Updated last year
- SpEx+(tied) source code☆75Updated last year
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆37Updated 4 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆43Updated 5 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- Chinese text normalization. 中文文本规范化。☆48Updated 3 years ago