hoaaoh / Audio2VecLinks
Audio2Vec with multi lingual
☆8Updated 7 years ago
Alternatives and similar repositories for Audio2Vec
Users that are interested in Audio2Vec are comparing it to the libraries listed below
Sorting:
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 6 years ago
- Tensorflow implementation of Nvidia Waveglow☆41Updated 6 years ago
- Tools for Ahocoder data processing and evaluation metrics☆14Updated last year
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- c++ Kaldi IO lib (static and dynamic).☆25Updated 6 years ago
- ☆48Updated 4 years ago
- TTS model based on Transformer.☆58Updated 5 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆93Updated 6 years ago
- an tutorial implement of voice conversion using pytorch☆36Updated 7 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆125Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- A simple, portable decoder☆10Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Tacotron text to speech in C++(synthesize only)☆76Updated 5 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Updated 7 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Updated 5 years ago
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆52Updated 6 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆36Updated 7 years ago
- Fast parallel RNN-Transducer.☆10Updated 5 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- ☆24Updated 5 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆66Updated 6 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago