gary083 / GAN_Harmonized_with_HMMs
Code:Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models
☆25Updated 5 years ago
Alternatives and similar repositories for GAN_Harmonized_with_HMMs:
Users that are interested in GAN_Harmonized_with_HMMs are comparing it to the libraries listed below
- ☆31Updated 3 years ago
- PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences☆11Updated 5 years ago
- TensorFlow Implementation of CDVAE-VC.☆54Updated last year
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- ☆10Updated 6 years ago
- ☆15Updated 3 years ago
- ☆20Updated 3 years ago
- ☆22Updated 5 years ago
- ☆10Updated 2 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- ☆22Updated 4 years ago
- ☆35Updated 4 years ago
- ☆31Updated last year
- Meta-Learning for End-to-End ASR☆10Updated 4 years ago
- An evaluation toolkit for voice conversion models.☆40Updated 3 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- ASR text preprocessing utility☆21Updated 6 months ago
- Non-Autoregressive Predictive Coding☆50Updated 4 years ago
- Gaussian Mixture VAE Tacotron☆53Updated last year
- ☆51Updated 6 years ago
- ☆97Updated 3 years ago
- NIST SPH File reader (e.g. for TEDLIUM Corpus)☆25Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated last year
- An implementation of SkipVQVC with various settings.☆75Updated 4 years ago
- Re-implementation the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder☆148Updated 5 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Updated last year
- Stellenbosch University ZeroSpeech 2019 System☆10Updated 5 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 2 years ago
- Tacotron2 with Global Style Tokens☆64Updated 5 years ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆22Updated last year