iamyuanchung / Autoregressive-Predictive-Coding
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆185Updated 5 years ago
Alternatives and similar repositories for Autoregressive-Predictive-Coding:
Users that are interested in Autoregressive-Predictive-Coding are comparing it to the libraries listed below
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆141Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆167Updated last year
- ☆273Updated 4 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Code to train and run Blow☆143Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆183Updated 4 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Updated 4 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆84Updated 5 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆141Updated 4 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 5 years ago
- INTERSPEECH 2019 Tutorial Materials☆193Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆81Updated 3 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆401Updated last year
- A pure python module for reading and writing kaldi ark files☆253Updated last week
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Updated 2 years ago
- ☆186Updated 10 months ago
- A CRF-based ASR Toolkit☆330Updated 7 months ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆358Updated 3 years ago
- Mel cepstral distortion (MCD) computations in python.☆222Updated 7 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆126Updated 5 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆141Updated last year
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆80Updated 6 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆154Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 2 years ago
- ☆255Updated last year
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆312Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆141Updated last year
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Updated 4 years ago
- This is the GitHub page for publicly available emotional speech data.☆343Updated 3 years ago