iamyuanchung / Autoregressive-Predictive-Coding
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆184 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for Autoregressive-Predictive-Coding
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020) ☆137 · Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis" ☆166 · Updated last year
- Code to train and run Blow ☆143 · Updated 5 years ago
- PyTorch implementation of Generalized End-to-End Loss for speaker verification ☆82 · Updated 5 years ago
- Yet another speech toolkit based on Kaldi and PyTorch ☆173 · Updated 4 years ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch. ☆137 · Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆79 · Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19) ☆138 · Updated last year
- Fatcord's Alternative WaveRNN (Faster training) ☆126 · Updated 5 years ago
- A pure Python module for reading and writing Kaldi ark files ☆249 · Updated last year
- Implementation code of non-parallel sequence-to-sequence VC ☆250 · Updated last year
- TensorFlow implementation of x-vector topology on top of Kaldi recipe ☆119 · Updated 5 years ago
- Mel cepstral distortion (MCD) computations in Python. ☆213 · Updated 7 years ago
- INTERSPEECH 2019 Tutorial Materials ☆193 · Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch. ☆182 · Updated 4 years ago
- A PyTorch implementation of x-vector embedding ☆78 · Updated 4 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition ☆126 · Updated 5 years ago
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018) ☆77 · Updated 6 years ago
- Probabilistic Linear Discriminant Analysis & classification, written in Python. ☆127 · Updated 2 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification) ☆98 · Updated 4 years ago
- Discriminative Neural Clustering for Speaker Diarisation ☆78 · Updated 2 years ago
- Implementation of audio degradation processes ☆101 · Updated 8 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR ☆216 · Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆87 · Updated 3 years ago
- TensorFlow implementation of the speech model described in Neural Discrete Representation Learning (a.k.a. VQ-VAE) ☆128 · Updated 6 years ago
- Experiments with RETURNN ☆154 · Updated 2 weeks ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S… ☆390 · Updated last year
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis ☆362 · Updated last year