An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆368Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for CPC_audio
Users that are interested in CPC_audio are comparing it to the libraries listed below
Sorting:
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆408Jul 7, 2021Updated 4 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆146Aug 5, 2022Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,530Jun 13, 2025Updated 8 months ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆415Aug 29, 2023Updated 2 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆521Jul 11, 2023Updated 2 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54May 25, 2022Updated 3 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆564Apr 2, 2023Updated 2 years ago
- Problem Agnostic Speech Encoder☆447Jul 6, 2023Updated 2 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- Moved to https://github.com/k2-fsa/icefall☆146Oct 13, 2022Updated 3 years ago
- Large, modern dataset for speech recognition☆721Feb 26, 2024Updated 2 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆339Jul 6, 2023Updated 2 years ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆520Mar 1, 2022Updated 4 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆150Aug 25, 2023Updated 2 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 9 months ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆479Apr 5, 2024Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆96Nov 20, 2024Updated last year
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆114Dec 2, 2020Updated 5 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆140Sep 25, 2024Updated last year
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)☆176Sep 16, 2020Updated 5 years ago
- A system works on singing voice synthesis☆79Jan 11, 2023Updated 3 years ago
- Library for Textless Spoken Language Processing☆555Aug 29, 2023Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆698Oct 23, 2024Updated last year