An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆369 · Oct 12, 2021 · Updated 4 years ago
Alternatives and similar repositories for CPC_audio
Users that are interested in CPC_audio are comparing it to the libraries listed below.
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆60 · Oct 19, 2022 · Updated 3 years ago
- A library for speech data augmentation in time-domain ☆684 · Aug 30, 2021 · Updated 4 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning ☆191 · Jan 29, 2020 · Updated 6 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆104 · Nov 26, 2022 · Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit ☆2,538 · Mar 12, 2026 · Updated last week
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020) ☆146 · Aug 5, 2022 · Updated 3 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch. ☆408 · Jul 7, 2021 · Updated 4 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion ☆143 · Sep 1, 2020 · Updated 5 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S… ☆416 · Aug 29, 2023 · Updated 2 years ago
- End-to-end ASR/LM implementation with PyTorch ☆594 · Aug 30, 2021 · Updated 4 years ago
- Dataset for lightly supervised training using the LibriVox audiobook recordings. https://librivox.org/ ☆522 · Jul 11, 2023 · Updated 2 years ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks ☆54 · May 25, 2022 · Updated 3 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆566 · Apr 2, 2023 · Updated 2 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding" ☆238 · Nov 14, 2020 · Updated 5 years ago
- Problem Agnostic Speech Encoder ☆447 · Jul 6, 2023 · Updated 2 years ago
- PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture" ☆116 · Dec 22, 2021 · Updated 4 years ago
- Large, modern dataset for speech recognition ☆721 · Feb 26, 2024 · Updated 2 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T. ☆149 · Aug 25, 2023 · Updated 2 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆97 · Nov 20, 2024 · Updated last year
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks… ☆521 · Mar 1, 2022 · Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion ☆339 · Jul 6, 2023 · Updated 2 years ago
- Moved to https://github.com/k2-fsa/icefall ☆146 · Oct 13, 2022 · Updated 3 years ago
- Official code for Wav2Seq ☆97 · Jul 19, 2022 · Updated 3 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech ☆479 · Apr 5, 2024 · Updated last year
- Official code for Cotatron @ INTERSPEECH 2020 ☆214 · Jul 25, 2024 · Updated last year
- ☆277 · Jan 15, 2021 · Updated 5 years ago
- Non-Autoregressive Predictive Coding ☆51 · Nov 3, 2020 · Updated 5 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR ☆221 · Jan 14, 2021 · Updated 5 years ago
- A repository for benchmarking neural vocoders by their quality and speed. ☆211 · May 30, 2025 · Updated 9 months ago
- Library for Textless Spoken Language Processing ☆555 · Aug 29, 2023 · Updated 2 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis ☆115 · Dec 2, 2020 · Updated 5 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model. ☆144 · Jul 8, 2021 · Updated 4 years ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck ☆699 · Oct 23, 2024 · Updated last year
- Tools for handling multimodal data in machine learning projects. ☆1,121 · Mar 11, 2026 · Updated last week
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆92 · Jun 9, 2022 · Updated 3 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder ☆171 · Jul 25, 2024 · Updated last year
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included. ☆140 · Sep 25, 2024 · Updated last year
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods ☆20 · Jan 12, 2023 · Updated 3 years ago
- ICASSP 2023 Accepted ☆190 · May 6, 2024 · Updated last year