Vector Quantized Autoregressive Predictive Coding (VQ-APC)
☆37Nov 11, 2020Updated 5 years ago
Alternatives and similar repositories for VQ-APC
Users that are interested in VQ-APC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆28Feb 22, 2022Updated 4 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- Meta-learning model agnostic (MAML) implementation for cross-accented ASR☆45Feb 9, 2024Updated 2 years ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- Representation learning for NLP @ JSALT19☆41Oct 31, 2020Updated 5 years ago
- TensorFlow Implementation of CDVAE-VC.☆54Mar 24, 2023Updated 3 years ago
- ☆31Apr 24, 2021Updated 4 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆369Oct 12, 2021Updated 4 years ago
- ☆37Jun 30, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- WildVSR☆21Dec 13, 2023Updated 2 years ago
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Sep 17, 2019Updated 6 years ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 5 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"☆15Feb 27, 2019Updated 7 years ago
- Project page for paper Self-supervised Representation Learning with Relative Predictive Coding☆19Jul 8, 2021Updated 4 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated last year
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Jul 25, 2024Updated last year
- ☆17Nov 25, 2019Updated 6 years ago
- Coqui Inference Engine☆40Aug 3, 2021Updated 4 years ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆66Aug 31, 2018Updated 7 years ago
- Kaldi Speech Processing Tools☆25Nov 16, 2018Updated 7 years ago
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)☆17Oct 12, 2021Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Implementation of WaveNet with Gluon☆16Dec 27, 2018Updated 7 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Mar 4, 2024Updated 2 years ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation☆46Sep 6, 2024Updated last year
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Nov 12, 2019Updated 6 years ago
- ☆36Sep 6, 2025Updated 6 months ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- ☆55Jun 4, 2022Updated 3 years ago