iamyuanchung / VQ-APC
Vector Quantized Autoregressive Predictive Coding (VQ-APC)
☆37 · Nov 11, 2020 · Updated 5 years ago
Alternatives and similar repositories for VQ-APC
Users who are interested in VQ-APC are comparing it to the libraries listed below.
- Non-Autoregressive Predictive Coding ☆51 · Nov 3, 2020 · Updated 5 years ago
- Official code for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech" ☆27 · Feb 22, 2022 · Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning ☆189 · Jan 29, 2020 · Updated 6 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train… ☆14 · Feb 15, 2021 · Updated 5 years ago
- Transformer-based visually grounded speech models ☆19 · Sep 22, 2022 · Updated 3 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation ☆39 · Jul 16, 2020 · Updated 5 years ago
- Representation learning for NLP @ JSALT19 ☆40 · Oct 31, 2020 · Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆104 · Nov 26, 2022 · Updated 3 years ago
- Video cut powered by AI ☆24 · Nov 15, 2022 · Updated 3 years ago
- Model-agnostic meta-learning (MAML) implementation for cross-accented ASR ☆45 · Feb 9, 2024 · Updated 2 years ago
- JAX implementation of Graph Attention Networks ☆13 · Jan 29, 2022 · Updated 4 years ago
- Recurrent Neural Aligner ☆51 · Apr 14, 2020 · Updated 5 years ago
- TensorFlow Implementation of CDVAE-VC. ☆54 · Mar 24, 2023 · Updated 2 years ago
- WildVSR ☆21 · Dec 13, 2023 · Updated 2 years ago
- An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆367 · Oct 12, 2021 · Updated 4 years ago
- ☆37 · Jun 30, 2022 · Updated 3 years ago
- PyTorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp… ☆28 · Sep 17, 2019 · Updated 6 years ago
- TensorFlow Implementation of "Theory and Experiments on Vector Quantized Autoencoders" ☆15 · Feb 27, 2019 · Updated 6 years ago
- ICLR 2022 paper ☆16 · May 6, 2022 · Updated 3 years ago
- ☆31 · Apr 24, 2021 · Updated 4 years ago
- A PyTorch implementation of MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network ☆61 · Sep 24, 2021 · Updated 4 years ago
- Coqui Inference Engine ☆40 · Aug 3, 2021 · Updated 4 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA. ☆67 · Jan 7, 2026 · Updated last month
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024) ☆20 · Mar 17, 2025 · Updated 10 months ago
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021) ☆17 · Jun 4, 2021 · Updated 4 years ago
- This repository implements the latent optimization using automatic differentiation from the paper LOGAN. ☆12 · Apr 10, 2020 · Updated 5 years ago
- ☆17 · Nov 25, 2019 · Updated 6 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion ☆143 · Sep 1, 2020 · Updated 5 years ago
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021) ☆17 · Oct 12, 2021 · Updated 4 years ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation ☆45 · Sep 6, 2024 · Updated last year
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features. ☆39 · Mar 4, 2024 · Updated last year
- Multilingual speech aligner ☆76 · Nov 19, 2023 · Updated 2 years ago
- Moved to https://github.com/k2-fsa/icefall ☆146 · Oct 13, 2022 · Updated 3 years ago
- ☆18 · May 15, 2021 · Updated 4 years ago
- ☆24 · Jun 13, 2022 · Updated 3 years ago
- Implementation of WaveNet with Gluon ☆16 · Dec 27, 2018 · Updated 7 years ago
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020) ☆72 · Oct 20, 2020 · Updated 5 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆90 · Jun 9, 2022 · Updated 3 years ago
- ☆22 · Nov 19, 2024 · Updated last year