☆45Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for latent_quantization
Users who are interested in latent_quantization are comparing it to the libraries listed below
- ☆17Mar 2, 2023Updated 3 years ago
- Official PyTorch implementation of the paper InfoNet: Neural Estimation of Mutual Information without Test-Time Optimization☆21Jul 10, 2024Updated last year
- ☆12Apr 19, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- Behavioral probing of language acquisition models at the lexical and syntactic level☆17Jul 17, 2023Updated 2 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆15Aug 1, 2024Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- ☆13Sep 13, 2023Updated 2 years ago
- A PyTorch reimplementation of Local Implicit Grid Representations for 3D Scenes☆17Jul 21, 2021Updated 4 years ago
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆38Feb 24, 2025Updated last year
- [NeurIPS 2023] Learning Energy-Based Prior Model with Diffusion-Amortized MCMC☆13Oct 21, 2023Updated 2 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- A spoken version of the textual story cloze benchmark☆20Aug 6, 2023Updated 2 years ago
- Update: Ignore this repo, check out @lucidrains' implementation https://github.com/lucidrains/musiclm-pytorch☆15Jan 27, 2023Updated 3 years ago
- The Multi-band Excited WaveNet☆15Feb 2, 2023Updated 3 years ago
- ☆16Dec 12, 2023Updated 2 years ago
- A Chinese version of A Neural Parametric Singing Synthesizer☆13Feb 12, 2022Updated 4 years ago
- A neural vocoder supporting Ring Attention, Conformer, and NSF.☆24Aug 1, 2025Updated 7 months ago
- This is the code for the paper Jacobian-based Causal Discovery with Nonlinear ICA, demonstrating how identifiable representations (partic…☆22Sep 5, 2024Updated last year
- This repository contains the code for our CVPR 2022 paper on "Non-isotropy Regularization for Proxy-based Deep Metric Learning".☆15Mar 10, 2023Updated 2 years ago
- ☆13Sep 12, 2024Updated last year
- ☆40Jan 24, 2023Updated 3 years ago
- Source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- Search3D: Hierarchical Open-Vocabulary 3D Segmentation☆21May 20, 2025Updated 9 months ago
- ☆43May 3, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year
- ☆18Jun 26, 2023Updated 2 years ago
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 8 months ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Feb 9, 2025Updated last year
- Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization☆20Dec 1, 2025Updated 3 months ago
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated last year
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Jul 24, 2024Updated last year
- Official PyTorch implementation for paper: Energy-Based Sliced Wasserstein Distance☆18Feb 21, 2025Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- ☆23Mar 21, 2023Updated 2 years ago
- ☆19Feb 2, 2023Updated 3 years ago