yukara-ikemiya / minimal-sqvae
A minimal PyTorch implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony
☆33Oct 16, 2023Updated 2 years ago
Alternatives and similar repositories for minimal-sqvae
Users who are interested in minimal-sqvae are comparing it to the libraries listed below.
- ☆66Aug 16, 2023Updated 2 years ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated last year
- Viterbi decoding in PyTorch☆40Sep 10, 2025Updated 5 months ago
- ☆18Aug 24, 2024Updated last year
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆193Jul 20, 2022Updated 3 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- ☆20Jul 13, 2022Updated 3 years ago
- PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.☆63Sep 8, 2025Updated 5 months ago
- Prosody and Pronunciation Modification Network☆62May 5, 2025Updated 9 months ago
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- SelfRemaster: SSL Speech Restoration☆94Jan 5, 2024Updated 2 years ago
- A repository of Japanese Phoneme-Level BERT☆22Dec 16, 2023Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- Unofficial implementation of miipher☆135Apr 19, 2024Updated last year
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Official code repository for NeurIPS 2024 paper "Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery"☆11Jan 8, 2025Updated last year
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆15Aug 1, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 4 months ago
- A Battery Intraday Trading Engine, based on dynamic programming approximations, written in C++, wrapped for Python☆32Feb 5, 2026Updated last week
- ☆68Jul 29, 2023Updated 2 years ago
- ☆26Mar 20, 2024Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆28Sep 12, 2024Updated last year
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 9 months ago
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- Audio tokenization, in the fastest way possible!☆53Aug 26, 2024Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 9 months ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80May 29, 2023Updated 2 years ago
- BibTeX style file for the Journal of the Acoustical Society of Japan☆11Jan 24, 2022Updated 4 years ago
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- Official implementation of Self-Remixing☆17Feb 3, 2024Updated 2 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆122Feb 23, 2025Updated 11 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆150Feb 11, 2023Updated 3 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆59Jul 1, 2025Updated 7 months ago