yukara-ikemiya/minimal-sqvae

yukara-ikemiya / minimal-sqvae

A minimal PyTorch implementation of the Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony

☆33

Alternatives and similar repositories for minimal-sqvae

Users who are interested in minimal-sqvae are comparing it to the repositories listed below.

yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 | Apr 10, 2025 | Updated last year
sony / diffiner
☆68 | Aug 16, 2023 | Updated 2 years ago
minju0821 / musical_instrument_retrieval
☆29 | Jun 8, 2023 | Updated 3 years ago
sony / sqvae
PyTorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆196 | Jul 20, 2022 | Updated 4 years ago
kyamauchi1023 / PL-BERT-ja
A repository of Japanese Phoneme-Level BERT
☆24 | Dec 16, 2023 | Updated 2 years ago
GallagherCommaJack / modulax
☆18 | Aug 24, 2024 | Updated last year
SonyResearch / VRVQ
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54 | May 1, 2025 | Updated last year
MuyangDu / T5Voice
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28 | Nov 7, 2025 | Updated 8 months ago
sony / bigvsan_eval
Evaluation tool used in the BigVSAN paper
☆14 | Mar 22, 2024 | Updated 2 years ago
opooladz / Preconditioned-Stochastic-Gradient-Descent
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14 | Oct 7, 2024 | Updated last year
kelechi-c / dit_flow
DiT (training + flow matching) in Jax
☆12 | Jan 5, 2025 | Updated last year
fidel-schaposnik / muzero
Tensorflow implementation of MuZero algorithm
☆11 | Aug 23, 2022 | Updated 3 years ago
ituvisionlab / EdVAE
Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"
☆14 | Sep 20, 2024 | Updated last year
maxrmorrison / promonet
Prosody and Pronunciation Modification Network
☆64 | May 5, 2025 | Updated last year
revsic / torch-whisper-guided-vc
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49 | Mar 7, 2023 | Updated 3 years ago
nttrd-mdlab / wearable-seld-dataset
☆10 | Feb 18, 2022 | Updated 4 years ago
maxrmorrison / torbi
Viterbi decoding in PyTorch
☆42 | May 5, 2026 | Updated 2 months ago
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆71 | Nov 10, 2023 | Updated 2 years ago
XZWY / MSLDM
Implementation of Multi-Source Music Generation with Latent Diffusion.
☆29 | Sep 12, 2024 | Updated last year
xcmyz / CLONE
☆20 | Jul 13, 2022 | Updated 4 years ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80 | May 29, 2023 | Updated 3 years ago
archinetai / aligner-pytorch
Sequence alignment methods with helpers for PyTorch.
☆24 | Nov 30, 2022 | Updated 3 years ago
Takaaki-Saeki / ssl_speech_restoration
SelfRemaster: SSL Speech Restoration
☆94 | Jan 5, 2024 | Updated 2 years ago
Wataru-Nakata / miipher
Unofficial implementation of miipher
☆137 | Apr 19, 2024 | Updated 2 years ago
f-dangel / sirfshampoo
[ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)
☆15 | Nov 4, 2024 | Updated last year
yukara-ikemiya / wavefit-pytorch
PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.
☆70 | Jul 13, 2026 | Updated 2 weeks ago
Audio-AGI / dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26 | Mar 27, 2024 | Updated 2 years ago
KinWaiCheuk / slakh_loader
A PyTorch Dataset for Slakh2100
☆10 | Feb 14, 2024 | Updated 2 years ago
maum-ai / phaseaug
ICASSP 2023 Accepted
☆191 | May 6, 2024 | Updated 2 years ago
yukara-ikemiya / friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…
☆218 | Jul 25, 2024 | Updated 2 years ago
tonnetonne814 / PL-Bert-VITS2
VITS2 using Phoneme-Level Japanese BERT
☆14 | Dec 17, 2023 | Updated 2 years ago
yukara-ikemiya / floss-torch
PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind
☆97 | Nov 24, 2025 | Updated 8 months ago
hirokisince1998 / jasj-bibtex
BibTeX style file for the Journal of the Acoustical Society of Japan
☆11 | Jan 24, 2022 | Updated 4 years ago
yukara-ikemiya / minimal-musicgen-for-developers
[PyTorch] Minimal codebase for MusicGen models
☆63 | Jan 7, 2025 | Updated last year
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44 | Sep 19, 2022 | Updated 3 years ago
dlwh / jax_sourceror
Turn jitted jax functions back into python source code
☆23 | Dec 16, 2024 | Updated last year
timeolord / Reinforcement-Learning-Stock-Trader
Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…
☆20 | Jun 24, 2026 | Updated last month
hs-oh-prml / DiffProsody
☆69 | Jul 29, 2023 | Updated 3 years ago
AlanBaade / SyllableLM
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆63 | Jul 1, 2025 | Updated last year