☆107Oct 16, 2023Updated 2 years ago
Alternatives and similar repositories for jukebox-diffusion
Users that are interested in jukebox-diffusion are comparing it to the libraries listed below
Sorting:
- Codebase and project page for EDMSound☆35Nov 20, 2023Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- A collection of pre-trained audio models, in PyTorch.☆115Jan 27, 2023Updated 3 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Apr 27, 2023Updated 2 years ago
- ☆55Nov 5, 2024Updated last year
- ☆111Jun 18, 2024Updated last year
- WavJourney: Compositional Audio Creation with LLMs☆540Sep 28, 2023Updated 2 years ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆219Apr 14, 2025Updated 10 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Trainer for audio-diffusion-pytorch☆129Jan 13, 2023Updated 3 years ago
- Pytorch implementation of SoundCTM☆100Mar 31, 2025Updated 11 months ago
- ☆67Aug 16, 2023Updated 2 years ago
- music generation with masked transformers!☆350May 16, 2025Updated 9 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆94Jun 2, 2024Updated last year
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Oct 31, 2023Updated 2 years ago
- Dissimilarity Matrix and Sounds from Timbre Space Representation of a Subtractive Synthesizer (Timbre, 2020)☆12Dec 17, 2021Updated 4 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 3 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago
- Examples for ICASSP2024 paper "StemGen: A music generation model that listens"☆35Dec 19, 2023Updated 2 years ago
- ☆36Sep 6, 2025Updated 5 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆115Jun 23, 2025Updated 8 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression☆22Oct 23, 2023Updated 2 years ago
- ☆41May 15, 2023Updated 2 years ago
- Autoencoder Based Real-Time Timbre Interpolation Algorithm☆12Aug 17, 2020Updated 5 years ago
- ☆22Jul 30, 2025Updated 7 months ago
- ☆258Mar 15, 2024Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆186May 29, 2024Updated last year
- Pytorch implementation of BigVSAN☆203Dec 9, 2025Updated 2 months ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆148Jan 15, 2024Updated 2 years ago
- ☆69May 19, 2023Updated 2 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- A differentiable version of SPTK☆193Feb 3, 2026Updated 3 weeks ago
- VoiceLDM: Text-to-Speech with Environmental Context☆191Aug 9, 2024Updated last year