☆107Oct 16, 2023Updated 2 years ago
Alternatives and similar repositories for jukebox-diffusion
Users that are interested in jukebox-diffusion are comparing it to the libraries listed below
Sorting:
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Codebase and project page for EDMSound☆35Nov 20, 2023Updated 2 years ago
- Autoencoder Based Real-Time Timbre Interpolation Algorithm☆12Aug 17, 2020Updated 5 years ago
- ☆19Mar 22, 2024Updated 2 years ago
- ☆55Nov 5, 2024Updated last year
- ☆67Aug 16, 2023Updated 2 years ago
- ☆112Jun 18, 2024Updated last year
- Dissimilarity Matrix and Sounds from Timbre Space Representation of a Subtractive Synthesizer (Timbre, 2020)☆12Dec 17, 2021Updated 4 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Apr 27, 2023Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch.☆116Jan 27, 2023Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- WavJourney: Compositional Audio Creation with LLMs☆541Sep 28, 2023Updated 2 years ago
- Pytorch implementation of SoundCTM☆101Mar 31, 2025Updated 11 months ago
- Examples for ICASSP2024 paper "StemGen: A music generation model that listens"☆35Dec 19, 2023Updated 2 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- Official Implementation of EnCLAP (ICASSP 2024)☆94Jun 2, 2024Updated last year
- The official Implementation of PeriodWave and PeriodWave-Turbo☆220Apr 14, 2025Updated 11 months ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- music generation with masked transformers!☆351May 16, 2025Updated 10 months ago
- ☆41May 15, 2023Updated 2 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago
- ☆88Nov 1, 2022Updated 3 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆38Mar 24, 2023Updated 2 years ago
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated 2 years ago
- ☆36Sep 6, 2025Updated 6 months ago
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- Trainer for audio-diffusion-pytorch☆129Jan 13, 2023Updated 3 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- A differentiable version of SPTK☆196Feb 26, 2026Updated 3 weeks ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Official implementation of the TTS model Lina-Speech☆179Jan 9, 2025Updated last year
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression☆22Oct 23, 2023Updated 2 years ago
- ☆259Mar 15, 2024Updated 2 years ago
- ☆12Mar 11, 2025Updated last year
- The reproduced code for Google's SoundStorm☆272Oct 7, 2023Updated 2 years ago