jmoso13/jukebox-diffusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jmoso13/jukebox-diffusion)

jmoso13 / jukebox-diffusion

☆107

Alternatives and similar repositories for jukebox-diffusion

Users that are interested in jukebox-diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
AgentCooper2002 / EDMSound
View on GitHub
Codebase and project page for EDMSound
☆35Nov 20, 2023Updated 2 years ago
JTColonel / timbre-interp
View on GitHub
Autoencoder Based Real-Time Timbre Interpolation Algorithm
☆12Aug 17, 2020Updated 5 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
sony / diffiner
View on GitHub
☆68Aug 16, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aisynth / diffmoog
View on GitHub
☆114Jun 18, 2024Updated 2 years ago
bfs18 / armel
View on GitHub
poorman's ar-dit tts
☆45Dec 31, 2025Updated 6 months ago
cyrusasfa / timbre2020-sounds
View on GitHub
Dissimilarity Matrix and Sounds from Timbre Space Representation of a Subtractive Synthesizer (Timbre, 2020)
☆12Dec 17, 2021Updated 4 years ago
sony / diffusion-timbre-transfer
View on GitHub
☆56Nov 5, 2024Updated last year
tamlablinz / RAVE_PCA
View on GitHub
Interactive Performance, Analysis and Visualization of RAVE Latent Spaces via PCA and OSC Integration
☆21Jul 15, 2025Updated last year
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
sony / soundctm
View on GitHub
Pytorch implementation of SoundCTM
☆101Mar 31, 2025Updated last year
adelacvg / ttts
View on GitHub
Train the next generation of TTS systems.
☆169Sep 13, 2024Updated last year
Kinyugo / msanii
View on GitHub
A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.
☆196Apr 27, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Audio-AGI / WavJourney
View on GitHub
WavJourney: Compositional Audio Creation with LLMs
☆542Sep 28, 2023Updated 2 years ago
zceng / LVCNet
View on GitHub
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80Feb 24, 2021Updated 5 years ago
julian-parker / stemgen
View on GitHub
Examples for ICASSP2024 paper "StemGen: A music generation model that listens"
☆35Dec 19, 2023Updated 2 years ago
archinetai / archisound
View on GitHub
A collection of pre-trained audio models, in PyTorch.
☆116Jan 27, 2023Updated 3 years ago
jaeyeonkim99 / EnCLAP
View on GitHub
Official Implementation of EnCLAP (ICASSP 2024)
☆96Jun 2, 2024Updated 2 years ago
glory20h / VoiceLDM
View on GitHub
VoiceLDM: Text-to-Speech with Environmental Context
☆194Aug 9, 2024Updated last year
revsic / torch-nansypp
View on GitHub
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
☆152Feb 11, 2023Updated 3 years ago
ssarfjoo / improvedsegan
View on GitHub
This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…
☆38Mar 24, 2023Updated 3 years ago
DolbyLaboratories / neural-upsampling-artifacts-audio
View on GitHub
Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356
☆81Feb 9, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
voidful / vall-e-encodec
View on GitHub
☆41May 15, 2023Updated 3 years ago
sh-lee-prml / PeriodWave
View on GitHub
The official Implementation of PeriodWave and PeriodWave-Turbo
☆225Apr 14, 2025Updated last year
chomeyama / HN-UnifiedSourceFilterGAN
View on GitHub
☆88Nov 1, 2022Updated 3 years ago
hugofloresgarcia / vampnet
View on GitHub
music generation with masked transformers!
☆357May 16, 2025Updated last year
ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆68Mar 21, 2024Updated 2 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
PlayVoice / BigVGAN
View on GitHub
BigVGAN with Neural Source-Filter
☆58Sep 21, 2023Updated 2 years ago
zjumml / DiffSinger
View on GitHub
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆10Mar 8, 2022Updated 4 years ago
Mddct / transformer-vocos
View on GitHub
☆35Sep 6, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
fluxions-ai / stftvae
View on GitHub
Inference for the STFT-VAE continuous audio codec (24kHz, 3.125Hz latent)
☆43Jul 12, 2026Updated last week
theodorblackbird / lina-speech
View on GitHub
Official implementation of the TTS model Lina-Speech
☆178Jan 9, 2025Updated last year
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 8 months ago
eloimoliner / unconditional-diff-STFT
View on GitHub
Unconditional music synthesis using a diffusion model in the STFT domain
☆12May 31, 2022Updated 4 years ago
sp-nitech / diffsptk
View on GitHub
A differentiable version of SPTK
☆201Jul 14, 2026Updated last week
hcy71o / SNAC
View on GitHub
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆57Aug 7, 2023Updated 2 years ago
ZacharyNovack / Lead-AE
View on GitHub
Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression
☆22Oct 23, 2023Updated 2 years ago