Stability-AI / stable-audio-2-demo
☆20Updated 9 months ago
Alternatives and similar repositories for stable-audio-2-demo:
Users that are interested in stable-audio-2-demo are comparing it to the libraries listed below
- The demo page of UniAudio☆33Updated last year
- ☆41Updated 6 months ago
- Music production for silent film clips.☆22Updated last week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated last year
- Flexible LoRA Implementation to use with stable-audio-tools☆68Updated 8 months ago
- Pytorch implementation of SoundCTM☆93Updated last month
- Codebase and project page for EDMSound☆34Updated last year
- A comprehensive codebase for training and finetuning Image <> Latent models.☆31Updated 2 months ago
- ☆77Updated 6 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆54Updated 2 weeks ago
- Official source codes of airsep☆36Updated last year
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆84Updated 4 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Updated 8 months ago
- Animatediff implementation. Includes a ControlNet pipeline.☆19Updated last year
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆41Updated 7 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆95Updated 6 months ago
- Controlled audio inpainting using SD-fine tuned model Riffusion in a ControlNet Architecture☆30Updated last year
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…☆183Updated 9 months ago
- YuE with mp3 extend, exllama and GUI☆49Updated 2 months ago
- Official repo for DiscoDiff: Coarse-to-Fine Text-to-Music Latent Diffusion presented at ICASSP 2025☆12Updated last month
- automatic audio labelling with laion-clap☆17Updated 10 months ago
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆29Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆17Updated last month
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆11Updated last year
- fine-tuning MusicGen without prompts to generate music with a specific style☆63Updated last year
- Guide diffusion on ImageBind embedding similarity☆28Updated last year
- ☆13Updated last month
- ☆62Updated 9 months ago
- ☆66Updated last year