Stability-AI / stable-audio-2-demo
☆22Updated 10 months ago
Alternatives and similar repositories for stable-audio-2-demo
Users who are interested in stable-audio-2-demo are comparing it to the repositories listed below
- ☆44Updated 7 months ago
- Music production for silent film clips.☆25Updated last month
- Codebase and project page for EDMSound☆34Updated last year
- PyTorch implementation of SoundCTM☆96Updated 2 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆24Updated 3 weeks ago
- Flexible LoRA Implementation to use with stable-audio-tools☆72Updated 8 months ago
- The demo page of UniAudio☆33Updated last year
- ☆78Updated 7 months ago
- Official repo for DiscoDiff: Coarse-to-Fine Text-to-Music Latent Diffusion presented at ICASSP 2025☆12Updated last month
- ☆15Updated 2 months ago
- ☆170Updated 5 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆98Updated 2 weeks ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆58Updated last month
- An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io☆15Updated last year
- A text-conditional diffusion probabilistic model capable of generating high-fidelity audio.☆164Updated last year
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆41Updated 8 months ago
- XMIDI Dataset: A large-scale symbolic music dataset with emotion and genre labels.☆22Updated 4 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆90Updated 5 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆38Updated last week
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆46Updated 8 months ago
- Official code for the CVPR'24 paper Diff-BGM☆63Updated 7 months ago
- Controlled audio inpainting using the SD-fine-tuned model Riffusion in a ControlNet architecture☆31Updated 2 years ago
- Guide diffusion on ImageBind embedding similarity☆29Updated 2 years ago
- Official implementation for FlowSep☆50Updated 5 months ago
- Code for paper "Network Bending of Diffusion Models for Audio-Visual Generation" at DAFx 2024☆15Updated 11 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆24Updated 8 months ago
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…☆190Updated 10 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆73Updated 8 months ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆109Updated 4 months ago
- Real-time end-to-end singing voice conversion☆22Updated 7 months ago