Stability-AI / stable-audio-2-demo
☆13Updated 2 months ago
Related projects: ⓘ
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆21Updated last week
- Pytorch implementation of SoundCTM☆68Updated 3 weeks ago
- ☆54Updated last month
- ☆27Updated this week
- Codebase and project page for EDMSound☆29Updated 10 months ago
- Zero-Shot Emotion Style Transfer☆33Updated 5 months ago
- The demo page of UniAudio☆34Updated 7 months ago
- My vocoder experiments☆20Updated last month
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last month
- ☆130Updated last week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated 5 months ago
- NeMo: a toolkit for conversational AI☆9Updated 2 weeks ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆36Updated last year
- ☆44Updated this week
- official code for CVPR'24 paper Diff-BGM☆38Updated 5 months ago
- Guide diffusion on ImageBind embedding similarity☆27Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆12Updated 7 months ago
- Multispeaker Community Vocoder Model for DiffSinger☆34Updated 4 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆68Updated 2 months ago
- Animatediff implementation. Includes a ControlNet pipeline.☆19Updated 8 months ago
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…☆111Updated last month
- Brand new TTS solution☆8Updated this week
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey".☆79Updated 2 weeks ago
- Flexible LoRA Implementation to use with stable-audio-tools☆37Updated last week
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 7 months ago
- ☆40Updated 2 months ago
- ☆106Updated 11 months ago
- VALL-E 2 reproduction☆72Updated 2 months ago
- Ultimate Vocal Remover with Gradio UI☆41Updated last week
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆26Updated last week