serp-ai / ai-text-to-audio-latent-diffusion
text-to-audio-latent-diffusion
☆37 Updated last year
Alternatives and similar repositories for ai-text-to-audio-latent-diffusion
Users who are interested in ai-text-to-audio-latent-diffusion are comparing it to the libraries listed below.
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation ☆38 Updated last year
- Fine-tuning MusicGen without prompts to generate music with a specific style ☆64 Updated last year
- ☆11 Updated last year
- ☆11 Updated last year
- AudioLDM text to audio colab ☆18 Updated last year
- Codebase and project page for EDMSound ☆34 Updated last year
- ☆107 Updated last year
- A collection of pre-trained audio models, in PyTorch. ☆113 Updated 2 years ago
- Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1… ☆30 Updated last year
- ☆66 Updated last year
- ☆44 Updated 7 months ago
- Use VITS and Opencpop to develop singing voice synthesis; different from VISinger. ☆35 Updated 2 years ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆24 Updated 3 weeks ago
- Flexible LoRA Implementation to use with stable-audio-tools ☆72 Updated 8 months ago
- ☆170 Updated 5 months ago
- The demo page of UniAudio ☆33 Updated last year
- Trainer for audio-diffusion-pytorch ☆129 Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model" ☆15 Updated 2 years ago
- ☆8 Updated 9 months ago
- Audio generation using diffusion models, in PyTorch. ☆46 Updated last year
- Real-time end-to-end singing voice conversion ☆22 Updated 7 months ago
- Controlled audio inpainting using SD-fine tuned model Riffusion in a ControlNet Architecture ☆31 Updated 2 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆31 Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆59 Updated last year
- AudioSR-Upsampling (any -> 48kHz) ☆41 Updated last year
- Official source code of airsep ☆36 Updated last year
- Create training data for training a voice cloner for bark text to speech. ☆45 Updated last year
- ☆62 Updated 10 months ago
- Open soundstream-ish VAE codecs for downstream neural audio synthesis ☆117 Updated last year
- Deep Performer: Score-to-audio music performance synthesis ☆43 Updated last year