serp-ai / ai-text-to-audio-latent-diffusion
text-to-audio-latent-diffusion
☆37 · Updated last year
Alternatives and similar repositories for ai-text-to-audio-latent-diffusion:
Users who are interested in ai-text-to-audio-latent-diffusion are comparing it to the libraries listed below.
- Code for Investigating Personalization Methods in Text to Music Generation ☆37 · Updated last year
- Trainer for audio-diffusion-pytorch ☆129 · Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch. ☆113 · Updated 2 years ago
- Audio generation using diffusion models, in PyTorch. ☆47 · Updated last year
- The demo page of UniAudio ☆33 · Updated last year
- ☆40 · Updated 5 months ago
- ☆107 · Updated last year
- AudioLDM text to audio colab ☆19 · Updated last year
- music generation with perceiver-ar model ☆26 · Updated 2 years ago
- ☆76 · Updated 6 months ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model" ☆14 · Updated 2 years ago
- [DEPRECATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti… ☆31 · Updated last year
- ☆11 · Updated last year
- Flexible LoRA Implementation to use with stable-audio-tools ☆67 · Updated 7 months ago
- Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1… ☆29 · Updated last year
- Song Describer is a data collection platform for annotating music with textual descriptions. ☆57 · Updated 4 months ago
- ☆84 · Updated last year
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23] ☆113 · Updated last year
- ☆65 · Updated last year
- fine-tuning MusicGen without prompts to generate music with a specific style ☆63 · Updated last year
- tools to manipulate audio with riffusion ☆93 · Updated last year
- Fork of AudioLDM as a TuneFlow plugin ☆40 · Updated 2 years ago
- Codebase and project page for EDMSound ☆34 · Updated last year
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently. ☆196 · Updated last year
- alchemy with embeddings ☆34 · Updated last year
- Controlled audio inpainting using SD-fine tuned model Riffusion in a ControlNet Architecture ☆28 · Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger. ☆35 · Updated 2 years ago
- Official source codes of airsep ☆36 · Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆57 · Updated last year
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music ☆25 · Updated last year