serp-ai / ai-text-to-audio-latent-diffusionLinks
text-to-audio-latent-diffusion
☆37Updated 2 years ago
Alternatives and similar repositories for ai-text-to-audio-latent-diffusion
Users that are interested in ai-text-to-audio-latent-diffusion are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- ☆107Updated last year
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆87Updated 8 months ago
- A collection of pre-trained audio models, in PyTorch.☆114Updated 2 years ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆40Updated last year
- fine-tuning MusicGen without prompts to generate music with a specific style☆66Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆59Updated last year
- ☆62Updated last year
- ☆11Updated last year
- ☆181Updated 7 months ago
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆29Updated last month
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆31Updated last year
- Codebase and project page for EDMSound☆34Updated last year
- Fork of AudioLDM as a TuneFlow plugin☆41Updated 2 years ago
- Trainer for audio-diffusion-pytorch☆129Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- Real-time end-to-end singing voice convertion☆22Updated 9 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆30Updated this week
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Updated 2 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- ☆51Updated 9 months ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆157Updated last year
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…☆200Updated last year
- Audio generation using diffusion models, in PyTorch.☆49Updated last year
- Sing an idea ➡️ AI music sample🔥🎶☆116Updated last year
- ☆85Updated 2 years ago
- Your one-stop solution for voice dataset creation☆122Updated last year
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆94Updated 11 months ago
- Flexible LoRA Implementation to use with stable-audio-tools☆74Updated 11 months ago
- Official source codes of airsep☆37Updated last year