serp-ai / ai-text-to-audio-latent-diffusionLinks
text-to-audio-latent-diffusion
☆37Updated 2 years ago
Alternatives and similar repositories for ai-text-to-audio-latent-diffusion
Users that are interested in ai-text-to-audio-latent-diffusion are comparing it to the libraries listed below
Sorting:
- ☆107Updated 2 years ago
- ☆11Updated last year
- A collection of pre-trained audio models, in PyTorch.☆114Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆44Updated last year
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆32Updated last year
- Trainer for audio-diffusion-pytorch☆130Updated 2 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆195Updated 2 years ago
- Fork of AudioLDM as a TuneFlow plugin☆43Updated 2 years ago
- ☆184Updated last month
- ☆62Updated last year
- ☆87Updated 2 years ago
- fine-tuning MusicGen without prompts to generate music with a specific style☆67Updated 2 years ago
- ☆12Updated 2 years ago
- ☆51Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆60Updated last year
- Real-time end-to-end singing voice convertion☆22Updated last year
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 3 years ago
- Codebase and project page for EDMSound☆35Updated 2 years ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆36Updated 2 months ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆164Updated 2 years ago
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆120Updated 2 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆60Updated last year
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆88Updated last year
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆114Updated 2 years ago
- Download the MusicCaps dataset for music captioning☆112Updated 10 months ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆27Updated 2 years ago
- Sing an idea ➡️ AI music sample🔥🎶☆119Updated last year
- ☆178Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.☆48Updated 2 years ago