[ICASSP'24] Investigating Personalization Methods in Text to Music Generation
☆45Mar 27, 2024Updated last year
Alternatives and similar repositories for DreamSound
Users that are interested in DreamSound are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Aug 4, 2025Updated 7 months ago
- ☆14Sep 21, 2022Updated 3 years ago
- A Max for Live device based on nn~ for real-time latent interaction and bending in Ableton.☆20Jul 8, 2025Updated 8 months ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆13Jun 15, 2024Updated last year
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆32Mar 14, 2025Updated last year
- Pytorch implementation of SoundCTM☆101Mar 31, 2025Updated 11 months ago
- ☆40Updated this week
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆22Jun 10, 2024Updated last year
- ☆19Aug 11, 2025Updated 7 months ago
- ☆191Nov 19, 2025Updated 4 months ago
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆63Feb 19, 2025Updated last year
- A simple library for Fréchet Audio Distance (FAD) calculation☆250Aug 22, 2025Updated 7 months ago
- Interactive Performance, Analysis and Visualization of RAVE Latent Spaces via PCA and OSC Integration☆13Jul 15, 2025Updated 8 months ago
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆13Jul 18, 2025Updated 8 months ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆15Jan 6, 2025Updated last year
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆38Dec 8, 2022Updated 3 years ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- ☆18May 4, 2025Updated 10 months ago
- Code for "A Principled Framework for Multi-View Contrastive Learning"☆20Jul 10, 2025Updated 8 months ago
- Official implementation of WildFX Dataset Generating pipeline.☆15Oct 21, 2025Updated 5 months ago
- ☆12Nov 7, 2024Updated last year
- ☆21Jul 15, 2024Updated last year
- Official implementation for FlowSep☆70Jan 2, 2025Updated last year
- ☆28Jul 7, 2025Updated 8 months ago
- Copyright-free Artificial Lyrics Dataset (ISMIR 2024 LBD)☆12Sep 1, 2024Updated last year
- AudioLDM training, finetuning, evaluation and inference.☆297Dec 13, 2024Updated last year
- Examples for ICASSP2024 paper "StemGen: A music generation model that listens"☆35Dec 19, 2023Updated 2 years ago
- Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing☆48Aug 1, 2024Updated last year
- ☆44Apr 2, 2025Updated 11 months ago
- Rhythm generator using Variational Autoencoder(VAE)☆40May 19, 2022Updated 3 years ago
- a list of demo websites for automatic music generation research☆776Updated this week
- a notebook containing scripts, documentation, and examples for finetuning musicgen☆99Apr 10, 2024Updated last year
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆59Apr 3, 2025Updated 11 months ago
- Style-based Neural Drum Synthesis with GAN inversion☆32Nov 9, 2021Updated 4 years ago
- Differentiable audio signal processors in PyTorch☆287Dec 4, 2023Updated 2 years ago
- Explaining audio differences using language☆16Feb 11, 2025Updated last year
- Audio production style transfer with inference-time optimization☆49Nov 18, 2024Updated last year