[ICASSP'24] Investigating Personalization Methods in Text to Music Generation
☆47Mar 27, 2024Updated 2 years ago
Alternatives and similar repositories for DreamSound
Users that are interested in DreamSound are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Aug 4, 2025Updated 8 months ago
- ☆14Sep 21, 2022Updated 3 years ago
- A Max for Live device based on nn~ for real-time latent interaction and bending in Ableton.☆20Jul 8, 2025Updated 9 months ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆13Jun 15, 2024Updated last year
- Pytorch implementation of SoundCTM☆101Mar 31, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆35Mar 14, 2025Updated last year
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆22Jun 10, 2024Updated last year
- ☆191Nov 19, 2025Updated 5 months ago
- ☆20Aug 11, 2025Updated 8 months ago
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆67Feb 19, 2025Updated last year
- ☆44Mar 17, 2026Updated last month
- A simple library for Fréchet Audio Distance (FAD) calculation☆260Aug 22, 2025Updated 8 months ago
- Interactive Performance, Analysis and Visualization of RAVE Latent Spaces via PCA and OSC Integration☆13Jul 15, 2025Updated 9 months ago
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆13Jul 18, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆15Jan 6, 2025Updated last year
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆38Dec 8, 2022Updated 3 years ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- ☆18May 4, 2025Updated 11 months ago
- Code for "A Principled Framework for Multi-View Contrastive Learning"☆20Jul 10, 2025Updated 9 months ago
- ☆21Jul 15, 2024Updated last year
- ☆12Nov 7, 2024Updated last year
- Official implementation for FlowSep☆74Jan 2, 2025Updated last year
- ☆29Jul 7, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Copyright-free Artificial Lyrics Dataset (ISMIR 2024 LBD)☆12Sep 1, 2024Updated last year
- Examples for ICASSP2024 paper "StemGen: A music generation model that listens"☆35Dec 19, 2023Updated 2 years ago
- Official implementation of WildFX Dataset Generating pipeline.☆17Oct 21, 2025Updated 6 months ago
- AudioLDM training, finetuning, evaluation and inference.☆300Dec 13, 2024Updated last year
- Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing☆48Aug 1, 2024Updated last year
- ☆45Apr 2, 2025Updated last year
- Rhythm generator using Variational Autoencoder(VAE)☆41May 19, 2022Updated 3 years ago
- a list of demo websites for automatic music generation research☆779Apr 22, 2026Updated last week
- a notebook containing scripts, documentation, and examples for finetuning musicgen☆100Apr 10, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆60Apr 3, 2025Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Style-based Neural Drum Synthesis with GAN inversion☆32Nov 9, 2021Updated 4 years ago
- Differentiable audio signal processors in PyTorch☆291Dec 4, 2023Updated 2 years ago
- Audio production style transfer with inference-time optimization☆51Nov 18, 2024Updated last year
- The hybrid architecture is based on the idea that we could simply apply a GAN method (GANSpace) to another GAN model (GANSynth).☆25Aug 16, 2021Updated 4 years ago