liuhuadai / AudioLCM
PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.
☆10Updated 8 months ago
Alternatives and similar repositories for AudioLCM:
Users that are interested in AudioLCM are comparing it to the libraries listed below
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆40Updated 4 months ago
- Codebase and project page for EDMSound☆34Updated last year
- Project for MIDI to Audio Synthesis☆22Updated last year
- A spoken version of the textual story cloze benchmark☆14Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆22Updated 5 months ago
- Video Background Music Generation Using Unpaired Audio-Visual Data☆23Updated 4 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- ☆37Updated 8 months ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆46Updated 4 months ago
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆17Updated last month
- ☆39Updated 3 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆44Updated 5 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆53Updated 3 weeks ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆76Updated last month
- ☆9Updated 8 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆23Updated 9 months ago
- Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023)☆27Updated 11 months ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆36Updated last year
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆33Updated 2 months ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆40Updated last year
- music semantic understanding evaluation benchmark☆25Updated last year
- ☆43Updated last year
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆12Updated 5 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆37Updated 5 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆46Updated last week
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆79Updated 5 months ago
- Code for Investigating Personalization Methods in Text to Music Generation☆36Updated 10 months ago