liuhuadai / AudioLCMView external linksLinks
PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.
☆13Jun 15, 2024Updated last year
Alternatives and similar repositories for AudioLCM
Users that are interested in AudioLCM are comparing it to the libraries listed below
Sorting:
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34May 25, 2024Updated last year
- Code for paper "Network Bending of Diffusion Models for Audio-Visual Generation" at DAFx 2024☆16Aug 26, 2025Updated 5 months ago
- Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models☆43Mar 3, 2025Updated 11 months ago
- Modeling of nonlinear audio effects with end-to-end deep neural networks - website:☆17May 11, 2020Updated 5 years ago
- The official codebase for Reflected Flow Matching (ICML 2024)☆22Jun 19, 2024Updated last year
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆59Apr 3, 2025Updated 10 months ago
- Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023)☆28Mar 22, 2025Updated 10 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆118May 19, 2025Updated 8 months ago
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 2 years ago
- Video Background Music Generation Using Unpaired Audio-Visual Data☆30Oct 8, 2024Updated last year
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Mar 4, 2025Updated 11 months ago
- ☆68Jul 23, 2023Updated 2 years ago
- official code for CVPR'24 paper Diff-BGM☆71Oct 12, 2024Updated last year
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation☆38Nov 20, 2024Updated last year
- Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024.☆40Jan 17, 2026Updated 3 weeks ago
- A convolutional generative audio synthesis model☆32Jun 17, 2022Updated 3 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- ☆37Jul 4, 2024Updated last year
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆41Mar 10, 2025Updated 11 months ago
- Pytorch implementation of SoundCTM☆100Mar 31, 2025Updated 10 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- This repository provides basic scripts that apply the Impulse Pattern Formulation (IPF) in different programming languages. Thus, it help…☆12Jun 13, 2025Updated 8 months ago
- ☆39Oct 19, 2025Updated 3 months ago
- Recommendation System Using three different approaches Simple Recommendation Using Content based( TF-IDF & Bag of words ), Using KNN and …☆11Jun 27, 2022Updated 3 years ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆45Mar 27, 2024Updated last year
- ☆17May 14, 2025Updated 9 months ago
- ☆14Sep 21, 2022Updated 3 years ago
- The Molecular Dynamics teaching code.☆12Oct 17, 2025Updated 3 months ago
- ☆10Dec 8, 2025Updated 2 months ago
- ☆43Feb 21, 2023Updated 2 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- Dataset Generator for Musical Devices☆17Dec 2, 2025Updated 2 months ago
- Code for ChordSync, a conformer-based audio-to-chord synchroniser☆13Oct 17, 2025Updated 3 months ago
- This is the repository for Learning to Generate Piano Music With Sustain Pedals☆12Nov 23, 2023Updated 2 years ago
- Interactive Performance, Analysis and Visualization of RAVE Latent Spaces via PCA and OSC Integration☆13Jul 15, 2025Updated 6 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…☆14Mar 24, 2025Updated 10 months ago
- Stellenbosch University ZeroSpeech 2019 System☆10Apr 4, 2019Updated 6 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year