PyTorch implementation of [AudioLCM]: efficient and high-quality text-to-audio generation with a latent consistency model.
☆13Jun 15, 2024Updated last year
Alternatives and similar repositories for AudioLCM
Users that are interested in AudioLCM are comparing it to the libraries listed below
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34May 25, 2024Updated last year
- Code for paper "Network Bending of Diffusion Models for Audio-Visual Generation" at DAFx 2024☆17Aug 26, 2025Updated 6 months ago
- Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models☆43Mar 3, 2025Updated last year
- Modeling of nonlinear audio effects with end-to-end deep neural networks - website:☆17May 11, 2020Updated 5 years ago
- The official codebase for Reflected Flow Matching (ICML 2024)☆22Jun 19, 2024Updated last year
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆59Apr 3, 2025Updated 11 months ago
- Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023)☆28Mar 22, 2025Updated 11 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆119May 19, 2025Updated 9 months ago
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 2 years ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Mar 4, 2025Updated last year
- Video Background Music Generation Using Unpaired Audio-Visual Data☆30Oct 8, 2024Updated last year
- ☆68Jul 23, 2023Updated 2 years ago
- official code for CVPR'24 paper Diff-BGM☆71Oct 12, 2024Updated last year
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation☆39Nov 20, 2024Updated last year
- Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024.☆40Jan 17, 2026Updated last month
- A convolutional generative audio synthesis model☆32Jun 17, 2022Updated 3 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- ☆37Jul 4, 2024Updated last year
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆41Mar 10, 2025Updated 11 months ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆45Mar 27, 2024Updated last year
- ☆41Oct 19, 2025Updated 4 months ago
- This repository provides basic scripts that apply the Impulse Pattern Formulation (IPF) in different programming languages. Thus, it help…☆12Jun 13, 2025Updated 8 months ago
- Pytorch implementation of SoundCTM☆101Mar 31, 2025Updated 11 months ago
- ☆10Dec 8, 2025Updated 2 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- Recommendation System Using three different approaches Simple Recommendation Using Content based( TF-IDF & Bag of words ), Using KNN and …☆11Jun 27, 2022Updated 3 years ago
- ☆17May 14, 2025Updated 9 months ago
- ☆14Sep 21, 2022Updated 3 years ago
- The Molecular Dynamics teaching code.☆12Oct 17, 2025Updated 4 months ago
- ☆43Feb 21, 2023Updated 3 years ago
- Implementation of the paper "Exploiting Time-Frequency Conformers for Music Audio Enhancement"☆12Mar 21, 2025Updated 11 months ago
- Robust and ready-to-use tasks and workflows for a variety of bioinformatics pipelines. Use Flyte and Union to orchestrate anything from v…☆12Apr 2, 2025Updated 11 months ago
- ☆11Dec 16, 2024Updated last year
- Sound2Synth Plug-Ins☆13Jul 28, 2022Updated 3 years ago
- Speech synthesis service☆12Mar 18, 2023Updated 2 years ago
- ☆12Jun 9, 2025Updated 8 months ago
- BachDuet enables a human performer to improvise a duet counterpoint with a computer agent in real time.☆14Aug 8, 2022Updated 3 years ago
- Graph-based neural tactic prediction models for Coq.☆15Sep 17, 2025Updated 5 months ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year