zachary-shah / riff-cnet
Controlled audio inpainting using SD-fine tuned model Riffusion in a ControlNet Architecture
☆28Updated last year
Alternatives and similar repositories for riff-cnet:
Users that are interested in riff-cnet are comparing it to the libraries listed below
- Code for Investigating Personalization Methods in Text to Music Generation☆36Updated 10 months ago
- Unofficial download repository for MusicCaps☆45Updated last year
- Official source codes of airsep☆36Updated 10 months ago
- Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.…☆52Updated last year
- ☆63Updated 10 months ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT☆45Updated last year
- Codebase and project page for EDMSound☆34Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆38Updated last year
- [PyTorch] Minimal codebase for MusicGen models☆56Updated last month
- " Music Style Transfer with Time-Varying Inversion of Diffusion Models"☆40Updated 6 months ago
- Pytorch implementation of SoundCTM☆82Updated last week
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆76Updated last month
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆40Updated last year
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆66Updated 7 months ago
- ☆39Updated 3 months ago
- ☆71Updated 4 months ago
- ☆41Updated last year
- Project for MIDI to Audio Synthesis☆22Updated last year
- ☆43Updated 8 months ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆114Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆46Updated 5 months ago
- Robust Singing Voice Transcription and MIDI Extraction☆69Updated 3 months ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆48Updated 4 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆41Updated 4 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆36Updated 8 months ago
- A collection of audio autoencoders, in PyTorch.☆39Updated last year
- Deep Performer: Score-to-audio music performance synthesis☆43Updated last year
- The demo page of UniAudio☆34Updated last year