zachary-shah / riff-cnet
Controlled audio inpainting using the SD-fine-tuned model Riffusion in a ControlNet architecture
☆28 · Updated last year
Alternatives and similar repositories for riff-cnet:
Users who are interested in riff-cnet compare it to the repositories listed below.
- Code for Investigating Personalization Methods in Text to Music Generation ☆36 · Updated 9 months ago
- The demo page of UniAudio ☆34 · Updated 11 months ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment ☆66 · Updated 6 months ago
- Official source code of airsep ☆35 · Updated 9 months ago
- ☆62 · Updated 9 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆44 · Updated 4 months ago
- Unsupervised Rhythm Modeling for Voice Conversion ☆80 · Updated last year
- Codebase and project page for EDMSound ☆33 · Updated last year
- PyTorch implementation of SoundCTM ☆75 · Updated last month
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆74 · Updated 3 weeks ago
- AudioSR-Upsampling (any -> 48kHz) ☆38 · Updated 11 months ago
- Unofficial implementation of JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (https://arxiv.org/abs/2308.… ☆52 · Updated last year
- Unofficial download repository for MusicCaps ☆45 · Updated last year
- Robust Singing Voice Transcription and MIDI Extraction ☆66 · Updated 2 months ago
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa… ☆19 · Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model ☆51 · Updated last year
- ☆43 · Updated 7 months ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT ☆39 · Updated last year
- ☆67 · Updated 3 months ago
- "Music Style Transfer with Time-Varying Inversion of Diffusion Models" ☆37 · Updated 5 months ago
- ☆79 · Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion. ☆21 · Updated 4 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu… ☆77 · Updated 4 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers ☆91 · Updated 2 months ago
- Flexible LoRA Implementation to use with stable-audio-tools ☆56 · Updated 4 months ago
- Official Implementation of EnCLAP (ICASSP 2024) ☆90 · Updated 7 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE ☆61 · Updated 9 months ago
- Deep Performer: Score-to-audio music performance synthesis ☆42 · Updated last year
- Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls ☆78 · Updated 6 months ago
- Implementation of Emo-StarGAN ☆46 · Updated last year