teticio / audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
☆719 · Updated last month
Related projects
Alternatives and complementary repositories for audio-diffusion
- ☆383 · Updated 6 months ago
- Audio generation using diffusion models, in PyTorch. ☆1,963 · Updated last year
- Implementation of MusicLM, a text to music model published by Google Research, with a few modifications. ☆523 · Updated last year
- Audio Dataset for training CLAP and other models ☆637 · Updated 9 months ago
- simple trainer for musicgen/audiocraft ☆12 · Updated 4 months ago
- Tools to train a generative model on arbitrary audio samples ☆1,080 · Updated 6 months ago
- Fast Infinite Waveform Music Generation ☆662 · Updated 2 years ago
- This toolbox aims to unify audio generation model evaluation for easier comparison. ☆304 · Updated last month
- Contrastive Language-Audio Pretraining ☆1,422 · Updated 4 months ago
- Symphony Generation with Permutation Invariant Language Model ☆250 · Updated 2 years ago
- MIDI / symbolic music tokenizers for Deep Learning models 🎶 ☆692 · Updated this week
- Official PyTorch implementation of BigVGAN (ICLR 2023) ☆895 · Updated 2 months ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23] ☆284 · Updated 7 months ago
- Trainer for audio-diffusion-pytorch ☆127 · Updated last year
- A straightforward collection of Music Generation research resources. ☆574 · Updated last year
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently. ☆194 · Updated last year
- a list of demo websites for automatic music generation research ☆635 · Updated last week
- Mustango: Toward Controllable Text-to-Music Generation ☆342 · Updated 3 months ago
- Symbolic Music Generation with Diffusion Models ☆221 · Updated this week
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/ ☆369 · Updated last year
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training". ☆310 · Updated 7 months ago
- List of academic resources on Multimodal ML for Music ☆283 · Updated last year
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data. ☆327 · Updated last year
- Collection of audio-focused loss functions in PyTorch ☆744 · Updated 3 months ago
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation" ☆349 · Updated last year
- music generation with masked transformers! ☆303 · Updated last week
- Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021) ☆347 · Updated 4 months ago
- State of the Art of Music Generation with Deep Learning and AI ☆276 · Updated last year
- DiffWave is a fast, high-quality neural vocoder and waveform synthesizer. ☆775 · Updated 7 months ago
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch ☆2,444 · Updated last week