teticio / audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
☆743Updated 5 months ago
Alternatives and similar repositories for audio-diffusion:
Users that are interested in audio-diffusion are comparing it to the libraries listed below
- Audio generation using diffusion models, in PyTorch.☆2,017Updated last year
- Official PyTorch implementation of BigVGAN (ICLR 2023)☆961Updated 5 months ago
- Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.☆536Updated last year
- Audio Dataset for training CLAP and other models☆668Updated last year
- ☆390Updated last month
- Tools to train a generative model on arbitrary audio samples☆1,092Updated 10 months ago
- This toolbox aims to unify audio generation model evaluation for easier comparison.☆320Updated 5 months ago
- Contrastive Language-Audio Pretraining☆1,543Updated 3 months ago
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/☆381Updated last year
- simple trainer for musicgen/audiocraft☆20Updated 7 months ago
- Fast Infinite Waveform Music Generation☆670Updated 2 years ago
- Collection of audio-focused loss functions in PyTorch☆765Updated 7 months ago
- MIDI / symbolic music tokenizers for Deep Learning models 🎶☆743Updated last week
- Learning audio concepts from natural language supervision☆528Updated 5 months ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆306Updated 10 months ago
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.☆345Updated last year
- Trainer for audio-diffusion-pytorch☆128Updated 2 years ago
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.☆1,309Updated 7 months ago
- music generation with masked transformers!☆320Updated last week
- a list of demo websites for automatic music generation research☆663Updated this week
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆353Updated 10 months ago
- Symphony Generation with Permutation Invariant Language Model☆254Updated 2 years ago
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch☆2,506Updated last month
- Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)☆307Updated 2 years ago
- Symbolic Music Generation with Diffusion Models☆233Updated last month
- DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.☆807Updated 11 months ago
- Mustango: Toward Controllable Text-to-Music Generation☆354Updated 7 months ago
- WavJourney: Compositional Audio Creation with LLMs☆531Updated last year
- AudioLDM training, finetuning, evaluation and inference.☆234Updated 2 months ago
- List of academic resources on Multimodal ML for Music☆289Updated last year