albertfgu / diffwave-sashimi
Implementation of DiffWave and SaShiMi audio generation models
☆112Updated last year
Related projects: ⓘ
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆77Updated last year
- PyTorch Dataset for Speech and Music audio☆73Updated 2 months ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆101Updated last year
- ☆49Updated last week
- Source code for "FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control"☆138Updated 5 months ago
- Pitch Estimating Neural Networks (PENN)☆227Updated last month
- Audiogen Codec☆116Updated 2 months ago
- A collection of audio autoencoders, in PyTorch.☆37Updated last year
- A DDSP-based neural voice synthesiser.☆95Updated last week
- ☆62Updated 3 weeks ago
- A lightweight library for Frechet Audio Distance calculation.☆230Updated 2 weeks ago
- A collection of useful audio datasets and transforms for PyTorch.☆130Updated last year
- A simple library for Fréchet Audio Distance (FAD) calculation☆137Updated 2 weeks ago
- A reimplementation of NSynth in PyTorch.☆13Updated 4 years ago
- PyTorch wrappers for using your model in audacity!☆172Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆81Updated last month
- Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)☆103Updated last year
- ☆78Updated last year
- PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model☆69Updated 9 months ago
- Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)☆149Updated 3 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆56Updated last year
- A repository for benchmarking neural vocoders by their quality and speed.☆201Updated 3 weeks ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆86Updated 3 years ago
- efficient neural audio synthesis in the waveform domain☆185Updated 3 years ago
- Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)☆76Updated 2 years ago
- Simple package for binding functions to CLI or config files.☆43Updated last month
- ☆71Updated last year
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆84Updated 11 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 2 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆109Updated 8 months ago