albertfgu / diffwave-sashimi
Implementation of DiffWave and SaShiMi audio generation models
☆121Updated last year
Alternatives and similar repositories for diffwave-sashimi:
Users that are interested in diffwave-sashimi are comparing it to the libraries listed below
- PyTorch Dataset for Speech and Music audio☆73Updated 8 months ago
- A collection of audio autoencoders, in PyTorch.☆40Updated 2 years ago
- Fast and differentiable time domain all-pole filter in PyTorch.☆57Updated 2 months ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆114Updated 2 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆85Updated last year
- Pitch Estimating Neural Networks (PENN)☆247Updated 7 months ago
- PyTorch wrappers for using your model in audacity!☆175Updated last year
- Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)☆156Updated 4 years ago
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆90Updated last month
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆64Updated last year
- ☆66Updated 3 weeks ago
- efficient neural audio synthesis in the waveform domain☆185Updated 3 years ago
- A DDSP-based neural voice synthesiser.☆114Updated 4 months ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- Audiogen Codec☆130Updated 8 months ago
- A repository for benchmarking neural vocoders by their quality and speed.☆208Updated last week
- Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆61Updated 4 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆95Updated 8 months ago
- Reproducible Subjective Evaluation☆59Updated last year
- ☆83Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆85Updated 5 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆63Updated 2 weeks ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆58Updated 2 years ago
- Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)☆80Updated 3 months ago
- A reimplementation of NSynth in PyTorch.☆15Updated 5 years ago
- Deep Performer: Score-to-audio music performance synthesis☆43Updated last year
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆49Updated last year
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆88Updated 3 years ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆54Updated 2 years ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpo…☆108Updated last year