Implementation of DiffWave and SaShiMi audio generation models
☆128Apr 4, 2023Updated 2 years ago
Alternatives and similar repositories for diffwave-sashimi
Users that are interested in diffwave-sashimi are comparing it to the libraries listed below
Sorting:
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆90Apr 13, 2021Updated 4 years ago
- DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.☆887Mar 26, 2024Updated last year
- ☆87May 21, 2023Updated 2 years ago
- ☆18Jan 17, 2022Updated 4 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆432Apr 19, 2023Updated 2 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆159Jul 16, 2022Updated 3 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- Temporary anonymous version☆22Mar 20, 2024Updated 2 years ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆126Mar 15, 2024Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Pytorch Reimplementation of DiffWave unconditional generation: a high quality waveform synthesizer.☆43Apr 13, 2021Updated 4 years ago
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆252Jun 5, 2025Updated 9 months ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆136Feb 3, 2025Updated last year
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Mar 14, 2023Updated 3 years ago
- A differentiable version of SPTK☆196Feb 26, 2026Updated 3 weeks ago
- ☆12Mar 11, 2025Updated last year
- ☆77Feb 19, 2026Updated last month
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/☆408May 30, 2023Updated 2 years ago
- efficient neural audio synthesis in the waveform domain☆191Apr 14, 2025Updated 11 months ago
- Official implementation of the source-filter HiFiGAN vocoder☆270Jul 29, 2023Updated 2 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Jun 24, 2025Updated 8 months ago
- ☆122Oct 24, 2022Updated 3 years ago
- Collection of audio-focused loss functions in PyTorch☆855Jul 30, 2024Updated last year
- ☆37May 8, 2021Updated 4 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- Audio generation using diffusion models, in PyTorch.☆2,096Jun 12, 2023Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)☆21Jul 6, 2023Updated 2 years ago
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper.☆39Jul 8, 2024Updated last year
- Implementation of Differentiable Digital Signal Processing (DDSP) in Pytorch☆508Oct 28, 2023Updated 2 years ago
- Trainer for audio-diffusion-pytorch☆129Jan 13, 2023Updated 3 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago