Implementation of DiffWave and SaShiMi audio generation models
☆128Apr 4, 2023Updated 2 years ago
Alternatives and similar repositories for diffwave-sashimi
Users that are interested in diffwave-sashimi are comparing it to the libraries listed below
Sorting:
- ☆87May 21, 2023Updated 2 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆91Apr 13, 2021Updated 4 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆159Jul 16, 2022Updated 3 years ago
- DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.☆884Mar 26, 2024Updated last year
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- ☆18Jan 17, 2022Updated 4 years ago
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- Single channel speech source separation by diffusion process (ICASSP 2023)☆124Mar 15, 2024Updated last year
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- ☆25Mar 12, 2022Updated 3 years ago
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆432Apr 19, 2023Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆252Jun 5, 2025Updated 8 months ago
- Official implementation of the source-filter HiFiGAN vocoder☆268Jul 29, 2023Updated 2 years ago
- ☆121Oct 24, 2022Updated 3 years ago
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆91Nov 24, 2025Updated 3 months ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Jul 12, 2023Updated 2 years ago
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- ☆32Jul 27, 2022Updated 3 years ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆135Feb 3, 2025Updated last year
- ☆46Apr 16, 2023Updated 2 years ago
- A differentiable version of SPTK☆193Feb 3, 2026Updated 3 weeks ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆91Jun 9, 2022Updated 3 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Apr 27, 2023Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Jun 17, 2022Updated 3 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139May 8, 2022Updated 3 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Mar 14, 2023Updated 2 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- Collection of audio-focused loss functions in PyTorch☆851Jul 30, 2024Updated last year
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/☆405May 30, 2023Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year