albertfgu/diffwave-sashimi

Implementation of DiffWave and SaShiMi audio generation models

☆128

Alternatives and similar repositories for diffwave-sashimi

Users who are interested in diffwave-sashimi compare it to the libraries listed below.

philsyn / DiffWave-Vocoder
PyTorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.
☆90 · Apr 13, 2021 · Updated 5 years ago
lmnt-com / diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
☆885 · Mar 26, 2024 · Updated 2 years ago
iamycy / diffwave-sr
☆87 · May 21, 2023 · Updated 3 years ago
roholazandie / ryan-tts
☆18 · Jan 17, 2022 · Updated 4 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
☆16 · Apr 4, 2022 · Updated 4 years ago
Rongjiehuang / ProDiff
PyTorch Implementation of ProDiff (ACM-MM'22) with an Extremely-Fast diffusion speech synthesis pipeline
☆432 · Apr 19, 2023 · Updated 3 years ago
zceng / LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80 · Feb 24, 2021 · Updated 5 years ago
rishikksh20 / HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
☆160 · Jul 16, 2022 · Updated 4 years ago
rishikksh20 / NU-Wave2-pytorch
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25 · Jul 5, 2022 · Updated 4 years ago
philsyn / DiffWave-unconditional
PyTorch Reimplementation of DiffWave unconditional generation: a high quality waveform synthesizer.
☆43 · Apr 13, 2021 · Updated 5 years ago
NeuroWave-ai / CUCVAE-TTS
☆25 · Mar 12, 2022 · Updated 4 years ago
fakufaku / diffusion-separation
Single channel speech source separation by diffusion process (ICASSP 2023)
☆126 · Mar 15, 2024 · Updated 2 years ago
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22 · Mar 20, 2024 · Updated 2 years ago
eloimoliner / unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
☆12 · May 31, 2022 · Updated 4 years ago
sh-lee97 / grafx
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
☆139 · Jun 29, 2026 · Updated 3 weeks ago
maum-ai / phaseaug
ICASSP 2023 Accepted
☆191 · May 6, 2024 · Updated 2 years ago
brentspell / torch-yin
Yin pitch estimator in PyTorch
☆119 · Nov 7, 2022 · Updated 3 years ago
TEAMuP-dev / audacitorch
PyTorch wrappers for using your model in Audacity!
☆181 · Aug 13, 2023 · Updated 2 years ago
YatingMusic / ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
☆275 · Aug 28, 2022 · Updated 3 years ago
RF5 / simple-asgan
Training code and trained checkpoints for ASGAN.
☆62 · Dec 27, 2023 · Updated 2 years ago
eloimoliner / CQTdiff
Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23
☆122 · Mar 14, 2023 · Updated 3 years ago
keonlee9420 / DailyTalk
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
☆259 · Jun 5, 2025 · Updated last year
sp-nitech / diffsptk
A differentiable version of SPTK
☆201 · Jul 14, 2026 · Updated last week
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year
csteinmetz1 / auraloss
Collection of audio-focused loss functions in PyTorch
☆874 · Jul 30, 2024 · Updated last year
ben-hayes / neural-waveshaping-synthesis
efficient neural audio synthesis in the waveform domain
☆191 · Apr 14, 2025 · Updated last year
sony / creativeai
☆79 · Jul 7, 2026 · Updated 2 weeks ago
adobe-research / DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
☆414 · May 30, 2023 · Updated 3 years ago
chomeyama / SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
☆275 · Jul 29, 2023 · Updated 2 years ago
nomonosound / log-wmse-audio-quality
logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…
☆39 · Jun 24, 2025 · Updated last year
rhoposit / multilingual_VQVAE
☆37 · May 8, 2021 · Updated 5 years ago
KunZhou9646 / Mixed_Emotions
☆123 · Oct 24, 2022 · Updated 3 years ago
archinetai / audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
☆2,098 · Jun 12, 2023 · Updated 3 years ago
shang0712 / HierTTS
☆47 · Apr 16, 2023 · Updated 3 years ago
k2kobayashi / crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
☆171 · Jul 25, 2024 · Updated last year
cyrusasfa / meso-dtfa
Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)
☆21 · Jun 30, 2026 · Updated 3 weeks ago
sarulab-speech / multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24 · Mar 23, 2021 · Updated 5 years ago
adrianbarahona / noisebandnet
Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper.
☆39 · Jul 8, 2024 · Updated 2 years ago
acids-ircam / ddsp_pytorch
Implementation of Differentiable Digital Signal Processing (DDSP) in PyTorch
☆518 · Oct 28, 2023 · Updated 2 years ago