eloimoliner/CQTdiff

Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23

☆122

Alternatives and similar repositories for CQTdiff

Users who are interested in CQTdiff are comparing it to the repositories listed below.

iamycy / diffwave-sr
☆87 · May 21, 2023 · Updated 3 years ago
eloimoliner / audio-inpainting-diffusion
☆74 · Apr 4, 2024 · Updated 2 years ago
archinetai / cqt-pytorch
An invertible and differentiable implementation of the Constant-Q Transform (CQT).
☆73 · Dec 9, 2022 · Updated 3 years ago
eloimoliner / CQT_pytorch
Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters
☆36 · Jul 7, 2026 · Updated 2 weeks ago
eloimoliner / BABE2-music-restoration
☆61 · Apr 22, 2024 · Updated 2 years ago
chomeyama / DualCycleGAN
Official implementation of DualCycleGAN for nonparallel audio super resolution
☆54 · Nov 1, 2022 · Updated 3 years ago
sp-nitech / diffsptk
A differentiable version of SPTK
☆201 · Jul 14, 2026 · Updated last week
eloimoliner / unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
☆12 · May 31, 2022 · Updated 4 years ago
AaltoAcousticsLab / aalto-papers
A list of publications that have accompanying open-source code
☆22 · Oct 30, 2023 · Updated 2 years ago
eloimoliner / BABE
Zero-Shot Blind Audio Bandwidth Extension
☆27 · May 25, 2023 · Updated 3 years ago
eloimoliner / bwe_historical_recordings
Bandwidth Extension of Historical Recordings using Generative Adversarial Networks
☆38 · May 25, 2023 · Updated 3 years ago
maum-ai / phaseaug
ICASSP 2023 Accepted
☆191 · May 6, 2024 · Updated 2 years ago
yuan1615 / AdaVocoder
Adaptive Vocoder for Custom Voice
☆61 · Sep 22, 2022 · Updated 3 years ago
brentspell / torch-yin
Yin pitch estimator in PyTorch
☆119 · Nov 7, 2022 · Updated 3 years ago
junjun3518 / alias-free-torch
Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample
☆101 · Jul 26, 2022 · Updated 3 years ago
neoncloud / mdctGAN
Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"
☆66 · Jun 3, 2023 · Updated 3 years ago
YangAi520 / NSPP
☆55 · Mar 2, 2023 · Updated 3 years ago
haiciyang / LaDiffCodec
ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
☆56 · Nov 16, 2025 · Updated 8 months ago
rishikksh20 / HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
☆160 · Jul 16, 2022 · Updated 4 years ago
X-LANCE / UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
☆130 · Jun 11, 2024 · Updated 2 years ago
csteinmetz1 / auraloss
Collection of audio-focused loss functions in PyTorch
☆874 · Jul 30, 2024 · Updated last year
interactiveaudiolab / penn
Pitch Estimating Neural Networks (PENN)
☆277 · Apr 2, 2025 · Updated last year
XZWY / SpatialCodec
Implementation of SpatialCodec.
☆71 · Sep 23, 2023 · Updated 2 years ago
avi33 / StyleMelGan-Unofficial
☆23 · Sep 14, 2021 · Updated 4 years ago
yangdongchao / AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
☆674 · Dec 27, 2023 · Updated 2 years ago
haoheliu / ssr_eval
Evaluation and Benchmarking of Speech Super-resolution Methods
☆157 · Jun 17, 2022 · Updated 4 years ago
ljuvela / GELP
☆27 · Apr 21, 2021 · Updated 5 years ago
csteinmetz1 / dasp-pytorch
Differentiable audio signal processors in PyTorch
☆297 · Dec 4, 2023 · Updated 2 years ago
sony / diffiner
☆68 · Aug 16, 2023 · Updated 2 years ago
X-LANCE / VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
☆376 · Sep 3, 2024 · Updated last year
SonyResearch / VRVQ
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54 · May 1, 2025 · Updated last year
slp-rl / aero
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
☆244 · May 1, 2025 · Updated last year
Aria-K-Alethia / laughter-synthesis
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77 · Jul 16, 2023 · Updated 3 years ago
jyhan03 / dpccn
This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.
☆13 · Dec 8, 2021 · Updated 4 years ago
sp-uhh / sgmse_crp
☆32 · Jan 9, 2024 · Updated 2 years ago
y-chan / hifi-gan-misrnet
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15 · Mar 21, 2023 · Updated 3 years ago
YatingMusic / ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
☆275 · Aug 28, 2022 · Updated 3 years ago
lifeiteng / VoiceBox
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29 · Aug 4, 2023 · Updated 2 years ago
DiffAPF / torchlpc
Fast and differentiable time domain all-pole filter in PyTorch.
☆72 · Feb 5, 2026 · Updated 5 months ago