SamsungLabs / Undiff

Test code for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", released as supplementary material for the paper accepted to the upcoming Interspeech 2023 conference.

☆20

Alternatives and similar repositories for Undiff

Users who are interested in Undiff are comparing it to the repositories listed below.

WangHelin1997 / Fast-GeCo
Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction
☆39 Updated 6 months ago
exercise-book-yq / Supercodec
☆47 Updated 2 months ago
WangHelin1997 / SoloAudio
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆88 Updated 5 months ago
seongho608 / RingFormer
☆46 Updated 4 months ago
sp-uhh / sgmse_crp
☆23 Updated last year
jjunak-yun / FLowHigh_code
[ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"
☆63 Updated 4 months ago
haoheliu / SemantiCodec
☆43 Updated 11 months ago
philgzl / brever
Speech enhancement in noisy and reverberant environments using deep neural networks
☆20 Updated 2 months ago
anton-jeran / MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
☆49 Updated 2 months ago
gwh22 / LAFMA
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)
☆38 Updated 11 months ago
YangAi520 / NSPP
☆51 Updated 2 years ago
archinetai / aligner-pytorch
Sequence alignment methods with helpers for PyTorch.
☆24 Updated 2 years ago
justinlovelace / SESD
☆61 Updated 7 months ago
ETH-DISCO / discoder
Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025
☆28 Updated 3 months ago
kaistmm / fregrad
☆29 Updated last year
yxlu-0102 / IDEA-TTS
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆26 Updated 2 months ago
XiaoyuBIE1994 / SDCodec
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆33 Updated 2 weeks ago
Takaaki-Saeki / ssl_speech_restoration_v2
☆15 Updated last year
caizexin / GenVC
Self-supervised Generative LM-based Voice Conversion
☆36 Updated last month
SonyResearch / VRVQ
Variable Bitrate Residual Vector Quantization for Audio Coding
☆41 Updated last month
alessandroragano / scoreq
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
☆72 Updated 4 months ago
asappresearch / simple-tts
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
☆53 Updated last year
BakerBunker / FreeV
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆91 Updated 11 months ago
PhonemeHallucinator / Phoneme_Hallucinator
☆46 Updated last year
XZWY / SpatialCodec
Implementation of SpatialCodec.
☆58 Updated last year
tomermistrix / mosnet-speech-enhancement
Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement
☆23 Updated 2 years ago
sony / diffiner
☆62 Updated last year
ftshijt / Interspeech2024_DiscreteSpeechChallenge
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32 Updated last year
Yip-Jia-Qi / codecformer
☆17 Updated 10 months ago
Audio-AGI / FlowSep
Official implementation for FlowSep
☆50 Updated 5 months ago