thanhluantrinh / LDDGAN
☆29Jan 15, 2025Updated last year
Alternatives and similar repositories for LDDGAN
Users that are interested in LDDGAN are comparing it to the libraries listed below
- ☆32Jan 9, 2024Updated 2 years ago
- Repository of published DNN speech separation recipes for a number of datasets☆12Jan 22, 2024Updated 2 years ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆30Jun 17, 2025Updated 7 months ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆13Dec 8, 2021Updated 4 years ago
- Official repository for U-SAM (Interspeech 2025)☆25Jun 3, 2025Updated 8 months ago
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Aug 27, 2023Updated 2 years ago
- ☆15Sep 22, 2025Updated 4 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆38Jan 22, 2026Updated 3 weeks ago
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 3 years ago
- Official code for Accelerating Diffusion Sampling with Optimized Time Steps (CVPR 2024)☆38Mar 11, 2024Updated last year
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆43Jul 25, 2025Updated 6 months ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆46Sep 12, 2024Updated last year
- This is the official repository of a new SOTA diffusion-model-based method for speech enhancement☆41Jul 31, 2024Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆46Nov 19, 2024Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆25Apr 16, 2023Updated 2 years ago
- ☆21Jul 15, 2024Updated last year
- ☆57Apr 22, 2024Updated last year
- ☆41Jan 10, 2025Updated last year
- Landing Page for Divide and Remaster v3☆25Jul 29, 2025Updated 6 months ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated 11 months ago
- ☆43Jan 13, 2025Updated last year
- Generative Modeling with Bayesian Sample Inference☆24May 17, 2025Updated 8 months ago
- Zero-Shot Blind Audio Bandwidth Extension☆26May 25, 2023Updated 2 years ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 8 months ago
- ☆57Apr 24, 2024Updated last year
- [NeurIPS 2024] Official implementation of "Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models."☆55Aug 14, 2025Updated 6 months ago
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆30Jan 13, 2026Updated last month
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆29Jul 24, 2025Updated 6 months ago
- (TASLP 2022) Unsupervised speech enhancement using DVAEs☆23Dec 16, 2024Updated last year
- ☆54Mar 2, 2023Updated 2 years ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Sep 19, 2025Updated 4 months ago
- Official implementation for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 7 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Sep 1, 2023Updated 2 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A toolkit dedicated to speech evaluation.☆24Sep 26, 2024Updated last year
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago