Repository of published DNN speech separation recipes for a number of datasets
☆12Jan 22, 2024Updated 2 years ago
Alternatives and similar repositories for PubSep
Users that are interested in PubSep are comparing it to the libraries listed below
Sorting:
- Whisper Speech Quality Assessment (WhiSQA)☆16Oct 14, 2025Updated 4 months ago
- A 1D implementation of a deformable convolutional layer in PyTorch with a few tricks.☆46Aug 17, 2023Updated 2 years ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆23Updated this week
- Carnatic singing voice separation trained with in-domain data with leakage☆11Nov 5, 2023Updated 2 years ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 3 years ago
- G.729А audio codec for python 3☆13Mar 18, 2020Updated 5 years ago
- 把 wave-u-net 网络应用于语音增强领域中☆14May 29, 2020Updated 5 years ago
- VAE and STCN with NMF for single-channel speech enhancement☆14Mar 24, 2021Updated 4 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 7 months ago
- pytorch model for contexless-phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Jul 31, 2024Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆46Nov 19, 2024Updated last year
- ☆18Oct 26, 2023Updated 2 years ago
- ☆21Jul 15, 2024Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated 11 months ago
- Pytorch implemention of SDNet☆23Jun 1, 2021Updated 4 years ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆55Aug 15, 2025Updated 6 months ago
- ☆54Mar 2, 2023Updated 3 years ago
- Fast and high quality sample-rate conversion library for Python☆105Oct 12, 2025Updated 4 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Sep 19, 2025Updated 5 months ago
- ☆29Jan 15, 2025Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Dec 14, 2023Updated 2 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- ☆32Jan 9, 2024Updated 2 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆32Mar 14, 2025Updated 11 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆30Jun 17, 2025Updated 8 months ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆38Feb 24, 2025Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆80Aug 20, 2024Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago
- InSales e-commerce platform API bindings☆14Jul 13, 2024Updated last year
- ☆38Jun 5, 2023Updated 2 years ago
- ☆30Jun 12, 2025Updated 8 months ago
- BandIt: Cinematic Audio Source Separation☆155Jul 29, 2025Updated 7 months ago
- ☆38Apr 17, 2025Updated 10 months ago