jwr1995 / PubSepLinks
Repository of published DNN speech separation recipes for a number of datasets
☆12Updated last year
Alternatives and similar repositories for PubSep
Users who are interested in PubSep are comparing it to the libraries listed below
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆22Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated 2 months ago
- ☆17Updated 10 months ago
- A neural speech codec based on discrete WavLM representations☆24Updated 9 months ago
- ☆17Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆12Updated 10 months ago
- ☆47Updated 8 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 9 months ago
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction☆3Updated 2 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- ☆23Updated last year
- A small tool to calculate the distribution of audio durations in a directory☆14Updated 2 years ago
- ☆26Updated 2 years ago
- ☆12Updated 3 weeks ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆18Updated 2 years ago
- Spherical residual vector quantization (SRVQ)☆28Updated 9 months ago
- ☆11Updated 2 years ago
- Viterbi decoding in PyTorch☆34Updated 3 weeks ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆15Updated 3 months ago
- ☆12Updated last year
- ☆51Updated 2 years ago
- Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆41Updated last year
- A toolkit for researchers in multimodal sound separation.☆16Updated last year
- SRTNet☆24Updated 2 years ago
- ☆14Updated 2 months ago
- An STFT/iSTFT implementation written in PyTorch using 1D convolutions☆28Updated 10 months ago
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.☆29Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆48Updated 8 months ago
- Real-time speech enhancement☆16Updated last year
- ☆21Updated last year