DmitryRyumin / Awesome-Speech-Enhancement
Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.
☆13 Updated 7 months ago
Related projects
Alternatives and complementary repositories for Awesome-Speech-Enhancement
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement ☆33 Updated last month
- [ICASSP 2025] Dynamic Embedding Causal Target Speech Extraction ☆29 Updated last month
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆50 Updated 2 weeks ago
- ☆48 Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh… ☆27 Updated 4 months ago
- ☆64 Updated last year
- ☆15 Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆36 Updated last month
- This is the official repository of a new SOTA diffusion-model-based method for speech enhancement ☆33 Updated 3 months ago
- Fully Quantized Neural Networks For Speech Enhancement ☆60 Updated 9 months ago
- Learning differentiable temporal resolution on time-series data. ☆33 Updated 2 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P… ☆34 Updated 11 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION ☆58 Updated 3 months ago
- This is the official implementation of the LiSenNet ☆15 Updated last week
- PyTorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro… ☆19 Updated 11 months ago
- ☆26 Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction ☆35 Updated last month
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting" ☆16 Updated 3 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models ☆20 Updated 2 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization ☆25 Updated last month
- ☆27 Updated last year
- ☆46 Updated last year
- A toolkit for researchers in multimodal sound separation. ☆16 Updated last year
- ☆32 Updated 2 months ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation" ☆42 Updated 4 months ago
- For students who would like to apply for RA, PhD, or postdoc positions in audio research. ☆24 Updated last month
- COG-MHEAR Audio-Visual Speech Enhancement Challenge ☆34 Updated 8 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆45 Updated 2 years ago
- ☆15 Updated 4 months ago
- Query-conditioned target sound extraction model ☆18 Updated 3 weeks ago