Read articles and explore effectiveness metrics for speech enhancement methods. The repository pairs them with code implementations for better understanding and helps you keep up with advances in speech enhancement. Don't forget to ⭐ if you find it helpful.
☆26 · Apr 19, 2024 · Updated last year
Alternatives and similar repositories for Awesome-Speech-Enhancement
Users who are interested in Awesome-Speech-Enhancement are comparing it to the repositories listed below
- FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and g… ☆16 · May 18, 2024 · Updated last year
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily ☆15 · Jan 6, 2025 · Updated last year
- ☆10 · Jun 24, 2021 · Updated 4 years ago
- ☆11 · Nov 7, 2024 · Updated last year
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment. ☆25 · Nov 25, 2025 · Updated 3 months ago
- Official implementation of WildFX Dataset Generating pipeline. ☆15 · Oct 21, 2025 · Updated 4 months ago
- Dual-Path Attention and Recurrent Network for speech separation ☆19 · Sep 12, 2024 · Updated last year
- ☆23 · Feb 2, 2022 · Updated 4 years ago
- Speech Separation ☆18 · Mar 7, 2024 · Updated 2 years ago
- VAE and STCN with NMF for single-channel speech enhancement ☆14 · Mar 24, 2021 · Updated 4 years ago
- Improved Speech Enhancement GANs ☆12 · Jun 24, 2020 · Updated 5 years ago
- Acoustic Neighbor Embeddings ☆28 · Jul 13, 2025 · Updated 7 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆26 · Mar 27, 2024 · Updated last year
- In-car multi-channel speech transcription system of AISHELL-5. ☆41 · Jun 9, 2025 · Updated 9 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge. ☆79 · May 21, 2025 · Updated 9 months ago
- Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement ☆25 · Jan 23, 2022 · Updated 4 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024 ☆32 · Mar 14, 2025 · Updated 11 months ago
- This is the official implementation of the LiSenNet ☆150 · Nov 15, 2024 · Updated last year
- Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems ☆82 · Jan 25, 2026 · Updated last month
- A research project and comparative study on various Active Noise Cancellation Algorithms like FxLMS, EMFN, Chebyshev filter and Hammerste… ☆10 · Jul 3, 2022 · Updated 3 years ago
- ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore t… ☆517 · May 5, 2025 · Updated 10 months ago
- ☆73 · Sep 6, 2022 · Updated 3 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024) ☆251 · Dec 12, 2025 · Updated 2 months ago
- A Python algorithm to change the pitch of the voice in real time ☆13 · Dec 13, 2020 · Updated 5 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks ☆22 · Jan 19, 2026 · Updated last month
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆34 · Jun 25, 2021 · Updated 4 years ago
- A PyTorch implementation of D3Net. ☆11 · Aug 8, 2021 · Updated 4 years ago
- Music2Emo: Towards Unified Music Emotion Recognition across Dimensional and Categorical Models ☆45 · Aug 24, 2025 · Updated 6 months ago
- misc programming languages ☆12 · Jan 10, 2023 · Updated 3 years ago
- TensorFlow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis ☆42 · Oct 3, 2020 · Updated 5 years ago
- [AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny 300M model! ☆97 · Updated this week
- Official data preparation scripts for the URGENT 2024 Challenge ☆87 · May 21, 2025 · Updated 9 months ago
- Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network". ☆37 · Jul 19, 2020 · Updated 5 years ago
- Dynamic and static models for real-time facial emotion recognition ☆182 · Aug 2, 2024 · Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge ☆45 · Feb 17, 2026 · Updated 2 weeks ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming ☆15 · Aug 20, 2024 · Updated last year
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols ☆17 · Nov 19, 2025 · Updated 3 months ago
- Cardiovascular Disease Classification Employing Empirical Mode Decomposition (EMD) of Modified ECG ☆12 · Oct 6, 2023 · Updated 2 years ago
- PAGAN: a phase-adapted GAN for speech enhancement ☆36 · Sep 17, 2020 · Updated 5 years ago