DmitryRyumin / Awesome-Speech-EnhancementLinks
Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.
☆21Updated last year
Alternatives and similar repositories for Awesome-Speech-Enhancement
Users that are interested in Awesome-Speech-Enhancement are comparing it to the libraries listed below
Sorting:
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆70Updated 4 months ago
- ☆54Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆41Updated 4 months ago
- Streaming Audiotransformers for online Audio tagging☆47Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Prediction of sound event bounding boxes (SEBBs)☆29Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆57Updated 3 months ago
- ☆65Updated 2 years ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆43Updated 4 months ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆49Updated last month
- ☆50Updated last year
- ☆32Updated 10 months ago
- ☆28Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆46Updated 6 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆38Updated 4 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆68Updated last year
- ☆63Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated last year
- ☆57Updated 2 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆29Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆48Updated 5 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆76Updated 4 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆63Updated last year
- A toolkit dedicate for speech evaluation.☆23Updated last year
- (ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement☆66Updated 2 months ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Updated 2 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated last year