DmitryRyumin / Awesome-Speech-Enhancement
Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.
☆19Updated last year
Alternatives and similar repositories for Awesome-Speech-Enhancement
Users who are interested in Awesome-Speech-Enhancement are comparing it to the repositories listed below
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 9 months ago
- List of direct speech-to-speech translation papers.☆37Updated 2 years ago
- ☆66Updated 9 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆59Updated 8 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆40Updated last month
- Official repository of NeXt-TDNN for speaker verification☆73Updated 8 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆33Updated 9 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆59Updated last month
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆20Updated last year
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆61Updated this week
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆33Updated 3 months ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆50Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆48Updated 8 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆46Updated 2 months ago
- Query-conditioned target sound extraction model☆23Updated 3 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 10 months ago
- ☆52Updated 2 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction☆3Updated 3 months ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆48Updated last year
- ☆14Updated 3 years ago
- Prediction of sound event bounding boxes (SEBBs)☆28Updated 10 months ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆19Updated 2 weeks ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated last year
- This repository contains the official PyTorch implementation and pre-trained models for MR-RawNet.☆16Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated last month