DmitryRyumin / Awesome-Speech-Enhancement
Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.
☆17Updated 11 months ago
Alternatives and similar repositories for Awesome-Speech-Enhancement:
Users that are interested in Awesome-Speech-Enhancement are comparing it to the libraries listed below
- List of direct speech-to-speech translation papers.☆37Updated 2 years ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆24Updated last month
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆28Updated 6 months ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 6 months ago
- ☆48Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 2 months ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆45Updated last week
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆35Updated last week
- ☆45Updated 6 months ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 9 months ago
- Query-conditioned target sound extraction model☆20Updated last week
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 7 months ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆2Updated 2 weeks ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆41Updated last year
- Prediction of sound event bounding boxes (SEBBs)☆26Updated 8 months ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆36Updated last year
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆37Updated 9 months ago
- ☆32Updated 9 months ago
- ☆23Updated last year
- ☆64Updated 6 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- ☆65Updated last year
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆27Updated last year
- A pytorch implementation of D3Net.☆11Updated 3 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆51Updated 5 months ago
- TODO☆37Updated last year
- HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)☆80Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆36Updated 3 weeks ago