chentuochao / Target-Conversation-Extraction
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"
☆38Updated last month
Related projects ⓘ
Alternatives and complementary repositories for Target-Conversation-Extraction
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- Fully Quantized Neural Networks For Speech Enhancement☆60Updated 9 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆27Updated 4 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆36Updated last month
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆57Updated 3 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- ☆32Updated 2 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆30Updated 3 months ago
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆33Updated last month
- This is official repository of new SOTA diffusion models based method for speech enhancement☆33Updated 3 months ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆18Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆33Updated this week
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆37Updated 2 months ago
- ☆64Updated last year
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆29Updated last month
- ☆28Updated 6 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆35Updated last month
- ☆48Updated last year
- ☆68Updated 2 years ago
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.☆22Updated last year
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆69Updated last month
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆28Updated last year
- ☆27Updated 7 months ago
- ☆48Updated 9 months ago
- ☆38Updated 6 months ago
- Dual-Path Attention and Recurrent Network for speech separation☆15Updated 2 months ago
- Query-conditioned target sound extraction model☆17Updated 2 weeks ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS …☆95Updated last month
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆20Updated 2 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆24Updated last month