modelscope / ClearerVoice-Studio
ClearVoice
☆13 Updated this week
Related projects
Alternatives and complementary repositories for ClearerVoice-Studio
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting" ☆16 Updated 3 months ago
- Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT" ☆15 Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh… ☆27 Updated 4 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆45 Updated 2 years ago
- acnn for text-independent speaker recognition ☆9 Updated 2 years ago
- Discriminative Training of VBx Diarization ☆18 Updated 2 months ago
- Official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR). ☆16 Updated 3 weeks ago
- Unofficial implementation of SCP-GAN ☆18 Updated last year
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction ☆29 Updated last month
- Official implementation of TSELM: Target speaker extraction using discrete tokens and language models ☆20 Updated 2 months ago
- Unofficial implementation of MFNet, from the paper "A Mask Free Neural Network for Monaural Speech Enhancement" ☆10 Updated 10 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION ☆58 Updated 3 months ago
- PyTorch implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement". ☆11 Updated last year
- Neural network density models for speech separation. ☆20 Updated 3 years ago
- Unofficial PyTorch Lightning implementation of "Real-time Speech Frequency Bandwidth Extension" ☆28 Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆50 Updated 2 weeks ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆30 Updated 3 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification ☆19 Updated last year