ZhaoF-i / ASTWS-AECLinks
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
☆16Updated last week
Alternatives and similar repositories for ASTWS-AEC
Users that are interested in ASTWS-AEC are comparing it to the libraries listed below
Sorting:
- ☆17Updated 11 months ago
- offical code for Dense-TSNet☆12Updated 9 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 10 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- Speech Resynthesis and Language Modeling☆20Updated last month
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆15Updated last month
- ☆11Updated 4 months ago
- A Python library for blind source separation.☆4Updated 2 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆12Updated 11 months ago
- Spherical residual vector quantization (SRVQ)☆30Updated 10 months ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆10Updated 8 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆21Updated this week
- ☆16Updated 9 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆13Updated 3 years ago
- ☆15Updated 3 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated last year
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆11Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆13Updated last year
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆17Updated 11 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 3 years ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆21Updated 10 months ago
- A neural speech codec based on discrete WavLM representations☆24Updated 10 months ago
- ☆18Updated 10 months ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18Updated last year
- A simple command line tool to calculate WER for ASR.☆14Updated 8 months ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated 3 months ago
- ☆13Updated 4 months ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year