Audio-WestlakeU / RVAE-EM
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]
☆45 Updated 2 months ago
Alternatives and similar repositories for RVAE-EM
Users who are interested in RVAE-EM are comparing it to the repositories listed below.
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement ☆41 Updated 6 months ago
- ☆21 Updated last year
- Prediction of sound event bounding boxes (SEBBs) ☆28 Updated 10 months ago
- ☆47 Updated 8 months ago
- Spherical residual vector quantization (SRVQ) ☆28 Updated 9 months ago
- Implementation of SpatialCodec. ☆58 Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆33 Updated 9 months ago
- Zero-Shot Blind Audio Bandwidth Extension ☆22 Updated 2 years ago
- ☆65 Updated last year
- ☆51 Updated 2 years ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆33 Updated 2 weeks ago
- A Diffusion Probabilistic Model for Target Sound Extraction ☆38 Updated 8 months ago
- Separate Anything in Audio with Zero Training ☆25 Updated this week
- Fully Quantized Neural Networks For Speech Enhancement ☆61 Updated last year
- ☆47 Updated 2 months ago
- ☆23 Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆39 Updated 6 months ago
- Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverb… ☆17 Updated last week
- Please visit https://thuhcsi.github.io/SnakeGAN/ ☆37 Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆74 Updated 2 weeks ago
- Generation scripts for EARS-WHAM and EARS-Reverb ☆33 Updated last month
- ☆12 Updated 3 weeks ago
- PyTorch implementation of subband decomposition ☆92 Updated 2 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models ☆46 Updated last month
- [ICASSP 2025] Dynamic Embedding Causal Target Speech Extraction ☆3 Updated 2 months ago
- A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025) ☆22 Updated last week
- Algorithm for blind estimation of reverberation time ☆22 Updated 11 months ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement ☆40 Updated 8 months ago
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR". ☆57 Updated last week
- Query-conditioned target sound extraction model ☆23 Updated 2 months ago