Audio-WestlakeU/RVAE-EM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Audio-WestlakeU/RVAE-EM)

Audio-WestlakeU / RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

☆51

Alternatives and similar repositories for RVAE-EM

Users that are interested in RVAE-EM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Audio-WestlakeU / VINP
View on GitHub
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverb…
☆36Feb 23, 2026Updated 5 months ago
DiegoLeon96 / Neural-Speech-Dereverberation
View on GitHub
Machine and Deep Learning models for speech dereverberation
☆120Feb 21, 2022Updated 4 years ago
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
View on GitHub
☆52Sep 10, 2024Updated last year
Audio-WestlakeU / McNet
View on GitHub
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
☆130Mar 24, 2023Updated 3 years ago
Audio-WestlakeU / UMA-ASR
View on GitHub
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
☆35Dec 17, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Audio-WestlakeU / SAR-SSL
View on GitHub
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40Oct 11, 2024Updated last year
GT-KIM / specmix
View on GitHub
This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…
☆11Sep 27, 2022Updated 3 years ago
Audio-WestlakeU / RealMAN
View on GitHub
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175Apr 29, 2025Updated last year
sp-uhh / 2sderev
View on GitHub
Two-stage Dereverberation Algorithm using DNN-supported multi-channel linear filtering and single-channel non-linear post-filtering
☆15Jan 10, 2025Updated last year
ssi-research / FQSE
View on GitHub
Fully Quantized Neural Networks For Speech Enhancement
☆65Feb 15, 2024Updated 2 years ago
junyuchen-cjy / DTTNet-Pytorch
View on GitHub
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
☆109Mar 19, 2024Updated 2 years ago
tencent-ailab / FRA-RIR
View on GitHub
☆214Dec 4, 2023Updated 2 years ago
aleXiehta / Causal-SE
View on GitHub
Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"
☆28Feb 26, 2023Updated 3 years ago
Audio-WestlakeU / CleanMel
View on GitHub
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94Feb 2, 2026Updated 5 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Beilong-Tang / TSELM
View on GitHub
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60Apr 14, 2025Updated last year
Xiaobin-Rong / lite-rtse
View on GitHub
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14Nov 19, 2023Updated 2 years ago
HaoFengyuan / X-TF-GridNet
View on GitHub
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…
☆115Sep 2, 2025Updated 10 months ago
JaeBinCHA7 / DEMUCS-for-Speech-Enhancement
View on GitHub
We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.
☆34Nov 8, 2023Updated 2 years ago
JusperLee / TFACM
View on GitHub
☆24Jul 16, 2025Updated last year
sp-uhh / derevdps
View on GitHub
☆13Jan 12, 2024Updated 2 years ago
ioyy900205 / MFNet
View on GitHub
This repo provides the processed samples of the manuscript "a Mask Free Neural Network for Monaural Speech Enhancement", which was accep…
☆37May 22, 2023Updated 3 years ago
Audio-WestlakeU / Rec-RIR
View on GitHub
Official PyTorch implementation of 'Blind Room Impulse Response Identification via Reverberant Speech Spectrum Reconstruction' [Interspee…
☆34Jun 4, 2026Updated last month
lixilinx / FilterbanksBestPractices
View on GitHub
Tips for best practices with filterbanks
☆45May 31, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cpystan / PSM
View on GitHub
Exploring Unsupervised Cell Recognition with Prior Self-activation Maps (MICCAI 2023)
☆13Oct 27, 2023Updated 2 years ago
cogmhear / avse_challenge
View on GitHub
COG-MHEAR Audio-Visual Speech Enhancement Challenge
☆48Feb 17, 2026Updated 5 months ago
nanless / universal-speech-enhancement
View on GitHub
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆83Jul 29, 2024Updated 2 years ago
merlresearch / sebbs
View on GitHub
Prediction of sound event bounding boxes (SEBBs)
☆35Aug 2, 2024Updated last year
facebookresearch / ears_dataset
View on GitHub
Expressive Anechoic Recordings of Speech (EARS)
☆221Jun 25, 2024Updated 2 years ago
XinleiRen / MTFAA-Net
View on GitHub
An unofficial non-causal Tensorflow implementation of "Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Spee…
☆14Dec 27, 2022Updated 3 years ago
chentuochao / Target-Conversation-Extraction
View on GitHub
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…
☆58Aug 15, 2025Updated 11 months ago
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
Mddct / usm-tokenizer
View on GitHub
semantic tokenizer for speech and music
☆20Jul 6, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Audio-WestlakeU / FullSubNet
View on GitHub
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
☆608Aug 19, 2023Updated 2 years ago
urgent-challenge / urgent2024_challenge
View on GitHub
Official data preparation scripts for the URGENT 2024 Challenge
☆90May 21, 2025Updated last year
microsoft / EchoDensity
View on GitHub
Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses
☆15Mar 18, 2020Updated 6 years ago
donghoney0416 / DeFTAN-II
View on GitHub
Official page of "DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing", IEEE/ACM Transactions on Audio, Speech,…
☆34Nov 21, 2024Updated last year
jdonley / Speech-Dereverberation-and-RIR-Estimation
View on GitHub
☆15Apr 18, 2023Updated 3 years ago
JusperLee / SPMamba
View on GitHub
☆227Dec 5, 2024Updated last year
SoulProficiency / speechseparation-Sandglasset
View on GitHub
☆13Jun 24, 2021Updated 5 years ago