Audio-WestlakeU/VINP

Audio-WestlakeU / VINP

Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification' [IEEE TASLP]

☆36

Alternatives and similar repositories for VINP

Users who are interested in VINP are comparing it to the repositories listed below.

Audio-WestlakeU / Rec-RIR
Official PyTorch implementation of 'Blind Room Impulse Response Identification via Reverberant Speech Spectrum Reconstruction' [Interspee…
☆34 · Jun 4, 2026 · Updated last month
Audio-WestlakeU / UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).
☆35 · Dec 17, 2024 · Updated last year
Audio-WestlakeU / CleanMel
PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94 · Feb 2, 2026 · Updated 5 months ago
Audio-WestlakeU / RVAE-EM
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…
☆51 · Mar 6, 2025 · Updated last year
Audio-WestlakeU / RCT
This repo gives the code for the official implementation of RCT.
☆13 · Jun 28, 2022 · Updated 4 years ago
Audio-WestlakeU / Mel-McNet
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26 · May 14, 2026 · Updated 2 months ago
Audio-WestlakeU / SAR-SSL
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40 · Oct 11, 2024 · Updated last year
sp-uhh / buddy
BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
☆66 · Oct 18, 2024 · Updated last year
cpystan / PSM
Exploring Unsupervised Cell Recognition with Prior Self-activation Maps (MICCAI 2023)
☆13 · Oct 27, 2023 · Updated 2 years ago
jdonley / Speech-Dereverberation-and-RIR-Estimation
☆15 · Apr 18, 2023 · Updated 3 years ago
Audio-WestlakeU / McNet
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
☆130 · Mar 24, 2023 · Updated 3 years ago
Audio-WestlakeU / NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆363 · Jan 1, 2025 · Updated last year
anton-jeran / AV-RIR
Audio-Visual Room Impulse Response Estimation
☆25 · Jul 22, 2024 · Updated 2 years ago
Audio-WestlakeU / RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175 · Apr 29, 2025 · Updated last year
ArrayDPS / ArrayDPS
☆40 · May 12, 2025 · Updated last year
Audio-WestlakeU / FS-EEND
The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183 · May 7, 2026 · Updated 2 months ago
Audio-WestlakeU / FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159 · Mar 10, 2026 · Updated 4 months ago
Audio-WestlakeU / ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
☆174 · Jun 8, 2026 · Updated last month
Chutlhu / dEchorate
Da - ECHO - RetrievAl - daTasEt
☆36 · Jul 7, 2024 · Updated 2 years ago
anton-jeran / Speech2RIR
This is the official implementation of reverberant speech to room impulse response estimator
☆42 · Aug 7, 2024 · Updated last year
Audio-WestlakeU / pytorch_lightning_template_for_beginners
A PyTorch template for beginners based on pytorch_lightning
☆51 · Feb 1, 2024 · Updated 2 years ago
maj4e / pyrirtool
Measuring room impulse responses with python and sounddevice
☆85 · Jun 30, 2019 · Updated 7 years ago
kyungyunlee / fins
Implementation of FiNS model for RIR estimation
☆38 · Nov 1, 2023 · Updated 2 years ago
facebookresearch / 6DoF-Auraliser
An auralisation system that takes head-worn microphone array recordings as input and renders the audio for binaural playback; taking in…
☆37 · Oct 10, 2023 · Updated 2 years ago
line / WaveTrainerFit
Official implementation of "Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech G…
☆16 · Feb 6, 2026 · Updated 5 months ago
AmphionTeam / FlexiCodec
[ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates
☆51 · Jul 1, 2026 · Updated 3 weeks ago
Ion3rik / dark-velvet-noise-reverb
Dark-velvet-noise reverb: Accurate model for late-reverberation with arbitrary temporal energy decay. Offline implementations in Matlab a…
☆24 · Nov 1, 2024 · Updated last year
xi-j / Mamba-TasNet
☆116 · Oct 1, 2024 · Updated last year
georg-goetz / DecayFitNet
☆26 · Dec 14, 2023 · Updated 2 years ago
smulelabs / windowed-roformer
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45 · Oct 30, 2025 · Updated 8 months ago
Audio-WestlakeU / audiossl
A library built for easier audio self-supervised training and downstream task evaluation
☆140 · Sep 25, 2025 · Updated 10 months ago
Max1Wz / H-GTCRN
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)
☆111 · Mar 13, 2026 · Updated 4 months ago
Xiaobin-Rong / TRT-SE
An example of a speech enhancement model deployed with TensorRT.
☆88 · Mar 24, 2025 · Updated last year
polarch / shoebox-roomsim
A fast Matlab shoebox room simulator using the image source method, tuned for spatial sound processing.
☆34 · Feb 5, 2024 · Updated 2 years ago
XiaoyuBIE1994 / DVAE_SE
(TASLP 2022) Unsupervised speech enhancement using DVAEs
☆23 · Dec 16, 2024 · Updated last year
popcornell / FastMSS
☆33 · Updated this week
sp-uhh / storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
☆256 · Sep 13, 2024 · Updated last year
gdalsanto / rir2fdn
Companion code for the DAFx24 paper RIR2FDN
☆23 · Oct 19, 2024 · Updated last year
JusperLee / SonicSim
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
☆277 · Jan 22, 2025 · Updated last year