sp-uhh / storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
☆206Updated 5 months ago
Alternatives and similar repositories for storm:
Users that are interested in storm are comparing it to the libraries listed below
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆173Updated 2 months ago
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio☆165Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models☆208Updated 10 months ago
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation☆255Updated last month
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆86Updated 5 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- It's a repository for implementations of neural speech editing algorithms.☆193Updated last year
- ☆97Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆76Updated last month
- ☆152Updated 2 months ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆187Updated 10 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆146Updated 2 years ago
- Reference-aware automatic speech evaluation toolkit☆142Updated 2 months ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆115Updated 2 years ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆362Updated 3 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆114Updated this week
- Expressive Anechoic Recordings of Speech (EARS)☆148Updated 7 months ago
- Audio Codec Speech processing Universal PERformance Benchmark☆238Updated 3 months ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆98Updated 11 months ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆252Updated 9 months ago
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks☆213Updated 3 years ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆242Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆113Updated 5 months ago
- Unofficial implementation of miipher☆119Updated 10 months ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆140Updated last year
- Target Speaker Extraction Toolkit☆144Updated 2 weeks ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆65Updated last year
- target speaker extraction and verification for multi-talker speech☆172Updated 4 years ago
- Versatile Evaluation of Speech and Audio☆157Updated this week