NVIDIA / CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
☆304Updated last year
Alternatives and similar repositories for CleanUNet:
Users that are interested in CleanUNet are comparing it to the libraries listed below
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation☆576Updated 3 weeks ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆362Updated 3 months ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆449Updated 10 months ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆185Updated last year
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks☆213Updated 3 years ago
- Conformer-based Metric GAN for speech enhancement☆341Updated 9 months ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆252Updated 9 months ago
- Variational Bayes HMM over x-vectors diarization☆263Updated last year
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆314Updated last year
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆242Updated last year
- A library for speech data augmentation in time-domain☆655Updated 3 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆367Updated 3 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆262Updated 3 months ago
- An open source dataset for source separation☆405Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆409Updated last year
- Speaker embedding (d-vector) trained with GE2E loss☆276Updated last year
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆206Updated 5 months ago
- see README☆335Updated 6 months ago
- ☆411Updated last year
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.☆325Updated 2 years ago
- End-to-End Neural Diarization☆395Updated 3 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆400Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆131Updated 2 months ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆351Updated 7 months ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆396Updated 3 months ago
- Predicts the level of noise and reverberation on your audiofiles☆144Updated 8 months ago
- Spot the conversation: speaker diarisation in the wild☆134Updated 2 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆173Updated 2 months ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆318Updated 6 months ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆144Updated 2 years ago